Picture for Guo Chen

Guo Chen

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Add code
Apr 10, 2025
Viaarxiv icon

EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing

Add code
Mar 30, 2025
Viaarxiv icon

An Egocentric Vision-Language Model based Portable Real-time Smart Assistant

Add code
Mar 06, 2025
Viaarxiv icon

Token-Efficient Long Video Understanding for Multimodal LLMs

Add code
Mar 06, 2025
Viaarxiv icon

Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning

Add code
Mar 02, 2025
Viaarxiv icon

Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model

Add code
Dec 30, 2024
Figure 1 for Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
Figure 2 for Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
Figure 3 for Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
Figure 4 for Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
Viaarxiv icon

CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding

Add code
Dec 16, 2024
Figure 1 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Figure 2 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Figure 3 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Figure 4 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Viaarxiv icon

FPE-LLM: Highly Intelligent Time-Series Forecasting and Language Interaction LLM in Energy Systems

Add code
Oct 30, 2024
Figure 1 for FPE-LLM: Highly Intelligent Time-Series Forecasting and Language Interaction LLM in Energy Systems
Figure 2 for FPE-LLM: Highly Intelligent Time-Series Forecasting and Language Interaction LLM in Energy Systems
Figure 3 for FPE-LLM: Highly Intelligent Time-Series Forecasting and Language Interaction LLM in Energy Systems
Figure 4 for FPE-LLM: Highly Intelligent Time-Series Forecasting and Language Interaction LLM in Energy Systems
Viaarxiv icon

TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation

Add code
Oct 02, 2024
Viaarxiv icon

SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios

Add code
Oct 02, 2024
Figure 1 for SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
Figure 2 for SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
Figure 3 for SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
Figure 4 for SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
Viaarxiv icon