Yifei Huang

Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model

Dec 30, 2024

CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding

Dec 16, 2024

Imitation Learning with Limited Actions via Diffusion Planners and Deep Koopman Controllers

Oct 10, 2024

Pre-Training for 3D Hand Pose Estimation with Contrastive Learning on Large-Scale Hand Images in the Wild

Sep 15, 2024

ActionVOS: Actions as Prompts for Video Object Segmentation

Jul 10, 2024

Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition

Jul 09, 2024

EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation

Jun 27, 2024

TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation

Apr 18, 2024

EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World

Mar 24, 2024

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Mar 22, 2024