Picture for Jilan Xu

Jilan Xu

Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model

Add code
Dec 30, 2024
Viaarxiv icon

CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding

Add code
Dec 16, 2024
Figure 1 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Figure 2 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Figure 3 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Figure 4 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Viaarxiv icon

EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation

Add code
Jun 27, 2024
Viaarxiv icon

Concept-Attention Whitening for Interpretable Skin Lesion Diagnosis

Add code
Apr 09, 2024
Viaarxiv icon

QMix: Quality-aware Learning with Mixed Noise for Robust Retinal Disease Diagnosis

Add code
Apr 08, 2024
Viaarxiv icon

EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World

Add code
Mar 24, 2024
Viaarxiv icon

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Add code
Mar 22, 2024
Viaarxiv icon

Advancing COVID-19 Detection in 3D CT Scans

Add code
Mar 18, 2024
Figure 1 for Advancing COVID-19 Detection in 3D CT Scans
Figure 2 for Advancing COVID-19 Detection in 3D CT Scans
Figure 3 for Advancing COVID-19 Detection in 3D CT Scans
Viaarxiv icon

Domain Adaptation Using Pseudo Labels for COVID-19 Detection

Add code
Mar 18, 2024
Viaarxiv icon

Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding

Add code
Mar 14, 2024
Viaarxiv icon