Picture for Zixu Cheng

Zixu Cheng

V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning

Add code
Mar 14, 2025
Viaarxiv icon

CoS: Chain-of-Shot Prompting for Long Video Understanding

Add code
Feb 10, 2025
Figure 1 for CoS: Chain-of-Shot Prompting for Long Video Understanding
Figure 2 for CoS: Chain-of-Shot Prompting for Long Video Understanding
Figure 3 for CoS: Chain-of-Shot Prompting for Long Video Understanding
Figure 4 for CoS: Chain-of-Shot Prompting for Long Video Understanding
Viaarxiv icon

INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation

Add code
Jan 30, 2025
Figure 1 for INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation
Figure 2 for INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation
Figure 3 for INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation
Figure 4 for INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation
Viaarxiv icon

SHINE: Saliency-aware HIerarchical NEgative Ranking for Compositional Temporal Grounding

Add code
Jul 06, 2024
Viaarxiv icon