Xi Tang

Adaptive Keyframe Sampling for Long Video Understanding

Feb 28, 2025

IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning

Feb 04, 2025

Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction

Jan 19, 2025

Artemis: Towards Referential Understanding in Complex Videos

Jun 01, 2024

ChatterBox: Multi-round Multimodal Referring and Grounding

Jan 24, 2024