Picture for Li Su

Li Su

Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation

Add code
Mar 25, 2025
Viaarxiv icon

Zema Dataset: A Comprehensive Study of Yaredawi Zema with a Focus on Horologium Chants

Add code
Dec 25, 2024
Figure 1 for Zema Dataset: A Comprehensive Study of Yaredawi Zema with a Focus on Horologium Chants
Figure 2 for Zema Dataset: A Comprehensive Study of Yaredawi Zema with a Focus on Horologium Chants
Figure 3 for Zema Dataset: A Comprehensive Study of Yaredawi Zema with a Focus on Horologium Chants
Figure 4 for Zema Dataset: A Comprehensive Study of Yaredawi Zema with a Focus on Horologium Chants
Viaarxiv icon

Computational Analysis of Yaredawi YeZema Silt in Ethiopian Orthodox Tewahedo Church Chants

Add code
Dec 25, 2024
Viaarxiv icon

Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning

Add code
Dec 18, 2024
Viaarxiv icon

Distortion Recovery: A Two-Stage Method for Guitar Effect Removal

Add code
Jul 23, 2024
Figure 1 for Distortion Recovery: A Two-Stage Method for Guitar Effect Removal
Figure 2 for Distortion Recovery: A Two-Stage Method for Guitar Effect Removal
Figure 3 for Distortion Recovery: A Two-Stage Method for Guitar Effect Removal
Figure 4 for Distortion Recovery: A Two-Stage Method for Guitar Effect Removal
Viaarxiv icon

Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning

Add code
Jul 16, 2024
Figure 1 for Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning
Figure 2 for Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning
Figure 3 for Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning
Figure 4 for Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning
Viaarxiv icon

A Study on Synthesizing Expressive Violin Performances: Approaches and Comparisons

Add code
Jun 26, 2024
Figure 1 for A Study on Synthesizing Expressive Violin Performances: Approaches and Comparisons
Figure 2 for A Study on Synthesizing Expressive Violin Performances: Approaches and Comparisons
Figure 3 for A Study on Synthesizing Expressive Violin Performances: Approaches and Comparisons
Figure 4 for A Study on Synthesizing Expressive Violin Performances: Approaches and Comparisons
Viaarxiv icon

MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing

Add code
Jun 10, 2024
Figure 1 for MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing
Figure 2 for MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing
Figure 3 for MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing
Figure 4 for MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing
Viaarxiv icon

Context-aware Difference Distilling for Multi-change Captioning

Add code
May 31, 2024
Viaarxiv icon

BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer

Add code
Jan 05, 2024
Viaarxiv icon