Picture for Junfu Pu

Junfu Pu

Verb Mirage: Unveiling and Assessing Verb Concept Hallucinations in Multimodal Large Language Models

Add code
Dec 06, 2024
Viaarxiv icon

mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA

Add code
Nov 22, 2024
Figure 1 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Figure 2 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Figure 3 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Figure 4 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Viaarxiv icon

Taming Rectified Flow for Inversion and Editing

Add code
Nov 07, 2024
Figure 1 for Taming Rectified Flow for Inversion and Editing
Figure 2 for Taming Rectified Flow for Inversion and Editing
Figure 3 for Taming Rectified Flow for Inversion and Editing
Figure 4 for Taming Rectified Flow for Inversion and Editing
Viaarxiv icon

SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses

Add code
Aug 07, 2024
Figure 1 for SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
Figure 2 for SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
Figure 3 for SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
Figure 4 for SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
Viaarxiv icon

How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?

Add code
Jul 10, 2024
Viaarxiv icon

Music-driven Dance Regeneration with Controllable Key Pose Constraints

Add code
Jul 08, 2022
Figure 1 for Music-driven Dance Regeneration with Controllable Key Pose Constraints
Figure 2 for Music-driven Dance Regeneration with Controllable Key Pose Constraints
Figure 3 for Music-driven Dance Regeneration with Controllable Key Pose Constraints
Figure 4 for Music-driven Dance Regeneration with Controllable Key Pose Constraints
Viaarxiv icon

Self-Supervised Learning of Music-Dance Representation through Explicit-Implicit Rhythm Synchronization

Add code
Jul 07, 2022
Figure 1 for Self-Supervised Learning of Music-Dance Representation through Explicit-Implicit Rhythm Synchronization
Figure 2 for Self-Supervised Learning of Music-Dance Representation through Explicit-Implicit Rhythm Synchronization
Figure 3 for Self-Supervised Learning of Music-Dance Representation through Explicit-Implicit Rhythm Synchronization
Figure 4 for Self-Supervised Learning of Music-Dance Representation through Explicit-Implicit Rhythm Synchronization
Viaarxiv icon

Improving Sign Language Translation with Monolingual Data by Sign Back-Translation

Add code
May 26, 2021
Figure 1 for Improving Sign Language Translation with Monolingual Data by Sign Back-Translation
Figure 2 for Improving Sign Language Translation with Monolingual Data by Sign Back-Translation
Figure 3 for Improving Sign Language Translation with Monolingual Data by Sign Back-Translation
Figure 4 for Improving Sign Language Translation with Monolingual Data by Sign Back-Translation
Viaarxiv icon

Boosting Continuous Sign Language Recognition via Cross Modality Augmentation

Add code
Oct 11, 2020
Figure 1 for Boosting Continuous Sign Language Recognition via Cross Modality Augmentation
Figure 2 for Boosting Continuous Sign Language Recognition via Cross Modality Augmentation
Figure 3 for Boosting Continuous Sign Language Recognition via Cross Modality Augmentation
Figure 4 for Boosting Continuous Sign Language Recognition via Cross Modality Augmentation
Viaarxiv icon

Global-local Enhancement Network for NMFs-aware Sign Language Recognition

Add code
Aug 24, 2020
Figure 1 for Global-local Enhancement Network for NMFs-aware Sign Language Recognition
Figure 2 for Global-local Enhancement Network for NMFs-aware Sign Language Recognition
Figure 3 for Global-local Enhancement Network for NMFs-aware Sign Language Recognition
Figure 4 for Global-local Enhancement Network for NMFs-aware Sign Language Recognition
Viaarxiv icon