Yixing Peng

HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding

Jan 25, 2025
Via arXiv

Cross-Modal Adaptive Dual Association for Text-to-Image Person Retrieval

Dec 04, 2023
Via arXiv