Picture for Qingpei Guo

Qingpei Guo

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding

Add code
Oct 07, 2024
Figure 1 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 2 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 3 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 4 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Viaarxiv icon

Social Debiasing for Fair Multi-modal LLMs

Add code
Aug 13, 2024
Viaarxiv icon

Hummer: Towards Limited Competitive Preference Dataset

Add code
May 21, 2024
Viaarxiv icon

SHE-Net: Syntax-Hierarchy-Enhanced Text-Video Retrieval

Add code
Apr 22, 2024
Viaarxiv icon

M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining

Add code
Feb 04, 2024
Viaarxiv icon

M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval

Add code
Jan 31, 2024
Viaarxiv icon

SNP-S3: Shared Network Pre-training and Significant Semantic Strengthening for Various Video-Text Tasks

Add code
Jan 31, 2024
Viaarxiv icon

Knowledge-enhanced Multi-perspective Video Representation Learning for Scene Recognition

Add code
Jan 09, 2024
Viaarxiv icon

SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment

Add code
Jan 04, 2024
Viaarxiv icon

Text as Image: Learning Transferable Adapter for Multi-Label Classification

Add code
Dec 07, 2023
Viaarxiv icon