Picture for Alex Jinpeng Wang

Alex Jinpeng Wang

TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation

Add code
Feb 11, 2025
Viaarxiv icon

Vision-centric Token Compression in Large Language Model

Add code
Feb 04, 2025
Viaarxiv icon

Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning

Add code
Jun 04, 2024
Viaarxiv icon

COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training

Add code
Jan 01, 2024
Viaarxiv icon

Parrot Captions Teach CLIP to Spot Text

Add code
Dec 28, 2023
Viaarxiv icon

UniVTG: Towards Unified Video-Language Temporal Grounding

Add code
Aug 18, 2023
Viaarxiv icon

Too Large; Data Reduction for Vision-Language Pre-Training

Add code
Jun 01, 2023
Figure 1 for Too Large; Data Reduction for Vision-Language Pre-Training
Figure 2 for Too Large; Data Reduction for Vision-Language Pre-Training
Figure 3 for Too Large; Data Reduction for Vision-Language Pre-Training
Figure 4 for Too Large; Data Reduction for Vision-Language Pre-Training
Viaarxiv icon

Position-guided Text Prompt for Vision-Language Pre-training

Add code
Dec 19, 2022
Viaarxiv icon

Egocentric Video-Language Pretraining @ Ego4D Challenge 2022

Add code
Jul 04, 2022
Figure 1 for Egocentric Video-Language Pretraining @ Ego4D Challenge 2022
Figure 2 for Egocentric Video-Language Pretraining @ Ego4D Challenge 2022
Figure 3 for Egocentric Video-Language Pretraining @ Ego4D Challenge 2022
Figure 4 for Egocentric Video-Language Pretraining @ Ego4D Challenge 2022
Viaarxiv icon

Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022

Add code
Jul 04, 2022
Figure 1 for Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022
Figure 2 for Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022
Figure 3 for Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022
Figure 4 for Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022
Viaarxiv icon