Picture for Pengpeng Zeng

Pengpeng Zeng

Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters Themselves

Add code
Dec 16, 2024
Viaarxiv icon

GT23D-Bench: A Comprehensive General Text-to-3D Generation Benchmark

Add code
Dec 13, 2024
Viaarxiv icon

SeMv-3D: Towards Semantic and Mutil-view Consistency simultaneously for General Text-to-3D Generation with Triplane Priors

Add code
Oct 10, 2024
Viaarxiv icon

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Add code
Sep 09, 2024
Figure 1 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Figure 2 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Figure 3 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Figure 4 for MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct
Viaarxiv icon

Text-Video Retrieval with Global-Local Semantic Consistent Learning

Add code
May 21, 2024
Viaarxiv icon

Context-based Transfer and Efficient Iterative Learning for Unbiased Scene Graph Generation

Add code
Dec 29, 2023
Viaarxiv icon

ProS: Prompting-to-simulate Generalized knowledge for Universal Cross-Domain Retrieval

Add code
Dec 19, 2023
Viaarxiv icon

Generalized Unbiased Scene Graph Generation

Add code
Aug 09, 2023
Viaarxiv icon

Visual Commonsense-aware Representation Network for Video Captioning

Add code
Nov 17, 2022
Viaarxiv icon

Progressive Tree-Structured Prototype Network for End-to-End Image Captioning

Add code
Nov 17, 2022
Viaarxiv icon