Picture for Daoyuan Chen

Daoyuan Chen

HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data

Add code
Dec 23, 2024
Viaarxiv icon

ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction

Add code
Dec 18, 2024
Viaarxiv icon

Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models

Add code
Aug 09, 2024
Figure 1 for Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models
Figure 2 for Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models
Figure 3 for Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models
Figure 4 for Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models
Viaarxiv icon

Data-Juicer Sandbox: A Comprehensive Suite for Multimodal Data-Model Co-development

Add code
Jul 16, 2024
Viaarxiv icon

The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective

Add code
Jul 11, 2024
Viaarxiv icon

Data Mixing Made Efficient: A Bivariate Scaling Law for Language Model Pretraining

Add code
May 23, 2024
Viaarxiv icon

Dynamic Demonstration Retrieval and Cognitive Understanding for Emotional Support Conversation

Add code
Apr 03, 2024
Viaarxiv icon

ChartThinker: A Contextual Chain-of-Thought Approach to Optimized Chart Summarization

Add code
Mar 17, 2024
Viaarxiv icon

AgentScope: A Flexible yet Robust Multi-Agent Platform

Add code
Feb 21, 2024
Viaarxiv icon

On the Convergence of Zeroth-Order Federated Tuning for Large Language Models

Add code
Feb 20, 2024
Viaarxiv icon