Dingjie Song

Aligning Multimodal LLM with Human Preference: A Survey

Mar 18, 2025
Via arXiv

A Survey on Post-training of Large Language Models

Mar 08, 2025
Via arXiv

From Correctness to Comprehension: AI Agents for Personalized Error Diagnosis in Education

Feb 19, 2025
Via arXiv

On the Compositional Generalization of Multimodal LLMs for Medical Imaging

Dec 28, 2024
Via arXiv

BlenderLLM: Training Large Language Models for Computer-Aided Design with Self-improvement

Dec 16, 2024
Via arXiv

Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination

Nov 06, 2024
Via arXiv

Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs

Sep 17, 2024
Via arXiv

LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture

Sep 04, 2024
Via arXiv

MileBench: Benchmarking MLLMs in Long Context

Apr 29, 2024
Via arXiv

HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs

Nov 16, 2023
Via arXiv