Picture for Di Zhang

Di Zhang

DMQR-RAG: Diverse Multi-Query Rewriting for RAG

Add code
Nov 20, 2024
Viaarxiv icon

Kwai-STaR: Transform LLMs into State-Transition Reasoners

Add code
Nov 07, 2024
Viaarxiv icon

Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content

Add code
Oct 10, 2024
Viaarxiv icon

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Add code
Oct 03, 2024
Viaarxiv icon

Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding

Add code
Sep 29, 2024
Viaarxiv icon

Towards Unified 3D Hair Reconstruction from Single-View Portraits

Add code
Sep 25, 2024
Figure 1 for Towards Unified 3D Hair Reconstruction from Single-View Portraits
Figure 2 for Towards Unified 3D Hair Reconstruction from Single-View Portraits
Figure 3 for Towards Unified 3D Hair Reconstruction from Single-View Portraits
Figure 4 for Towards Unified 3D Hair Reconstruction from Single-View Portraits
Viaarxiv icon

SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs

Add code
Aug 21, 2024
Viaarxiv icon

Asymmetric Graph Error Control with Low Complexity in Causal Bandits

Add code
Aug 20, 2024
Viaarxiv icon

ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area

Add code
Aug 16, 2024
Viaarxiv icon

Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM

Add code
Aug 14, 2024
Viaarxiv icon