Picture for Yihe Deng

Yihe Deng

Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning

Add code
Oct 29, 2024
Viaarxiv icon

MIRAI: Evaluating LLM Agents for Event Forecasting

Add code
Jul 01, 2024
Viaarxiv icon

Enhancing Large Vision Language Models with Self-Training on Image Comprehension

Add code
May 30, 2024
Viaarxiv icon

Mitigating Object Hallucination in Large Vision-Language Models via Classifier-Free Guidance

Add code
Feb 13, 2024
Viaarxiv icon

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Add code
Jan 02, 2024
Viaarxiv icon

Risk Bounds of Accelerated SGD for Overparameterized Linear Regression

Add code
Nov 23, 2023
Viaarxiv icon

Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

Add code
Nov 07, 2023
Viaarxiv icon

Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP

Add code
Oct 02, 2023
Viaarxiv icon

Robust Learning with Progressive Data Expansion Against Spurious Correlation

Add code
Jun 08, 2023
Viaarxiv icon

Towards Understanding Mixture of Experts in Deep Learning

Add code
Aug 04, 2022
Figure 1 for Towards Understanding Mixture of Experts in Deep Learning
Figure 2 for Towards Understanding Mixture of Experts in Deep Learning
Figure 3 for Towards Understanding Mixture of Experts in Deep Learning
Figure 4 for Towards Understanding Mixture of Experts in Deep Learning
Viaarxiv icon