Picture for Fandong Meng

Fandong Meng

A Dual-Space Framework for General Knowledge Distillation of Large Language Models

Add code
Apr 15, 2025
Viaarxiv icon

Deep Reasoning Translation via Reinforcement Learning

Add code
Apr 14, 2025
Viaarxiv icon

D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens

Add code
Mar 21, 2025
Viaarxiv icon

LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning

Add code
Mar 04, 2025
Viaarxiv icon

DBudgetKV: Dynamic Budget in KV Cache Compression for Ensuring Optimal Performance

Add code
Feb 24, 2025
Viaarxiv icon

Warmup-Distill: Bridge the Distribution Mismatch between Teacher and Student before Knowledge Distillation

Add code
Feb 17, 2025
Viaarxiv icon

LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information

Add code
Feb 04, 2025
Figure 1 for LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information
Figure 2 for LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information
Figure 3 for LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information
Figure 4 for LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information
Viaarxiv icon

DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

Add code
Feb 03, 2025
Figure 1 for DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
Figure 2 for DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
Figure 3 for DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
Figure 4 for DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
Viaarxiv icon

Personalized Language Model Learning on Text Data Without User Identifiers

Add code
Jan 10, 2025
Figure 1 for Personalized Language Model Learning on Text Data Without User Identifiers
Figure 2 for Personalized Language Model Learning on Text Data Without User Identifiers
Figure 3 for Personalized Language Model Learning on Text Data Without User Identifiers
Figure 4 for Personalized Language Model Learning on Text Data Without User Identifiers
Viaarxiv icon

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

Add code
Dec 23, 2024
Figure 1 for DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought
Figure 2 for DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought
Figure 3 for DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought
Figure 4 for DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought
Viaarxiv icon