Picture for Yeyun Gong

Yeyun Gong

Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling

Add code
Mar 24, 2025
Viaarxiv icon

Process-based Self-Rewarding Language Models

Add code
Mar 05, 2025
Viaarxiv icon

DeepThink: Aligning Language Models with Domain-Specific User Intents

Add code
Feb 08, 2025
Viaarxiv icon

Optimizing Large Language Model Training Using FP4 Quantization

Add code
Jan 28, 2025
Figure 1 for Optimizing Large Language Model Training Using FP4 Quantization
Figure 2 for Optimizing Large Language Model Training Using FP4 Quantization
Figure 3 for Optimizing Large Language Model Training Using FP4 Quantization
Figure 4 for Optimizing Large Language Model Training Using FP4 Quantization
Viaarxiv icon

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Add code
Jan 23, 2025
Figure 1 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 2 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 3 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 4 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Viaarxiv icon

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Add code
Dec 20, 2024
Figure 1 for Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
Figure 2 for Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
Figure 3 for Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
Figure 4 for Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
Viaarxiv icon

From Intention To Implementation: Automating Biomedical Research via LLMs

Add code
Dec 12, 2024
Viaarxiv icon

Generative Context Distillation

Add code
Nov 24, 2024
Viaarxiv icon

Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training

Add code
Nov 21, 2024
Figure 1 for Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training
Figure 2 for Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training
Figure 3 for Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training
Figure 4 for Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training
Viaarxiv icon

Alchemy: Amplifying Theorem-Proving Capability through Symbolic Mutation

Add code
Oct 21, 2024
Figure 1 for Alchemy: Amplifying Theorem-Proving Capability through Symbolic Mutation
Figure 2 for Alchemy: Amplifying Theorem-Proving Capability through Symbolic Mutation
Figure 3 for Alchemy: Amplifying Theorem-Proving Capability through Symbolic Mutation
Figure 4 for Alchemy: Amplifying Theorem-Proving Capability through Symbolic Mutation
Viaarxiv icon