Picture for Yeyun Gong

Yeyun Gong

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Add code
Dec 20, 2024
Viaarxiv icon

From Intention To Implementation: Automating Biomedical Research via LLMs

Add code
Dec 12, 2024
Viaarxiv icon

Generative Context Distillation

Add code
Nov 24, 2024
Viaarxiv icon

Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training

Add code
Nov 21, 2024
Figure 1 for Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training
Figure 2 for Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training
Figure 3 for Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training
Figure 4 for Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training
Viaarxiv icon

Alchemy: Amplifying Theorem-Proving Capability through Symbolic Mutation

Add code
Oct 21, 2024
Viaarxiv icon

Automated Proof Generation for Rust Code via Self-Evolution

Add code
Oct 21, 2024
Figure 1 for Automated Proof Generation for Rust Code via Self-Evolution
Figure 2 for Automated Proof Generation for Rust Code via Self-Evolution
Figure 3 for Automated Proof Generation for Rust Code via Self-Evolution
Figure 4 for Automated Proof Generation for Rust Code via Self-Evolution
Viaarxiv icon

Integrative Decoding: Improve Factuality via Implicit Self-consistency

Add code
Oct 02, 2024
Viaarxiv icon

Task Oriented In-Domain Data Augmentation

Add code
Jun 24, 2024
Viaarxiv icon

Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance

Add code
Jun 21, 2024
Viaarxiv icon

MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels

Add code
May 13, 2024
Figure 1 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Figure 2 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Figure 3 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Figure 4 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Viaarxiv icon