Picture for Corby Rosset

Corby Rosset

AgentInstruct: Toward Generative Teaching with Agentic Flows

Add code
Jul 03, 2024
Figure 1 for AgentInstruct: Toward Generative Teaching with Agentic Flows
Figure 2 for AgentInstruct: Toward Generative Teaching with Agentic Flows
Figure 3 for AgentInstruct: Toward Generative Teaching with Agentic Flows
Figure 4 for AgentInstruct: Toward Generative Teaching with Agentic Flows
Viaarxiv icon

Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF

Add code
May 31, 2024
Figure 1 for Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
Figure 2 for Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
Viaarxiv icon

MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels

Add code
May 13, 2024
Figure 1 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Figure 2 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Figure 3 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Figure 4 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Viaarxiv icon

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Add code
Apr 23, 2024
Figure 1 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 2 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 3 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 4 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Viaarxiv icon

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Add code
Apr 04, 2024
Figure 1 for Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Figure 2 for Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Figure 3 for Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Figure 4 for Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Viaarxiv icon

Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents

Add code
Feb 27, 2024
Figure 1 for Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents
Figure 2 for Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents
Figure 3 for Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents
Figure 4 for Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents
Viaarxiv icon

Orca-Math: Unlocking the potential of SLMs in Grade School Math

Add code
Feb 16, 2024
Viaarxiv icon

Axiomatic Preference Modeling for Longform Question Answering

Add code
Dec 02, 2023
Viaarxiv icon

Orca 2: Teaching Small Language Models How to Reason

Add code
Nov 21, 2023
Viaarxiv icon

Overview of the TREC 2023 Product Product Search Track

Add code
Nov 15, 2023
Viaarxiv icon