Picture for Olga Golovneva

Olga Golovneva

Self-Taught Evaluators

Add code
Aug 05, 2024
Figure 1 for Self-Taught Evaluators
Figure 2 for Self-Taught Evaluators
Figure 3 for Self-Taught Evaluators
Figure 4 for Self-Taught Evaluators
Viaarxiv icon

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Add code
Jul 28, 2024
Figure 1 for Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge
Figure 2 for Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge
Figure 3 for Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge
Figure 4 for Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge
Viaarxiv icon

Contextual Position Encoding: Learning to Count What's Important

Add code
May 29, 2024
Figure 1 for Contextual Position Encoding: Learning to Count What's Important
Figure 2 for Contextual Position Encoding: Learning to Count What's Important
Figure 3 for Contextual Position Encoding: Learning to Count What's Important
Figure 4 for Contextual Position Encoding: Learning to Count What's Important
Viaarxiv icon

Reverse Training to Nurse the Reversal Curse

Add code
Mar 20, 2024
Figure 1 for Reverse Training to Nurse the Reversal Curse
Figure 2 for Reverse Training to Nurse the Reversal Curse
Figure 3 for Reverse Training to Nurse the Reversal Curse
Figure 4 for Reverse Training to Nurse the Reversal Curse
Viaarxiv icon

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Add code
Mar 12, 2024
Figure 1 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Figure 2 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Figure 3 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Figure 4 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
Viaarxiv icon

Efficient Tool Use with Chain-of-Abstraction Reasoning

Add code
Jan 30, 2024
Viaarxiv icon

PathFinder: Guided Search over Multi-Step Reasoning Paths

Add code
Dec 12, 2023
Viaarxiv icon

DOMINO: A Dual-System for Multi-step Visual Language Reasoning

Add code
Oct 04, 2023
Viaarxiv icon

Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning

Add code
Sep 05, 2023
Figure 1 for Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Figure 2 for Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Figure 3 for Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Figure 4 for Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Viaarxiv icon

Shepherd: A Critic for Language Model Generation

Add code
Aug 08, 2023
Figure 1 for Shepherd: A Critic for Language Model Generation
Figure 2 for Shepherd: A Critic for Language Model Generation
Figure 3 for Shepherd: A Critic for Language Model Generation
Figure 4 for Shepherd: A Critic for Language Model Generation
Viaarxiv icon