Picture for Yuling Gu

Yuling Gu

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Add code
Nov 22, 2024
Viaarxiv icon

SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs

Add code
Oct 17, 2024
Figure 1 for SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs
Figure 2 for SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs
Figure 3 for SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs
Figure 4 for SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs
Viaarxiv icon

OLMoE: Open Mixture-of-Experts Language Models

Add code
Sep 03, 2024
Figure 1 for OLMoE: Open Mixture-of-Experts Language Models
Figure 2 for OLMoE: Open Mixture-of-Experts Language Models
Figure 3 for OLMoE: Open Mixture-of-Experts Language Models
Figure 4 for OLMoE: Open Mixture-of-Experts Language Models
Viaarxiv icon

OLMES: A Standard for Language Model Evaluations

Add code
Jun 12, 2024
Figure 1 for OLMES: A Standard for Language Model Evaluations
Figure 2 for OLMES: A Standard for Language Model Evaluations
Figure 3 for OLMES: A Standard for Language Model Evaluations
Figure 4 for OLMES: A Standard for Language Model Evaluations
Viaarxiv icon

WorldValuesBench: A Large-Scale Benchmark Dataset for Multi-Cultural Value Awareness of Language Models

Add code
Apr 25, 2024
Viaarxiv icon

PROC2PDDL: Open-Domain Planning Representations from Texts

Add code
Feb 29, 2024
Viaarxiv icon

OLMo: Accelerating the Science of Language Models

Add code
Feb 07, 2024
Figure 1 for OLMo: Accelerating the Science of Language Models
Figure 2 for OLMo: Accelerating the Science of Language Models
Figure 3 for OLMo: Accelerating the Science of Language Models
Figure 4 for OLMo: Accelerating the Science of Language Models
Viaarxiv icon

Digital Socrates: Evaluating LLMs through explanation critiques

Add code
Nov 16, 2023
Viaarxiv icon

What Makes it Ok to Set a Fire? Iterative Self-distillation of Contexts and Rationales for Disambiguating Defeasible Social and Moral Situations

Add code
Nov 01, 2023
Viaarxiv icon

Measure More, Question More: Experimental Studies on Transformer-based Language Models and Complement Coercion

Add code
Dec 20, 2022
Viaarxiv icon