Picture for Jason Phang

Jason Phang

Tony

GPT-4o System Card

Add code
Oct 25, 2024
Viaarxiv icon

Large Language Models as Misleading Assistants in Conversation

Add code
Jul 16, 2024
Viaarxiv icon

Lessons from the Trenches on Reproducible Evaluation of Language Models

Add code
May 23, 2024
Viaarxiv icon

Investigating the Effectiveness of HyperTuning via Gisting

Add code
Feb 26, 2024
Viaarxiv icon

Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?

Add code
Sep 19, 2023
Viaarxiv icon

Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs

Add code
May 23, 2023
Viaarxiv icon

Tool Learning with Foundation Models

Add code
Apr 17, 2023
Viaarxiv icon

Pretraining Language Models with Human Preferences

Add code
Feb 16, 2023
Viaarxiv icon

HyperTuning: Toward Adapting Large Language Models without Back-propagation

Add code
Nov 22, 2022
Viaarxiv icon

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Add code
Nov 09, 2022
Viaarxiv icon