Picture for Jeffrey Li

Jeffrey Li

BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models

Add code
Dec 11, 2025
Viaarxiv icon

Language Models Improve When Pretraining Data Matches Target Tasks

Add code
Jul 16, 2025
Viaarxiv icon

OpenThoughts: Data Recipes for Reasoning Models

Add code
Jun 05, 2025
Viaarxiv icon

TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining

Add code
Apr 02, 2025
Viaarxiv icon

Command A: An Enterprise-Ready Large Language Model

Add code
Apr 01, 2025
Figure 1 for Command A: An Enterprise-Ready Large Language Model
Figure 2 for Command A: An Enterprise-Ready Large Language Model
Figure 3 for Command A: An Enterprise-Ready Large Language Model
Figure 4 for Command A: An Enterprise-Ready Large Language Model
Viaarxiv icon

Stronger Than You Think: Benchmarking Weak Supervision on Realistic Tasks

Add code
Jan 13, 2025
Viaarxiv icon

SDS -- See it, Do it, Sorted: Quadruped Skill Synthesis from Single Video Demonstration

Add code
Oct 15, 2024
Figure 1 for SDS -- See it, Do it, Sorted: Quadruped Skill Synthesis from Single Video Demonstration
Figure 2 for SDS -- See it, Do it, Sorted: Quadruped Skill Synthesis from Single Video Demonstration
Figure 3 for SDS -- See it, Do it, Sorted: Quadruped Skill Synthesis from Single Video Demonstration
Figure 4 for SDS -- See it, Do it, Sorted: Quadruped Skill Synthesis from Single Video Demonstration
Viaarxiv icon

Better Alignment with Instruction Back-and-Forth Translation

Add code
Aug 08, 2024
Figure 1 for Better Alignment with Instruction Back-and-Forth Translation
Figure 2 for Better Alignment with Instruction Back-and-Forth Translation
Figure 3 for Better Alignment with Instruction Back-and-Forth Translation
Figure 4 for Better Alignment with Instruction Back-and-Forth Translation
Viaarxiv icon

DataComp-LM: In search of the next generation of training sets for language models

Add code
Jun 18, 2024
Figure 1 for DataComp-LM: In search of the next generation of training sets for language models
Figure 2 for DataComp-LM: In search of the next generation of training sets for language models
Figure 3 for DataComp-LM: In search of the next generation of training sets for language models
Figure 4 for DataComp-LM: In search of the next generation of training sets for language models
Viaarxiv icon

Language models scale reliably with over-training and on downstream tasks

Add code
Mar 13, 2024
Figure 1 for Language models scale reliably with over-training and on downstream tasks
Figure 2 for Language models scale reliably with over-training and on downstream tasks
Figure 3 for Language models scale reliably with over-training and on downstream tasks
Figure 4 for Language models scale reliably with over-training and on downstream tasks
Viaarxiv icon