Cyril Zhang

Phi-4 Technical Report

Dec 12, 2024

Self-Improvement in Language Models: The Sharpening Mechanism

Dec 02, 2024

ASL STEM Wiki: Dataset and Benchmark for Interpreting STEM Articles

Nov 08, 2024

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Apr 23, 2024

Can large language models explore in-context?

Mar 22, 2024

Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression

Oct 17, 2023

Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck

Sep 07, 2023

Exposing Attention Glitches with Flip-Flop Language Modeling

Jun 01, 2023

Learning Hidden Markov Models Using Conditional Samples

Feb 28, 2023

Neural Active Learning on Heteroskedastic Distributions

Nov 02, 2022