Picture for Cyril Zhang

Cyril Zhang

ASL STEM Wiki: Dataset and Benchmark for Interpreting STEM Articles

Add code
Nov 08, 2024
Viaarxiv icon

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Add code
Apr 23, 2024
Figure 1 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 2 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 3 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 4 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Viaarxiv icon

Can large language models explore in-context?

Add code
Mar 22, 2024
Viaarxiv icon

Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression

Add code
Oct 17, 2023
Viaarxiv icon

Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck

Add code
Sep 07, 2023
Viaarxiv icon

Exposing Attention Glitches with Flip-Flop Language Modeling

Add code
Jun 01, 2023
Viaarxiv icon

Learning Hidden Markov Models Using Conditional Samples

Add code
Feb 28, 2023
Viaarxiv icon

Neural Active Learning on Heteroskedastic Distributions

Add code
Nov 02, 2022
Viaarxiv icon

Transformers Learn Shortcuts to Automata

Add code
Oct 19, 2022
Viaarxiv icon

Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms

Add code
Sep 01, 2022
Figure 1 for Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms
Figure 2 for Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms
Figure 3 for Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms
Viaarxiv icon