Picture for Bingbin Liu

Bingbin Liu

Progressive distillation induces an implicit curriculum

Add code
Oct 07, 2024
Viaarxiv icon

TinyGSM: achieving >80% on GSM8k with small language models

Add code
Dec 14, 2023
Viaarxiv icon

Transformers are uninterpretable with myopic methods: a case study with bounded Dyck grammars

Add code
Dec 03, 2023
Viaarxiv icon

Exposing Attention Glitches with Flip-Flop Language Modeling

Add code
Jun 01, 2023
Viaarxiv icon

Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation

Add code
Jun 01, 2023
Viaarxiv icon

Transformers Learn Shortcuts to Automata

Add code
Oct 19, 2022
Viaarxiv icon

Masked prediction tasks: a parameter identifiability view

Add code
Feb 18, 2022
Viaarxiv icon

Analyzing and Improving the Optimization Landscape of Noise-Contrastive Estimation

Add code
Oct 21, 2021
Figure 1 for Analyzing and Improving the Optimization Landscape of Noise-Contrastive Estimation
Figure 2 for Analyzing and Improving the Optimization Landscape of Noise-Contrastive Estimation
Figure 3 for Analyzing and Improving the Optimization Landscape of Noise-Contrastive Estimation
Viaarxiv icon

Contrastive learning of strong-mixing continuous-time stochastic processes

Add code
Mar 03, 2021
Viaarxiv icon

Spatiotemporal Relationship Reasoning for Pedestrian Intent Prediction

Add code
Feb 20, 2020
Figure 1 for Spatiotemporal Relationship Reasoning for Pedestrian Intent Prediction
Figure 2 for Spatiotemporal Relationship Reasoning for Pedestrian Intent Prediction
Figure 3 for Spatiotemporal Relationship Reasoning for Pedestrian Intent Prediction
Figure 4 for Spatiotemporal Relationship Reasoning for Pedestrian Intent Prediction
Viaarxiv icon