Steven Zheng

Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers

Apr 06, 2025
Via arXiv

Best Practices and Lessons Learned on Synthetic Data for Language Models

Apr 11, 2024
Via arXiv

In-Context Principle Learning from Mistakes

Feb 09, 2024
Via arXiv

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023
Via arXiv

PaLM 2 Technical Report

May 17, 2023
Via arXiv