Picture for Yuxiao Dong

Yuxiao Dong

Does RLHF Scale? Exploring the Impacts From Data, Model, and Method

Add code
Dec 08, 2024
Viaarxiv icon

GLM-4-Voice: Towards Intelligent and Human-Like End-to-End Spoken Chatbot

Add code
Dec 03, 2024
Viaarxiv icon

Scaling Speech-Text Pre-training with Synthetic Interleaved Data

Add code
Nov 26, 2024
Viaarxiv icon

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Add code
Nov 04, 2024
Figure 1 for WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Figure 2 for WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Figure 3 for WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Figure 4 for WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Viaarxiv icon

DreamPolish: Domain Score Distillation With Progressive Geometry Generation

Add code
Nov 03, 2024
Figure 1 for DreamPolish: Domain Score Distillation With Progressive Geometry Generation
Figure 2 for DreamPolish: Domain Score Distillation With Progressive Geometry Generation
Figure 3 for DreamPolish: Domain Score Distillation With Progressive Geometry Generation
Figure 4 for DreamPolish: Domain Score Distillation With Progressive Geometry Generation
Viaarxiv icon

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

Add code
Oct 31, 2024
Figure 1 for AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents
Figure 2 for AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents
Figure 3 for AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents
Figure 4 for AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents
Viaarxiv icon

SceneGenAgent: Precise Industrial Scene Generation with Coding Agent

Add code
Oct 29, 2024
Viaarxiv icon

LongReward: Improving Long-context Large Language Models with AI Feedback

Add code
Oct 28, 2024
Viaarxiv icon

AutoGLM: Autonomous Foundation Agents for GUIs

Add code
Oct 28, 2024
Viaarxiv icon

LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering

Add code
Oct 23, 2024
Viaarxiv icon