Picture for Xiao Liu

Xiao Liu

Tsinghua University

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Add code
Nov 04, 2024
Figure 1 for WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Figure 2 for WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Figure 3 for WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Figure 4 for WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Viaarxiv icon

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

Add code
Oct 31, 2024
Viaarxiv icon

AutoGLM: Autonomous Foundation Agents for GUIs

Add code
Oct 28, 2024
Viaarxiv icon

A Survey of AI-Generated Video Evaluation

Add code
Oct 24, 2024
Figure 1 for A Survey of AI-Generated Video Evaluation
Figure 2 for A Survey of AI-Generated Video Evaluation
Figure 3 for A Survey of AI-Generated Video Evaluation
Figure 4 for A Survey of AI-Generated Video Evaluation
Viaarxiv icon

A Statistical Machine Learning Approach for Adapting Reduced-Order Models using Projected Gaussian Process

Add code
Oct 18, 2024
Figure 1 for A Statistical Machine Learning Approach for Adapting Reduced-Order Models using Projected Gaussian Process
Figure 2 for A Statistical Machine Learning Approach for Adapting Reduced-Order Models using Projected Gaussian Process
Figure 3 for A Statistical Machine Learning Approach for Adapting Reduced-Order Models using Projected Gaussian Process
Figure 4 for A Statistical Machine Learning Approach for Adapting Reduced-Order Models using Projected Gaussian Process
Viaarxiv icon

DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models

Add code
Oct 09, 2024
Figure 1 for DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models
Figure 2 for DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models
Figure 3 for DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models
Figure 4 for DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models
Viaarxiv icon

YouTube Video Analytics for Patient Engagement: Evidence from Colonoscopy Preparation Videos

Add code
Oct 01, 2024
Figure 1 for YouTube Video Analytics for Patient Engagement: Evidence from Colonoscopy Preparation Videos
Figure 2 for YouTube Video Analytics for Patient Engagement: Evidence from Colonoscopy Preparation Videos
Figure 3 for YouTube Video Analytics for Patient Engagement: Evidence from Colonoscopy Preparation Videos
Figure 4 for YouTube Video Analytics for Patient Engagement: Evidence from Colonoscopy Preparation Videos
Viaarxiv icon

LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models

Add code
Sep 05, 2024
Figure 1 for LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
Figure 2 for LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
Figure 3 for LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
Figure 4 for LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
Viaarxiv icon

FMRFT: Fusion Mamba and DETR for Query Time Sequence Intersection Fish Tracking

Add code
Sep 02, 2024
Viaarxiv icon

Fishers Harvest Parallel Unlearning in Inherited Model Networks

Add code
Aug 16, 2024
Figure 1 for Fishers Harvest Parallel Unlearning in Inherited Model Networks
Figure 2 for Fishers Harvest Parallel Unlearning in Inherited Model Networks
Figure 3 for Fishers Harvest Parallel Unlearning in Inherited Model Networks
Figure 4 for Fishers Harvest Parallel Unlearning in Inherited Model Networks
Viaarxiv icon