Picture for Yu Yang

Yu Yang

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Add code
Nov 04, 2024
Figure 1 for WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Figure 2 for WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Figure 3 for WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Figure 4 for WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Viaarxiv icon

Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning

Add code
Oct 30, 2024
Viaarxiv icon

AutoGLM: Autonomous Foundation Agents for GUIs

Add code
Oct 28, 2024
Viaarxiv icon

Personality Analysis from Online Short Video Platforms with Multi-domain Adaptation

Add code
Oct 26, 2024
Viaarxiv icon

FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs

Add code
Oct 22, 2024
Viaarxiv icon

Graph Neural Patching for Cold-Start Recommendations

Add code
Oct 18, 2024
Viaarxiv icon

SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI

Add code
Oct 14, 2024
Figure 1 for SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI
Figure 2 for SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI
Figure 3 for SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI
Figure 4 for SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI
Viaarxiv icon

DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries

Add code
Aug 28, 2024
Viaarxiv icon

Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving

Add code
Aug 26, 2024
Viaarxiv icon

MalLight: Influence-Aware Coordinated Traffic Signal Control for Traffic Signal Malfunctions

Add code
Aug 20, 2024
Viaarxiv icon