Picture for Yuhui Wang

Yuhui Wang

Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k

Add code
Mar 12, 2025
Viaarxiv icon

PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts

Add code
Mar 09, 2025
Viaarxiv icon

GraphRAG under Fire

Add code
Jan 23, 2025
Viaarxiv icon

Cluster-Based Multi-Agent Task Scheduling for Space-Air-Ground Integrated Networks

Add code
Dec 14, 2024
Figure 1 for Cluster-Based Multi-Agent Task Scheduling for Space-Air-Ground Integrated Networks
Figure 2 for Cluster-Based Multi-Agent Task Scheduling for Space-Air-Ground Integrated Networks
Figure 3 for Cluster-Based Multi-Agent Task Scheduling for Space-Air-Ground Integrated Networks
Figure 4 for Cluster-Based Multi-Agent Task Scheduling for Space-Air-Ground Integrated Networks
Viaarxiv icon

RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction

Add code
Oct 25, 2024
Figure 1 for RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction
Figure 2 for RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction
Figure 3 for RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction
Figure 4 for RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction
Viaarxiv icon

Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning

Add code
Jun 12, 2024
Viaarxiv icon

Highway Value Iteration Networks

Add code
Jun 05, 2024
Figure 1 for Highway Value Iteration Networks
Figure 2 for Highway Value Iteration Networks
Figure 3 for Highway Value Iteration Networks
Figure 4 for Highway Value Iteration Networks
Viaarxiv icon

Highway Reinforcement Learning

Add code
May 28, 2024
Figure 1 for Highway Reinforcement Learning
Figure 2 for Highway Reinforcement Learning
Figure 3 for Highway Reinforcement Learning
Figure 4 for Highway Reinforcement Learning
Viaarxiv icon

Variational Delayed Policy Optimization

Add code
May 23, 2024
Viaarxiv icon

Deep Reinforcement Learning Based Placement for Integrated Access Backhauling in UAV-Assisted Wireless Networks

Add code
Dec 21, 2023
Viaarxiv icon