Picture for Yuhui Wang

Yuhui Wang

RobustKV: Defending Large Language Models against Jailbreak Attacks via KV Eviction

Add code
Oct 25, 2024
Viaarxiv icon

Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning

Add code
Jun 12, 2024
Viaarxiv icon

Highway Value Iteration Networks

Add code
Jun 05, 2024
Viaarxiv icon

Highway Reinforcement Learning

Add code
May 28, 2024
Viaarxiv icon

Variational Delayed Policy Optimization

Add code
May 23, 2024
Viaarxiv icon

Deep Reinforcement Learning Based Placement for Integrated Access Backhauling in UAV-Assisted Wireless Networks

Add code
Dec 21, 2023
Viaarxiv icon

Learning to Identify Critical States for Reinforcement Learning from Videos

Add code
Aug 15, 2023
Viaarxiv icon

Mindstorms in Natural Language-Based Societies of Mind

Add code
May 26, 2023
Figure 1 for Mindstorms in Natural Language-Based Societies of Mind
Figure 2 for Mindstorms in Natural Language-Based Societies of Mind
Figure 3 for Mindstorms in Natural Language-Based Societies of Mind
Figure 4 for Mindstorms in Natural Language-Based Societies of Mind
Viaarxiv icon

Guiding Online Reinforcement Learning with Action-Free Offline Pretraining

Add code
Jan 30, 2023
Viaarxiv icon

Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition

Add code
Nov 07, 2022
Figure 1 for Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition
Figure 2 for Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition
Figure 3 for Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition
Figure 4 for Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition
Viaarxiv icon