Picture for Hua Wu

Hua Wu

BeamLoRA: Beam-Constraint Low-Rank Adaptation

Add code
Feb 19, 2025
Viaarxiv icon

Shall Your Data Strategy Work? Perform a Swift Study

Add code
Feb 19, 2025
Viaarxiv icon

Inner Thinking Transformer: Leveraging Dynamic Depth Scaling to Foster Adaptive Internal Thinking

Add code
Feb 19, 2025
Viaarxiv icon

Curiosity-Driven Reinforcement Learning from Human Feedback

Add code
Jan 20, 2025
Viaarxiv icon

Mixture of Hidden-Dimensions Transformer

Add code
Dec 10, 2024
Figure 1 for Mixture of Hidden-Dimensions Transformer
Figure 2 for Mixture of Hidden-Dimensions Transformer
Figure 3 for Mixture of Hidden-Dimensions Transformer
Figure 4 for Mixture of Hidden-Dimensions Transformer
Viaarxiv icon

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Add code
Oct 03, 2024
Figure 1 for MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Figure 2 for MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Figure 3 for MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Figure 4 for MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Viaarxiv icon

Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging

Add code
Oct 02, 2024
Figure 1 for Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging
Figure 2 for Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging
Figure 3 for Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging
Figure 4 for Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging
Viaarxiv icon

NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time

Add code
Aug 07, 2024
Viaarxiv icon

Exploring the Causality of End-to-End Autonomous Driving

Add code
Jul 09, 2024
Figure 1 for Exploring the Causality of End-to-End Autonomous Driving
Figure 2 for Exploring the Causality of End-to-End Autonomous Driving
Figure 3 for Exploring the Causality of End-to-End Autonomous Driving
Figure 4 for Exploring the Causality of End-to-End Autonomous Driving
Viaarxiv icon

BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space

Add code
Jul 08, 2024
Viaarxiv icon