Hua Wu

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Oct 03, 2024
Via arXiv

Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging

Oct 02, 2024
Via arXiv

NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time

Aug 07, 2024
Via arXiv

Exploring the Causality of End-to-End Autonomous Driving

Jul 09, 2024
Via arXiv

BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space

Jul 08, 2024
Via arXiv

HFT: Half Fine-Tuning for Large Language Models

Apr 29, 2024
Via arXiv

Dual Modalities of Text: Visual and Textual Generative Pre-training

Apr 17, 2024
Via arXiv

On Training Data Influence of GPT Models

Apr 11, 2024
Via arXiv

BASES: Large-scale Web Search User Simulation with Large Language Model based Agents

Feb 27, 2024
Via arXiv

DeepRicci: Self-supervised Graph Structure-Feature Co-Refinement for Alleviating Over-squashing

Jan 23, 2024
Via arXiv