Zixiao Huang

MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression

Jun 21, 2024
Via arXiv

HetHub: A Heterogeneous distributed hybrid training system for large-scale models

May 25, 2024
Via arXiv

FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs

Jan 09, 2024
Via arXiv

Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym

Dec 06, 2023
Via arXiv