Picture for Qinghao Hu

Qinghao Hu

IntraSlice: Towards High-Performance Structural Pruning with Block-Intra PCA for LLMs

Add code
Feb 02, 2026
Viaarxiv icon

Borrowing from anything: A generalizable framework for reference-guided instance editing

Add code
Dec 17, 2025
Viaarxiv icon

RoboNeuron: A Modular Framework Linking Foundation Models and ROS for Embodied AI

Add code
Dec 11, 2025
Figure 1 for RoboNeuron: A Modular Framework Linking Foundation Models and ROS for Embodied AI
Figure 2 for RoboNeuron: A Modular Framework Linking Foundation Models and ROS for Embodied AI
Figure 3 for RoboNeuron: A Modular Framework Linking Foundation Models and ROS for Embodied AI
Figure 4 for RoboNeuron: A Modular Framework Linking Foundation Models and ROS for Embodied AI
Viaarxiv icon

Scaling RL to Long Videos

Add code
Jul 10, 2025
Viaarxiv icon

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

Add code
Feb 20, 2025
Viaarxiv icon

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Add code
Aug 21, 2024
Figure 1 for LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Figure 2 for LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Figure 3 for LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Figure 4 for LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Viaarxiv icon

TorchGT: A Holistic System for Large-scale Graph Transformer Training

Add code
Jul 19, 2024
Figure 1 for TorchGT: A Holistic System for Large-scale Graph Transformer Training
Figure 2 for TorchGT: A Holistic System for Large-scale Graph Transformer Training
Figure 3 for TorchGT: A Holistic System for Large-scale Graph Transformer Training
Figure 4 for TorchGT: A Holistic System for Large-scale Graph Transformer Training
Viaarxiv icon

Characterization of Large Language Model Development in the Datacenter

Add code
Mar 12, 2024
Viaarxiv icon

MEGA: A Memory-Efficient GNN Accelerator Exploiting Degree-Aware Mixed-Precision Quantization

Add code
Nov 16, 2023
Figure 1 for MEGA: A Memory-Efficient GNN Accelerator Exploiting Degree-Aware Mixed-Precision Quantization
Figure 2 for MEGA: A Memory-Efficient GNN Accelerator Exploiting Degree-Aware Mixed-Precision Quantization
Figure 3 for MEGA: A Memory-Efficient GNN Accelerator Exploiting Degree-Aware Mixed-Precision Quantization
Figure 4 for MEGA: A Memory-Efficient GNN Accelerator Exploiting Degree-Aware Mixed-Precision Quantization
Viaarxiv icon

Spiking NeRF: Making Bio-inspired Neural Networks See through the Real World

Add code
Sep 20, 2023
Figure 1 for Spiking NeRF: Making Bio-inspired Neural Networks See through the Real World
Figure 2 for Spiking NeRF: Making Bio-inspired Neural Networks See through the Real World
Figure 3 for Spiking NeRF: Making Bio-inspired Neural Networks See through the Real World
Figure 4 for Spiking NeRF: Making Bio-inspired Neural Networks See through the Real World
Viaarxiv icon