Picture for Zihao Zheng

Zihao Zheng

Eric

CAST-TTS: A Simple Cross-Attention Framework for Unified Timbre Control in TTS

Add code
Mar 17, 2026
Viaarxiv icon

RAPID: Redundancy-Aware and Compatibility-Optimal Edge-Cloud Partitioned Inference for Diverse VLA Models

Add code
Mar 12, 2026
Viaarxiv icon

DyQ-VLA: Temporal-Dynamic-Aware Quantization for Embodied Vision-Language-Action Models

Add code
Mar 09, 2026
Viaarxiv icon

KERV: Kinematic-Rectified Speculative Decoding for Embodied VLA Models

Add code
Mar 02, 2026
Viaarxiv icon

ToProVAR: Efficient Visual Autoregressive Modeling via Tri-Dimensional Entropy-Aware Semantic Analysis and Sparsity Optimization

Add code
Feb 26, 2026
Viaarxiv icon

Diversity or Precision? A Deep Dive into Next Token Prediction

Add code
Dec 28, 2025
Viaarxiv icon

EaqVLA: Encoding-aligned Quantization for Vision-Language-Action Models

Add code
May 27, 2025
Viaarxiv icon

FedHQ: Hybrid Runtime Quantization for Federated Learning

Add code
May 17, 2025
Viaarxiv icon

MoQa: Rethinking MoE Quantization with Multi-stage Data-model Distribution Awareness

Add code
Mar 27, 2025
Figure 1 for MoQa: Rethinking MoE Quantization with Multi-stage Data-model Distribution Awareness
Figure 2 for MoQa: Rethinking MoE Quantization with Multi-stage Data-model Distribution Awareness
Figure 3 for MoQa: Rethinking MoE Quantization with Multi-stage Data-model Distribution Awareness
Figure 4 for MoQa: Rethinking MoE Quantization with Multi-stage Data-model Distribution Awareness
Viaarxiv icon

Test Time Training for 4D Medical Image Interpolation

Add code
Feb 04, 2025
Figure 1 for Test Time Training for 4D Medical Image Interpolation
Figure 2 for Test Time Training for 4D Medical Image Interpolation
Figure 3 for Test Time Training for 4D Medical Image Interpolation
Figure 4 for Test Time Training for 4D Medical Image Interpolation
Viaarxiv icon