Picture for Kai Yu

Kai Yu

Sherman

PACER: Blockwise Pre-verification for Speculative Decoding with Adaptive Length

Add code
Feb 01, 2026
Viaarxiv icon

Fronthaul-Efficient Distributed Cooperative 3D Positioning with Quantized Latent CSI Embeddings

Add code
Jan 31, 2026
Viaarxiv icon

CMANet: Channel-Masked Attention Network for Cooperative Multi-Base-Station 3D Positioning

Add code
Jan 31, 2026
Viaarxiv icon

Habibi: Laying the Open-Source Foundation of Unified-Dialectal Arabic Speech Synthesis

Add code
Jan 20, 2026
Viaarxiv icon

PaperGuide: Making Small Language-Model Paper-Reading Agents More Efficient

Add code
Jan 19, 2026
Viaarxiv icon

Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders

Add code
Jan 15, 2026
Viaarxiv icon

SLAM-LLM: A Modular, Open-Source Multimodal Large Language Model Framework and Best Practice for Speech, Language, Audio and Music Processing

Add code
Jan 14, 2026
Viaarxiv icon

What Does the Speaker Embedding Encode?

Add code
Dec 20, 2025
Figure 1 for What Does the Speaker Embedding Encode?
Figure 2 for What Does the Speaker Embedding Encode?
Figure 3 for What Does the Speaker Embedding Encode?
Figure 4 for What Does the Speaker Embedding Encode?
Viaarxiv icon

MergeDNA: Context-aware Genome Modeling with Dynamic Tokenization through Token Merging

Add code
Nov 17, 2025
Figure 1 for MergeDNA: Context-aware Genome Modeling with Dynamic Tokenization through Token Merging
Figure 2 for MergeDNA: Context-aware Genome Modeling with Dynamic Tokenization through Token Merging
Figure 3 for MergeDNA: Context-aware Genome Modeling with Dynamic Tokenization through Token Merging
Figure 4 for MergeDNA: Context-aware Genome Modeling with Dynamic Tokenization through Token Merging
Viaarxiv icon

BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction

Add code
Nov 08, 2025
Figure 1 for BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction
Figure 2 for BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction
Figure 3 for BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction
Figure 4 for BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction
Viaarxiv icon