Kai Wang

Refer to the report for detailed contributions

Mixture of Heterogeneous Grouped Experts for Language Modeling

Apr 28, 2026, via arXiv

KAConvNet: Kolmogorov-Arnold Convolutional Networks for Vision Recognition

Apr 25, 2026, via arXiv

Multi-Perspective Evidence Synthesis and Reasoning for Unsupervised Multimodal Entity Linking

Apr 22, 2026, via arXiv

ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling

Apr 16, 2026, via arXiv

The Salami Slicing Threat: Exploiting Cumulative Risks in LLM Systems

Apr 13, 2026, via arXiv

Back to Basics: Let Conversational Agents Remember with Just Retrieval and Generation

Apr 13, 2026, via arXiv

Beyond Loss Values: Robust Dynamic Pruning via Loss Trajectory Alignment

Apr 08, 2026, via arXiv

OmniSonic: Towards Universal and Holistic Audio Generation from Video and Text

Apr 06, 2026, via arXiv

Adaptive Action Chunking at Inference-time for Vision-Language-Action Models

Apr 05, 2026, via arXiv

Beyond Logit Adjustment: A Residual Decomposition Framework for Long-Tailed Reranking

Apr 02, 2026, via arXiv