Picture for Ke Chen

Ke Chen

Saliency-Guided Representation with Consistency Policy Learning for Visual Unsupervised Reinforcement Learning

Add code
Apr 07, 2026
Viaarxiv icon

Efficient Inference for Large Vision-Language Models: Bottlenecks, Techniques, and Prospects

Add code
Apr 07, 2026
Viaarxiv icon

HybridKV: Hybrid KV Cache Compression for Efficient Multimodal Large Language Model Inference

Add code
Apr 07, 2026
Viaarxiv icon

UAV-CB: A Complex-Background RGB-T Dataset and Local Frequency Bridge Network for UAV Detection

Add code
Mar 18, 2026
Viaarxiv icon

MOSS-TTS Technical Report

Add code
Mar 18, 2026
Viaarxiv icon

HiLoRA: Hierarchical Low-Rank Adaptation for Personalized Federated Learning

Add code
Mar 03, 2026
Viaarxiv icon

FastBUS: A Fast Bayesian Framework for Unified Weakly-Supervised Learning

Add code
Feb 28, 2026
Viaarxiv icon

ColoDiff: Integrating Dynamic Consistency With Content Awareness for Colonoscopy Video Generation

Add code
Feb 26, 2026
Viaarxiv icon

MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models

Add code
Feb 12, 2026
Viaarxiv icon

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Add code
Feb 09, 2026
Viaarxiv icon