Kai Li

Department of Computer Science and Technology, Tsinghua University, Beijing, China

Synergizing Reinforcement Learning and Genetic Algorithms for Neural Combinatorial Optimization

Jun 11, 2025
Via arXiv

Segment Concealed Objects with Incomplete Supervision

Jun 10, 2025
Via arXiv

A Fast and Lightweight Model for Causal Audio-Visual Speech Separation

Jun 07, 2025
Via arXiv

Zero-Trust Foundation Models: A New Paradigm for Secure and Collaborative Artificial Intelligence for Internet of Things

May 26, 2025
Via arXiv

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline

May 25, 2025
Via arXiv

AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models

May 22, 2025
Via arXiv

Time-Frequency-Based Attention Cache Memory Model for Real-Time Speech Separation

May 19, 2025
Via arXiv

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

May 19, 2025
Via arXiv

DIMM: Decoupled Multi-hierarchy Kalman Filter for 3D Object Tracking

May 18, 2025
Via arXiv

SepPrune: Structured Pruning for Efficient Deep Speech Separation

May 17, 2025
Via arXiv