Long Ma

Kimi K2.5: Visual Agentic Intelligence

Feb 02, 2026
Via arXiv

Can Deep Research Agents Find and Organize? Evaluating the Synthesis Gap with Expert Taxonomies

Jan 18, 2026
Via arXiv

Your One-Stop Solution for AI-Generated Video Detection

Jan 16, 2026
Via arXiv

Route Experts by Sequence, not by Token

Nov 09, 2025
Via arXiv

Social World Model-Augmented Mechanism Design Policy Learning

Oct 22, 2025
Via arXiv

Thought Purity: Defense Paradigm For Chain-of-Thought Attack

Jul 16, 2025
Via arXiv

A Multi-Stage Framework for Multimodal Controllable Speech Synthesis

Jun 26, 2025
Via arXiv

Benchmarking Laparoscopic Surgical Image Restoration and Beyond

May 25, 2025
Via arXiv

MPE-TTS: Customized Emotion Zero-Shot Text-To-Speech Using Multi-Modal Prompt

May 24, 2025
Via arXiv

Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception

Apr 09, 2025
Via arXiv