Caifeng Shan

Optimize-at-Capture: Highly-adaptive Exposure Controlling for In-Vehicle Non-contact Heart-rate Monitoring

May 06, 2026
Via arXiv

SpeechParaling-Bench: A Comprehensive Benchmark for Paralinguistic-Aware Speech Generation

Apr 22, 2026
Via arXiv

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Apr 06, 2026
Via arXiv

Center-Aware Detection with Swin-based Co-DETR Framework for Cervical Cytology

Apr 02, 2026
Via arXiv

VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding

Mar 23, 2026
Via arXiv

One-to-More: High-Fidelity Training-Free Anomaly Generation with Attention Control

Mar 18, 2026
Via arXiv

NOVA: Sparse Control, Dense Synthesis for Pair-Free Video Editing

Mar 03, 2026
Via arXiv

Exposing and Defending the Achilles' Heel of Video Mixture-of-Experts

Feb 01, 2026
Via arXiv

VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation

Oct 10, 2025
Via arXiv

An Effective End-to-End Solution for Multimodal Action Recognition

Jun 11, 2025
Via arXiv