Yang Li

Shanghai Center for Systems Biomedicine, Key Laboratory of Systems Biomedicine

Kimi-VL Technical Report

Apr 10, 2025
Via arXiv

Transferable Mask Transformer: Cross-domain Semantic Segmentation with Region-adaptive Transferability Estimation

Apr 08, 2025
Via arXiv

SkyReels-A2: Compose Anything in Video Diffusion Transformers

Apr 03, 2025
Via arXiv

ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting

Mar 28, 2025
Via arXiv

Analyzable Chain-of-Musical-Thought Prompting for High-Fidelity Music Generation

Mar 25, 2025
Via arXiv

UltraFlwr -- An Efficient Federated Medical and Surgical Object Detection Framework

Mar 19, 2025
Via arXiv

SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability

Mar 18, 2025
Via arXiv

Post-interactive Multimodal Trajectory Prediction for Autonomous Driving

Mar 12, 2025
Via arXiv

EVE: Towards End-to-End Video Subtitle Extraction with Vision-Language Models

Mar 06, 2025
Via arXiv

Rethinking Video Tokenization: A Conditioned Diffusion-based Approach

Mar 05, 2025
Via arXiv