Picture for Gang Yu

Gang Yu

Department of Biomedical Engineering, School of Basic Medical Sciences, Central South University, Changsha, China

Mind-Paced Speaking: A Dual-Brain Approach to Real-Time Reasoning in Spoken Language Models

Add code
Oct 10, 2025
Viaarxiv icon

pFedSAM: Personalized Federated Learning of Segment Anything Model for Medical Image Segmentation

Add code
Sep 19, 2025
Viaarxiv icon

Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer

Add code
Aug 12, 2025
Viaarxiv icon

SC-Captioner: Improving Image Captioning with Self-Correction by Reinforcement Learning

Add code
Aug 08, 2025
Viaarxiv icon

Step-Audio 2 Technical Report

Add code
Jul 24, 2025
Viaarxiv icon

OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation

Add code
Jun 09, 2025
Viaarxiv icon

DreamDance: Animating Character Art via Inpainting Stable Gaussian Worlds

Add code
May 30, 2025
Viaarxiv icon

ViStoryBench: Comprehensive Benchmark Suite for Story Visualization

Add code
May 30, 2025
Viaarxiv icon

KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models

Add code
May 22, 2025
Viaarxiv icon

Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Add code
May 12, 2025
Viaarxiv icon