Picture for Tao Jin

Tao Jin

University of Science and Technology of China

Chat-Driven Text Generation and Interaction for Person Retrieval

Add code
Sep 16, 2025
Viaarxiv icon

SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer

Add code
Sep 04, 2025
Viaarxiv icon

TAP: Parameter-efficient Task-Aware Prompting for Adverse Weather Removal

Add code
Aug 11, 2025
Viaarxiv icon

Vela: Scalable Embeddings with Voice Large Language Models for Multimodal Retrieval

Add code
Jun 17, 2025
Viaarxiv icon

IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models

Add code
May 30, 2025
Viaarxiv icon

Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation

Add code
May 30, 2025
Viaarxiv icon

TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis

Add code
May 20, 2025
Viaarxiv icon

Observe-R1: Unlocking Reasoning Abilities of MLLMs with Dynamic Progressive Reinforcement Learning

Add code
May 18, 2025
Viaarxiv icon

T2A-Feedback: Improving Basic Capabilities of Text-to-Audio Generation via Fine-grained AI Feedback

Add code
May 15, 2025
Viaarxiv icon

Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision

Add code
Apr 30, 2025
Viaarxiv icon