Picture for Rong-Cheng Tu

Rong-Cheng Tu

Intra-Trajectory Consistency for Reward Modeling

Add code
Jun 10, 2025
Viaarxiv icon

A Survey of Automatic Evaluation Methods on Text, Visual and Speech Generations

Add code
Jun 06, 2025
Viaarxiv icon

Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval

Add code
May 26, 2025
Viaarxiv icon

MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval

Add code
May 26, 2025
Viaarxiv icon

VORTA: Efficient Video Diffusion via Routing Sparse Attention

Add code
May 24, 2025
Viaarxiv icon

T2I-Eval-R1: Reinforcement Learning-Driven Reasoning for Interpretable Text-to-Image Evaluation

Add code
May 23, 2025
Viaarxiv icon

Robust Distribution Alignment for Industrial Anomaly Detection under Distribution Shift

Add code
Mar 19, 2025
Viaarxiv icon

AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration

Add code
Dec 16, 2024
Viaarxiv icon

Distribution-Consistency-Guided Multi-modal Hashing

Add code
Dec 15, 2024
Viaarxiv icon

SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing

Add code
Nov 28, 2024
Viaarxiv icon