Picture for Rong-Cheng Tu

Rong-Cheng Tu

AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration

Add code
Dec 16, 2024
Viaarxiv icon

Distribution-Consistency-Guided Multi-modal Hashing

Add code
Dec 15, 2024
Viaarxiv icon

SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing

Add code
Nov 28, 2024
Viaarxiv icon

Multi-modal Retrieval Augmented Multi-modal Generation: A Benchmark, Evaluate Metrics and Strong Baselines

Add code
Nov 25, 2024
Viaarxiv icon

Automatic Evaluation for Text-to-image Generation: Task-decomposed Framework, Distilled Training, and Meta-evaluation Benchmark

Add code
Nov 23, 2024
Viaarxiv icon

Diffusion Model-Based Video Editing: A Survey

Add code
Jun 26, 2024
Figure 1 for Diffusion Model-Based Video Editing: A Survey
Figure 2 for Diffusion Model-Based Video Editing: A Survey
Figure 3 for Diffusion Model-Based Video Editing: A Survey
Figure 4 for Diffusion Model-Based Video Editing: A Survey
Viaarxiv icon

Global and Local Semantic Completion Learning for Vision-Language Pre-training

Add code
Jun 12, 2023
Viaarxiv icon

Unsupervised Hashing with Semantic Concept Mining

Add code
Sep 23, 2022
Figure 1 for Unsupervised Hashing with Semantic Concept Mining
Figure 2 for Unsupervised Hashing with Semantic Concept Mining
Figure 3 for Unsupervised Hashing with Semantic Concept Mining
Figure 4 for Unsupervised Hashing with Semantic Concept Mining
Viaarxiv icon

HunYuan_tvr for Text-Video Retrivial

Add code
Apr 14, 2022
Figure 1 for HunYuan_tvr for Text-Video Retrivial
Figure 2 for HunYuan_tvr for Text-Video Retrivial
Figure 3 for HunYuan_tvr for Text-Video Retrivial
Figure 4 for HunYuan_tvr for Text-Video Retrivial
Viaarxiv icon

Deep Cross-modal Proxy Hashing

Add code
Nov 06, 2020
Figure 1 for Deep Cross-modal Proxy Hashing
Figure 2 for Deep Cross-modal Proxy Hashing
Figure 3 for Deep Cross-modal Proxy Hashing
Figure 4 for Deep Cross-modal Proxy Hashing
Viaarxiv icon