Changyao Tian

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Dec 12, 2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Jun 13, 2024

OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Jun 12, 2024

Learning 1D Causal Visual Representation with De-focus Attention Networks

Jun 06, 2024

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

Jan 18, 2024

ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process

Jun 08, 2023

VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition

Dec 09, 2021