Qing-Guo Chen

OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions

Dec 09, 2024
Via arXiv

PEMF-VVTO: Point-Enhanced Video Virtual Try-on via Mask-free Paradigm

Dec 05, 2024
Via arXiv

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Oct 10, 2024
Via arXiv

Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees

Jun 11, 2024
Via arXiv

Wings: Learning Multimodal LLMs without Text-only Forgetting

Jun 05, 2024
Via arXiv

Parrot: Multilingual Visual Instruction Tuning

Jun 04, 2024
Via arXiv

Ovis: Structural Embedding Alignment for Multimodal Large Language Model

May 31, 2024
Via arXiv

TAI++: Text as Image for Multi-Label Image Classification by Co-Learning Transferable Prompt

May 11, 2024
Via arXiv

Sparse Attentive Memory Network for Click-through Rate Prediction with Long Sequences

Aug 08, 2022
Via arXiv

Multi-label Zero-shot Classification by Learning to Transfer from External Knowledge

Jul 31, 2020
Via arXiv