Yuxin Guo

UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving

Dec 06, 2024
Via arXiv

HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving

Dec 03, 2024
Via arXiv

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding

Oct 07, 2024
Via arXiv

On the Nonlinearity of Layer Normalization

Jun 03, 2024
Via arXiv

CoReS: Orchestrating the Dance of Reasoning and Segmentation

Apr 08, 2024
Via arXiv

Cross Pseudo-Labeling for Semi-Supervised Audio-Visual Source Localization

Mar 05, 2024
Via arXiv

Dual Mean-Teacher: An Unbiased Semi-Supervised Framework for Audio-Visual Source Localization

Mar 05, 2024
Via arXiv

Understanding the Multi-modal Prompts of the Pre-trained Vision-Language Model

Dec 18, 2023
Via arXiv

ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation

Oct 26, 2023
Via arXiv

Data-centric Graph Learning: A Survey

Oct 08, 2023
Via arXiv