Picture for Yunhai Tong

Yunhai Tong

Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs

Add code
Jan 08, 2025
Viaarxiv icon

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Add code
Jan 07, 2025
Figure 1 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Figure 2 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Figure 3 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Figure 4 for Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos
Viaarxiv icon

Training-free Heterogeneous Graph Condensation via Data Selection

Add code
Dec 20, 2024
Viaarxiv icon

Towards Scalable and Deep Graph Neural Networks via Noise Masking

Add code
Dec 19, 2024
Figure 1 for Towards Scalable and Deep Graph Neural Networks via Noise Masking
Figure 2 for Towards Scalable and Deep Graph Neural Networks via Noise Masking
Figure 3 for Towards Scalable and Deep Graph Neural Networks via Noise Masking
Figure 4 for Towards Scalable and Deep Graph Neural Networks via Noise Masking
Viaarxiv icon

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Add code
Dec 10, 2024
Figure 1 for DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
Figure 2 for DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
Figure 3 for DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
Figure 4 for DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
Viaarxiv icon

RelationBooth: Towards Relation-Aware Customized Object Generation

Add code
Oct 30, 2024
Viaarxiv icon

MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning

Add code
Oct 15, 2024
Figure 1 for MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning
Figure 2 for MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning
Figure 3 for MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning
Figure 4 for MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning
Viaarxiv icon

RLRF4Rec: Reinforcement Learning from Recsys Feedback for Enhanced Recommendation Reranking

Add code
Oct 08, 2024
Viaarxiv icon

You Can't Ignore Either: Unifying Structure and Feature Denoising for Robust Graph Learning

Add code
Aug 01, 2024
Viaarxiv icon

LLAVADI: What Matters For Multimodal Large Language Models Distillation

Add code
Jul 28, 2024
Viaarxiv icon