Picture for Qianru Sun

Qianru Sun

Unified Generative and Discriminative Training for Multi-modal Large Language Models

Add code
Nov 01, 2024
Figure 1 for Unified Generative and Discriminative Training for Multi-modal Large Language Models
Figure 2 for Unified Generative and Discriminative Training for Multi-modal Large Language Models
Figure 3 for Unified Generative and Discriminative Training for Multi-modal Large Language Models
Figure 4 for Unified Generative and Discriminative Training for Multi-modal Large Language Models
Viaarxiv icon

Reverse Modeling in Large Language Models

Add code
Oct 13, 2024
Viaarxiv icon

Towards Natural Image Matting in the Wild via Real-Scenario Prior

Add code
Oct 09, 2024
Figure 1 for Towards Natural Image Matting in the Wild via Real-Scenario Prior
Figure 2 for Towards Natural Image Matting in the Wild via Real-Scenario Prior
Figure 3 for Towards Natural Image Matting in the Wild via Real-Scenario Prior
Figure 4 for Towards Natural Image Matting in the Wild via Real-Scenario Prior
Viaarxiv icon

Frame-Voyager: Learning to Query Frames for Video Large Language Models

Add code
Oct 07, 2024
Viaarxiv icon

Learning De-Biased Representations for Remote-Sensing Imagery

Add code
Oct 06, 2024
Viaarxiv icon

Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration

Add code
Sep 30, 2024
Viaarxiv icon

LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement

Add code
Jun 29, 2024
Viaarxiv icon

In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation

Add code
Apr 15, 2024
Viaarxiv icon

Unleashing Network Potentials for Semantic Scene Completion

Add code
Mar 14, 2024
Viaarxiv icon

Few-shot Learner Parameterization by Diffusion Time-steps

Add code
Mar 05, 2024
Viaarxiv icon