Picture for Guanqun Wang

Guanqun Wang

MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception

Add code
Jun 22, 2024
Figure 1 for MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception
Figure 2 for MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception
Figure 3 for MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception
Figure 4 for MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception
Viaarxiv icon

Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation

Add code
May 27, 2024
Viaarxiv icon

Cloud-Device Collaborative Learning for Multimodal Large Language Models

Add code
Dec 26, 2023
Figure 1 for Cloud-Device Collaborative Learning for Multimodal Large Language Models
Figure 2 for Cloud-Device Collaborative Learning for Multimodal Large Language Models
Figure 3 for Cloud-Device Collaborative Learning for Multimodal Large Language Models
Figure 4 for Cloud-Device Collaborative Learning for Multimodal Large Language Models
Viaarxiv icon

Consecutive Pretraining: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain

Add code
Jul 08, 2022
Figure 1 for Consecutive Pretraining: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain
Figure 2 for Consecutive Pretraining: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain
Figure 3 for Consecutive Pretraining: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain
Figure 4 for Consecutive Pretraining: A Knowledge Transfer Learning Strategy with Relevant Unlabeled Data for Remote Sensing Domain
Viaarxiv icon