Xiaolong Jiang

WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs

Feb 06, 2025
Via arXiv

LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant

Dec 02, 2024
Via arXiv

P4Q: Learning to Prompt for Quantization in Visual-language Models

Sep 26, 2024
Via arXiv

Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective

Aug 13, 2024
Via arXiv

VISA: Reasoning Video Object Segmentation via Large Language Models

Jul 16, 2024
Via arXiv

A Sanity Check for AI-generated Image Detection

Jun 27, 2024
Via arXiv

Mining Open Semantics from CLIP: A Relation Transition Perspective for Few-Shot Learning

Jun 17, 2024
Via arXiv

RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation Model

Mar 12, 2024
Via arXiv

Controllable Mind Visual Diffusion Model

May 18, 2023
Via arXiv

PiClick: Picking the desired mask in click-based interactive segmentation

Apr 23, 2023
Via arXiv