Picture for Linli Xu

Linli Xu

BASIC: Boosting Visual Alignment with Intrinsic Refined Embeddings in Multimodal Large Language Models

Add code
Aug 09, 2025
Viaarxiv icon

CROP: Integrating Topological and Spatial Structures via Cross-View Prefixes for Molecular LLMs

Add code
Aug 09, 2025
Viaarxiv icon

Unifying Continuous and Discrete Text Diffusion with Non-simultaneous Diffusion Processes

Add code
May 28, 2025
Viaarxiv icon

AdPO: Enhancing the Adversarial Robustness of Large Vision-Language Models with Preference Optimization

Add code
Apr 02, 2025
Figure 1 for AdPO: Enhancing the Adversarial Robustness of Large Vision-Language Models with Preference Optimization
Figure 2 for AdPO: Enhancing the Adversarial Robustness of Large Vision-Language Models with Preference Optimization
Figure 3 for AdPO: Enhancing the Adversarial Robustness of Large Vision-Language Models with Preference Optimization
Figure 4 for AdPO: Enhancing the Adversarial Robustness of Large Vision-Language Models with Preference Optimization
Viaarxiv icon

ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning

Add code
Mar 13, 2025
Viaarxiv icon

Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images

Add code
Feb 23, 2025
Figure 1 for Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images
Figure 2 for Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images
Figure 3 for Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images
Figure 4 for Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images
Viaarxiv icon

Addressing Representation Collapse in Vector Quantized Models with One Linear Layer

Add code
Nov 04, 2024
Figure 1 for Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Figure 2 for Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Figure 3 for Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Figure 4 for Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Viaarxiv icon

Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective

Add code
Oct 16, 2024
Figure 1 for Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
Figure 2 for Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
Figure 3 for Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
Figure 4 for Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
Viaarxiv icon

Break the Visual Perception: Adversarial Attacks Targeting Encoded Visual Tokens of Large Vision-Language Models

Add code
Oct 09, 2024
Figure 1 for Break the Visual Perception: Adversarial Attacks Targeting Encoded Visual Tokens of Large Vision-Language Models
Figure 2 for Break the Visual Perception: Adversarial Attacks Targeting Encoded Visual Tokens of Large Vision-Language Models
Figure 3 for Break the Visual Perception: Adversarial Attacks Targeting Encoded Visual Tokens of Large Vision-Language Models
Figure 4 for Break the Visual Perception: Adversarial Attacks Targeting Encoded Visual Tokens of Large Vision-Language Models
Viaarxiv icon

Video In-context Learning

Add code
Jul 10, 2024
Viaarxiv icon