Picture for Bangzheng Li

Bangzheng Li

Reinforced Attention Learning

Add code
Feb 04, 2026
Viaarxiv icon

Unbiased Visual Reasoning with Controlled Visual Inputs

Add code
Dec 19, 2025
Viaarxiv icon

QLIP: A Dynamic Quadtree Vision Prior Enhances MLLM Performance Without Retraining

Add code
May 29, 2025
Viaarxiv icon

Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models

Add code
May 26, 2025
Figure 1 for Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models
Figure 2 for Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models
Figure 3 for Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models
Figure 4 for Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models
Viaarxiv icon

Semantic-Clipping: Efficient Vision-Language Modeling with Semantic-Guidedd Visual Selection

Add code
Mar 14, 2025
Viaarxiv icon

MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design

Add code
Dec 20, 2024
Figure 1 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Figure 2 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Figure 3 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Figure 4 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Viaarxiv icon

FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation

Add code
Jun 17, 2024
Figure 1 for FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation
Figure 2 for FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation
Figure 3 for FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation
Figure 4 for FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation
Viaarxiv icon

Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space

Add code
May 26, 2024
Figure 1 for Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space
Figure 2 for Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space
Figure 3 for Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space
Figure 4 for Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space
Viaarxiv icon

Red Teaming Language Models for Contradictory Dialogues

Add code
May 17, 2024
Figure 1 for Red Teaming Language Models for Contradictory Dialogues
Figure 2 for Red Teaming Language Models for Contradictory Dialogues
Figure 3 for Red Teaming Language Models for Contradictory Dialogues
Figure 4 for Red Teaming Language Models for Contradictory Dialogues
Viaarxiv icon

BLINK: Multimodal Large Language Models Can See but Not Perceive

Add code
Apr 18, 2024
Figure 1 for BLINK: Multimodal Large Language Models Can See but Not Perceive
Figure 2 for BLINK: Multimodal Large Language Models Can See but Not Perceive
Figure 3 for BLINK: Multimodal Large Language Models Can See but Not Perceive
Figure 4 for BLINK: Multimodal Large Language Models Can See but Not Perceive
Viaarxiv icon