Picture for Bingchen Zhao

Bingchen Zhao

Michael Pokorny

Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis

Add code
Mar 13, 2025
Viaarxiv icon

"Principal Components" Enable A New Language of Images

Add code
Mar 11, 2025
Viaarxiv icon

A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning

Add code
Mar 10, 2025
Viaarxiv icon

Humanity's Last Exam

Add code
Jan 24, 2025
Viaarxiv icon

Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability

Add code
Dec 24, 2024
Figure 1 for Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
Figure 2 for Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
Figure 3 for Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
Figure 4 for Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
Viaarxiv icon

CLIPS: An Enhanced CLIP Framework for Learning with Synthetic Captions

Add code
Nov 25, 2024
Figure 1 for CLIPS: An Enhanced CLIP Framework for Learning with Synthetic Captions
Figure 2 for CLIPS: An Enhanced CLIP Framework for Learning with Synthetic Captions
Figure 3 for CLIPS: An Enhanced CLIP Framework for Learning with Synthetic Captions
Figure 4 for CLIPS: An Enhanced CLIP Framework for Learning with Synthetic Captions
Viaarxiv icon

AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation

Add code
Oct 11, 2024
Figure 1 for AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation
Figure 2 for AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation
Figure 3 for AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation
Figure 4 for AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation
Viaarxiv icon

Contextuality Helps Representation Learning for Generalized Category Discovery

Add code
Jul 29, 2024
Viaarxiv icon

PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery

Add code
Jul 26, 2024
Viaarxiv icon

Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning

Add code
Jun 18, 2024
Figure 1 for Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning
Figure 2 for Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning
Figure 3 for Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning
Figure 4 for Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning
Viaarxiv icon