Picture for Dongzhan Zhou

Dongzhan Zhou

Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning

Add code
Dec 02, 2024
Viaarxiv icon

MolReFlect: Towards In-Context Fine-grained Alignments between Molecules and Texts

Add code
Nov 22, 2024
Viaarxiv icon

A CLIP-Powered Framework for Robust and Generalizable Data Selection

Add code
Oct 15, 2024
Viaarxiv icon

MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses

Add code
Oct 09, 2024
Viaarxiv icon

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Add code
Oct 03, 2024
Viaarxiv icon

ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area

Add code
Aug 16, 2024
Figure 1 for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area
Figure 2 for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area
Figure 3 for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area
Figure 4 for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area
Viaarxiv icon

Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM

Add code
Aug 14, 2024
Figure 1 for Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM
Figure 2 for Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM
Figure 3 for Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM
Figure 4 for Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM
Viaarxiv icon

Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes

Add code
Jul 15, 2024
Viaarxiv icon

Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B

Add code
Jun 11, 2024
Viaarxiv icon

Physical formula enhanced multi-task learning for pharmacokinetics prediction

Add code
Apr 16, 2024
Viaarxiv icon