Picture for Jianhao Yuan

Jianhao Yuan

Hidden in Plain Sight: Evaluating Abstract Shape Recognition in Vision-Language Models

Add code
Nov 09, 2024
Viaarxiv icon

SpatialBot: Precise Spatial Understanding with Vision Language Models

Add code
Jun 19, 2024
Viaarxiv icon

kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies

Add code
Apr 15, 2024
Viaarxiv icon

SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model

Add code
Mar 05, 2024
Figure 1 for SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model
Figure 2 for SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model
Figure 3 for SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model
Figure 4 for SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model
Viaarxiv icon

Efficient Multimodal Learning from Data-centric Perspective

Add code
Feb 18, 2024
Viaarxiv icon

RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model

Add code
Feb 16, 2024
Viaarxiv icon

Real-Fake: Effective Training Data Synthesis Through Distribution Matching

Add code
Oct 16, 2023
Viaarxiv icon

Off the Radar: Uncertainty-Aware Radar Place Recognition with Introspective Querying and Map Maintenance

Add code
Jun 21, 2023
Viaarxiv icon

Not Just Pretty Pictures: Text-to-Image Generators Enable Interpretable Interventions for Robust Representations

Add code
Dec 21, 2022
Viaarxiv icon