Quan Wang

Arden

Improving Few-Shot Change Detection Visual Question Answering via Decision-Ambiguity-guided Reinforcement Fine-Tuning

Dec 31, 2025

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Dec 22, 2025

Anatomical Region-Guided Contrastive Decoding: A Plug-and-Play Strategy for Mitigating Hallucinations in Medical VLMs

Dec 19, 2025

CheXPO-v2: Preference Optimization for Chest X-ray VLMs with Knowledge Graph Consistency

Dec 19, 2025

Scaling Spatial Intelligence with Multimodal Foundation Models

Nov 17, 2025

In-Token Rationality Optimization: Towards Accurate and Concise LLM Reasoning via Self-Feedback

Nov 13, 2025

State-of-the-Art Dysarthric Speech Recognition with MetaICL for on-the-fly Personalization

Sep 19, 2025

Has GPT-5 Achieved Spatial Intelligence? An Empirical Study

Aug 18, 2025

CheXPO: Preference Optimization for Chest X-ray VLMs with Counterfactual Rationale

Jul 09, 2025

MST-Distill: Mixture of Specialized Teachers for Cross-Modal Knowledge Distillation

Jul 09, 2025