Picture for Jungong Han

Jungong Han

Cognitive Pivot Points and Visual Anchoring: Unveiling and Rectifying Hallucinations in Multimodal Reasoning Models

Add code
Apr 11, 2026
Viaarxiv icon

Re-Prompting SAM 3 via Object Retrieval: 3rd of the 5th PVUW MOSE Track

Add code
Mar 24, 2026
Viaarxiv icon

Mostly Text, Smart Visuals: Asymmetric Text-Visual Pruning for Large Vision-Language Models

Add code
Mar 16, 2026
Viaarxiv icon

Show Me When and Where: Towards Referring Video Object Segmentation in the Wild

Add code
Mar 15, 2026
Viaarxiv icon

Improving Anomaly Detection with Foundation-Model Synthesis and Wavelet-Domain Attention

Add code
Mar 03, 2026
Viaarxiv icon

ProGIC: Progressive and Lightweight Generative Image Compression with Residual Vector Quantization

Add code
Mar 03, 2026
Viaarxiv icon

Controllable Exploration in Hybrid-Policy RLVR for Multi-Modal Reasoning

Add code
Feb 22, 2026
Viaarxiv icon

SAM-Body4D: Training-Free 4D Human Body Mesh Recovery from Videos

Add code
Dec 09, 2025
Viaarxiv icon

PruneHal: Reducing Hallucinations in Multi-modal Large Language Models through Adaptive KV Cache Pruning

Add code
Oct 22, 2025
Viaarxiv icon

Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model

Add code
Sep 09, 2025
Figure 1 for Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Figure 2 for Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Figure 3 for Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Figure 4 for Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Viaarxiv icon