Picture for Junjie Hu

Junjie Hu

Fudan university

RiskCueBench: Benchmarking Anticipatory Reasoning from Early Risk Cues in Video-Language Models

Add code
Jan 06, 2026
Viaarxiv icon

MMGR: Multi-Modal Generative Reasoning

Add code
Dec 17, 2025
Figure 1 for MMGR: Multi-Modal Generative Reasoning
Figure 2 for MMGR: Multi-Modal Generative Reasoning
Figure 3 for MMGR: Multi-Modal Generative Reasoning
Figure 4 for MMGR: Multi-Modal Generative Reasoning
Viaarxiv icon

Class Incremental Medical Image Segmentation via Prototype-Guided Calibration and Dual-Aligned Distillation

Add code
Nov 11, 2025
Figure 1 for Class Incremental Medical Image Segmentation via Prototype-Guided Calibration and Dual-Aligned Distillation
Figure 2 for Class Incremental Medical Image Segmentation via Prototype-Guided Calibration and Dual-Aligned Distillation
Figure 3 for Class Incremental Medical Image Segmentation via Prototype-Guided Calibration and Dual-Aligned Distillation
Figure 4 for Class Incremental Medical Image Segmentation via Prototype-Guided Calibration and Dual-Aligned Distillation
Viaarxiv icon

PETAR: Localized Findings Generation with Mask-Aware Vision-Language Modeling for PET Automated Reporting

Add code
Oct 31, 2025
Figure 1 for PETAR: Localized Findings Generation with Mask-Aware Vision-Language Modeling for PET Automated Reporting
Figure 2 for PETAR: Localized Findings Generation with Mask-Aware Vision-Language Modeling for PET Automated Reporting
Figure 3 for PETAR: Localized Findings Generation with Mask-Aware Vision-Language Modeling for PET Automated Reporting
Figure 4 for PETAR: Localized Findings Generation with Mask-Aware Vision-Language Modeling for PET Automated Reporting
Viaarxiv icon

V-SEAM: Visual Semantic Editing and Attention Modulating for Causal Interpretability of Vision-Language Models

Add code
Sep 18, 2025
Viaarxiv icon

LLM Hallucination Detection: A Fast Fourier Transform Method Based on Hidden Layer Temporal Signals

Add code
Sep 16, 2025
Figure 1 for LLM Hallucination Detection: A Fast Fourier Transform Method Based on Hidden Layer Temporal Signals
Figure 2 for LLM Hallucination Detection: A Fast Fourier Transform Method Based on Hidden Layer Temporal Signals
Figure 3 for LLM Hallucination Detection: A Fast Fourier Transform Method Based on Hidden Layer Temporal Signals
Figure 4 for LLM Hallucination Detection: A Fast Fourier Transform Method Based on Hidden Layer Temporal Signals
Viaarxiv icon

Exploiting Unlabeled Structures through Task Consistency Training for Versatile Medical Image Segmentation

Add code
Sep 05, 2025
Figure 1 for Exploiting Unlabeled Structures through Task Consistency Training for Versatile Medical Image Segmentation
Figure 2 for Exploiting Unlabeled Structures through Task Consistency Training for Versatile Medical Image Segmentation
Figure 3 for Exploiting Unlabeled Structures through Task Consistency Training for Versatile Medical Image Segmentation
Figure 4 for Exploiting Unlabeled Structures through Task Consistency Training for Versatile Medical Image Segmentation
Viaarxiv icon

Scaling Up Audio-Synchronized Visual Animation: An Efficient Training Paradigm

Add code
Aug 05, 2025
Viaarxiv icon

R-KV: Redundancy-aware KV Cache Compression for Training-Free Reasoning Models Acceleration

Add code
May 30, 2025
Viaarxiv icon

VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection

Add code
May 26, 2025
Figure 1 for VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
Figure 2 for VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
Figure 3 for VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
Figure 4 for VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
Viaarxiv icon