Jian Zhang

A Foundation Model for Chest X-ray Interpretation with Grounded Reasoning via Online Reinforcement Learning

Sep 04, 2025

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Aug 21, 2025

Towards Perfection: Building Inter-component Mutual Correction for Retinex-based Low-light Image Enhancement

Aug 12, 2025

Correspondence as Video: Test-Time Adaption on SAM2 for Reference Segmentation in the Wild

Aug 11, 2025

Unsupervised Part Discovery via Descriptor-Based Masked Image Restoration with Optimized Constraints

Jul 16, 2025

SWinMamba: Serpentine Window State Space Model for Vascular Segmentation

Jul 02, 2025

AssetDropper: Asset Extraction via Diffusion Models with Reward-Driven Optimization

Jun 06, 2025

Pts3D-LLM: Studying the Impact of Token Structure for 3D Scene Understanding With Large Language Models

Jun 06, 2025

Unleashing the Power of Intermediate Domains for Mixed Domain Semi-Supervised Medical Image Segmentation

May 30, 2025

LAFR: Efficient Diffusion-based Blind Face Restoration via Latent Codebook Alignment Adapter

May 29, 2025