Picture for Chenghao Zhang

Chenghao Zhang

University of California, Irvine

NVIDIA Nemotron 3: Efficient and Open Intelligence

Add code
Dec 24, 2025
Viaarxiv icon

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Add code
Dec 23, 2025
Viaarxiv icon

S-BEVLoc: BEV-based Self-supervised Framework for Large-scale LiDAR Global Localization

Add code
Sep 11, 2025
Viaarxiv icon

Re-ranking Reasoning Context with Tree Search Makes Large Vision-Language Models Stronger

Add code
Jun 09, 2025
Viaarxiv icon

CoL3D: Collaborative Learning of Single-view Depth and Camera Intrinsics for Metric 3D Shape Recovery

Add code
Feb 13, 2025
Figure 1 for CoL3D: Collaborative Learning of Single-view Depth and Camera Intrinsics for Metric 3D Shape Recovery
Figure 2 for CoL3D: Collaborative Learning of Single-view Depth and Camera Intrinsics for Metric 3D Shape Recovery
Figure 3 for CoL3D: Collaborative Learning of Single-view Depth and Camera Intrinsics for Metric 3D Shape Recovery
Figure 4 for CoL3D: Collaborative Learning of Single-view Depth and Camera Intrinsics for Metric 3D Shape Recovery
Viaarxiv icon

Progressive Multimodal Reasoning via Active Retrieval

Add code
Dec 19, 2024
Figure 1 for Progressive Multimodal Reasoning via Active Retrieval
Figure 2 for Progressive Multimodal Reasoning via Active Retrieval
Figure 3 for Progressive Multimodal Reasoning via Active Retrieval
Figure 4 for Progressive Multimodal Reasoning via Active Retrieval
Viaarxiv icon

InterNet: Unsupervised Cross-modal Homography Estimation Based on Interleaved Modality Transfer and Self-supervised Homography Prediction

Add code
Sep 26, 2024
Figure 1 for InterNet: Unsupervised Cross-modal Homography Estimation Based on Interleaved Modality Transfer and Self-supervised Homography Prediction
Figure 2 for InterNet: Unsupervised Cross-modal Homography Estimation Based on Interleaved Modality Transfer and Self-supervised Homography Prediction
Figure 3 for InterNet: Unsupervised Cross-modal Homography Estimation Based on Interleaved Modality Transfer and Self-supervised Homography Prediction
Figure 4 for InterNet: Unsupervised Cross-modal Homography Estimation Based on Interleaved Modality Transfer and Self-supervised Homography Prediction
Viaarxiv icon

AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization

Add code
Jul 11, 2024
Figure 1 for AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization
Figure 2 for AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization
Figure 3 for AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization
Figure 4 for AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization
Viaarxiv icon

Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation

Add code
Jun 26, 2024
Viaarxiv icon

FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research

Add code
May 22, 2024
Viaarxiv icon