Picture for Hengxing Cai

Hengxing Cai

AirNav: A Large-Scale Real-World UAV Vision-and-Language Navigation Dataset with Natural and Diverse Instructions

Add code
Jan 07, 2026
Viaarxiv icon

Interpretable Reward Model via Sparse Autoencoder

Add code
Aug 12, 2025
Viaarxiv icon

MM-R5: MultiModal Reasoning-Enhanced ReRanker via Reinforcement Learning for Document Retrieval

Add code
Jun 14, 2025
Viaarxiv icon

FlightGPT: Towards Generalizable and Interpretable UAV Vision-and-Language Navigation with Vision-Language Models

Add code
May 19, 2025
Viaarxiv icon

Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs

Add code
May 16, 2025
Viaarxiv icon

A Multi-Granularity Retrieval Framework for Visually-Rich Documents

Add code
May 06, 2025
Viaarxiv icon

A Multi-Granularity Multimodal Retrieval Framework for Multimodal Document Tasks

Add code
May 01, 2025
Viaarxiv icon

Intelligent System for Automated Molecular Patent Infringement Assessment

Add code
Dec 10, 2024
Figure 1 for Intelligent System for Automated Molecular Patent Infringement Assessment
Figure 2 for Intelligent System for Automated Molecular Patent Infringement Assessment
Figure 3 for Intelligent System for Automated Molecular Patent Infringement Assessment
Figure 4 for Intelligent System for Automated Molecular Patent Infringement Assessment
Viaarxiv icon

SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding

Add code
Aug 30, 2024
Figure 1 for SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding
Figure 2 for SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding
Figure 3 for SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding
Figure 4 for SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding
Viaarxiv icon

Uni-SMART: Universal Science Multimodal Analysis and Research Transformer

Add code
Mar 15, 2024
Figure 1 for Uni-SMART: Universal Science Multimodal Analysis and Research Transformer
Figure 2 for Uni-SMART: Universal Science Multimodal Analysis and Research Transformer
Figure 3 for Uni-SMART: Universal Science Multimodal Analysis and Research Transformer
Figure 4 for Uni-SMART: Universal Science Multimodal Analysis and Research Transformer
Viaarxiv icon