Naoto Yokoya

Experience-Driven Multi-Agent Systems Are Training-free Context-aware Earth Observers

Jan 30, 2026

The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents

Jan 12, 2026

Geo3DVQA: Evaluating Vision-Language Models for 3D Geospatial Reasoning from Aerial Imagery

Dec 21, 2025

Hyperspectral Imaging

Aug 11, 2025

DynamicVL: Benchmarking Multimodal Large Language Models for Dynamic City Understanding

May 27, 2025

DisasterM3: A Remote Sensing Vision-Language Dataset for Disaster Damage Assessment and Response

May 27, 2025

Seeing is Believing, but How Much? A Comprehensive Analysis of Verbalized Calibration in Vision-Language Models

May 26, 2025

Enhancing Monocular Height Estimation via Sparse LiDAR-Guided Correction

May 11, 2025

Joint Super-Resolution and Segmentation for 1-m Impervious Surface Area Mapping in China's Yangtze River Economic Belt

May 8, 2025

SARLANG-1M: A Benchmark for Vision-Language Modeling in SAR Image Understanding

Apr 4, 2025