Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhenlong Li

SIM: A mapping framework for built environment auditing based on street view imagery

May 29, 2025

Huan Ning, Zhenlong Li, Manzhu Yu, Wenpeng Yin

Abstract:Built environment auditing refers to the systematic documentation and assessment of urban and rural spaces' physical, social, and environmental characteristics, such as walkability, road conditions, and traffic lights. It is used to collect data for the evaluation of how built environments impact human behavior, health, mobility, and overall urban functionality. Traditionally, built environment audits were conducted using field surveys and manual observations, which were time-consuming and costly. The emerging street view imagery, e.g., Google Street View, has become a widely used data source for conducting built environment audits remotely. Deep learning and computer vision techniques can extract and classify objects from street images to enhance auditing productivity. Before meaningful analysis, the detected objects need to be geospatially mapped for accurate documentation. However, the mapping methods and tools based on street images are underexplored, and there are no universal frameworks or solutions yet, imposing difficulties in auditing the street objects. In this study, we introduced an open source street view mapping framework, providing three pipelines to map and measure: 1) width measurement for ground objects, such as roads; 2) 3D localization for objects with a known dimension (e.g., doors and stop signs); and 3) diameter measurements (e.g., street trees). These pipelines can help researchers, urban planners, and other professionals automatically measure and map target objects, promoting built environment auditing productivity and accuracy. Three case studies, including road width measurement, stop sign localization, and street tree diameter measurement, are provided in this paper to showcase pipeline usage.

Via

Access Paper or Ask Questions

GIScience in the Era of Artificial Intelligence: A Research Agenda Towards Autonomous GIS

Apr 01, 2025

Zhenlong Li, Huan Ning, Song Gao, Krzysztof Janowicz, Wenwen Li, Samantha T. Arundel, Chaowei Yang, Budhendra Bhaduri, Shaowen Wang, A-Xing Zhu(+6 more)

Abstract:The advent of generative AI exemplified by large language models (LLMs) opens new ways to represent and compute geographic information and transcend the process of geographic knowledge production, driving geographic information systems (GIS) towards autonomous GIS. Leveraging LLMs as the decision core, autonomous GIS can independently generate and execute geoprocessing workflows to perform spatial analysis. In this vision paper, we elaborate on the concept of autonomous GIS and present a framework that defines its five autonomous goals, five levels of autonomy, five core functions, and three operational scales. We demonstrate how autonomous GIS could perform geospatial data retrieval, spatial analysis, and map making with four proof-of-concept GIS agents. We conclude by identifying critical challenges and future research directions, including fine-tuning and self-growing decision cores, autonomous modeling, and examining the ethical and practical implications of autonomous GIS. By establishing the groundwork for a paradigm shift in GIScience, this paper envisions a future where GIS moves beyond traditional workflows to autonomously reason, derive, innovate, and advance solutions to pressing global challenges.

Via

Access Paper or Ask Questions

GIS Copilot: Towards an Autonomous GIS Agent for Spatial Analysis

Nov 05, 2024

Temitope Akinboyewa, Zhenlong Li, Huan Ning, M. Naser Lessani

Abstract:Recent advancements in Generative AI offer promising capabilities for spatial analysis. Despite their potential, the integration of generative AI with established GIS platforms remains underexplored. In this study, we propose a framework for integrating LLMs directly into existing GIS platforms, using QGIS as an example. Our approach leverages the reasoning and programming capabilities of LLMs to autonomously generate spatial analysis workflows and code through an informed agent that has comprehensive documentation of key GIS tools and parameters. The implementation of this framework resulted in the development of a "GIS Copilot" that allows GIS users to interact with QGIS using natural language commands for spatial analysis. The GIS Copilot was evaluated based on three complexity levels: basic tasks that require one GIS tool and typically involve one data layer to perform simple operations; intermediate tasks involving multi-step processes with multiple tools, guided by user instructions; and advanced tasks which involve multi-step processes that require multiple tools but not guided by user instructions, necessitating the agent to independently decide on and executes the necessary steps. The evaluation reveals that the GIS Copilot demonstrates strong potential in automating foundational GIS operations, with a high success rate in tool selection and code generation for basic and intermediate tasks, while challenges remain in achieving full autonomy for more complex tasks. This study contributes to the emerging vision of Autonomous GIS, providing a pathway for non-experts to engage with geospatial analysis with minimal prior expertise. While full autonomy is yet to be achieved, the GIS Copilot demonstrates significant potential for simplifying GIS workflows and enhancing decision-making processes.

Via

Access Paper or Ask Questions

Beyond Words: Evaluating Large Language Models in Transportation Planning

Sep 22, 2024

Shaowei Ying, Zhenlong Li, Manzhu Yu

Figure 1 for Beyond Words: Evaluating Large Language Models in Transportation Planning

Figure 2 for Beyond Words: Evaluating Large Language Models in Transportation Planning

Figure 3 for Beyond Words: Evaluating Large Language Models in Transportation Planning

Figure 4 for Beyond Words: Evaluating Large Language Models in Transportation Planning

Abstract:The resurgence and rapid advancement of Generative Artificial Intelligence (GenAI) in 2023 has catalyzed transformative shifts across numerous industry sectors, including urban transportation and logistics. This study investigates the evaluation of Large Language Models (LLMs), specifically GPT-4 and Phi-3-mini, to enhance transportation planning. The study assesses the performance and spatial comprehension of these models through a transportation-informed evaluation framework that includes general geospatial skills, general transportation domain skills, and real-world transportation problem-solving. Utilizing a mixed-methods approach, the research encompasses an evaluation of the LLMs' general Geographic Information System (GIS) skills, general transportation domain knowledge as well as abilities to support human decision-making in the real-world transportation planning scenarios of congestion pricing. Results indicate that GPT-4 demonstrates superior accuracy and reliability across various GIS and transportation-specific tasks compared to Phi-3-mini, highlighting its potential as a robust tool for transportation planners. Nonetheless, Phi-3-mini exhibits competence in specific analytical scenarios, suggesting its utility in resource-constrained environments. The findings underscore the transformative potential of GenAI technologies in urban transportation planning. Future work could explore the application of newer LLMs and the impact of Retrieval-Augmented Generation (RAG) techniques, on a broader set of real-world transportation planning and operations challenges, to deepen the integration of advanced AI models in transportation management practices.

Via

Access Paper or Ask Questions

Automated Floodwater Depth Estimation Using Large Multimodal Model for Rapid Flood Mapping

Feb 26, 2024

Temitope Akinboyewa, Huan Ning, M. Naser Lessani, Zhenlong Li

Abstract:Information on the depth of floodwater is crucial for rapid mapping of areas affected by floods. However, previous approaches for estimating floodwater depth, including field surveys, remote sensing, and machine learning techniques, can be time-consuming and resource-intensive. This paper presents an automated and fast approach for estimating floodwater depth from on-site flood photos. A pre-trained large multimodal model, GPT-4 Vision, was used specifically for estimating floodwater. The input data were flooding photos that contained referenced objects, such as street signs, cars, people, and buildings. Using the heights of the common objects as references, the model returned the floodwater depth as the output. Results show that the proposed approach can rapidly provide a consistent and reliable estimation of floodwater depth from flood photos. Such rapid estimation is transformative in flood inundation mapping and assessing the severity of the flood in near-real time, which is essential for effective flood response strategies.

Via

Access Paper or Ask Questions

Autonomous GIS: the next-generation AI-powered GIS

May 20, 2023

Zhenlong Li, Huan Ning

Abstract:Large Language Models (LLMs), such as ChatGPT, demonstrate a strong understanding of human natural language and have been explored and applied in various fields, including reasoning, creative writing, code generation, translation, and information retrieval. By adopting LLM as the reasoning core, we introduce Autonomous GIS (AutoGIS) as an AI-powered geographic information system (GIS) that leverages the LLM's general abilities in natural language understanding, reasoning and coding for addressing spatial problems with automatic spatial data collection, analysis and visualization. We envision that autonomous GIS will need to achieve five autonomous goals including self-generating, self-organizing, self-verifying, self-executing, and self-growing. We developed a prototype system called LLM-Geo using the GPT-4 API in a Python environment, demonstrating what an autonomous GIS looks like and how it delivers expected results without human intervention using three case studies. For all case studies, LLM-Geo was able to return accurate results, including aggregated numbers, graphs, and maps, significantly reducing manual operation time. Although still in its infancy and lacking several important modules such as logging and code testing , LLM-Geo demonstrates a potential path towards next-generation AI-powered GIS. We advocate for the GIScience community to dedicate more effort to the research and development of autonomous GIS, making spatial analysis easier, faster, and more accessible to a broader audience.

Via

Access Paper or Ask Questions

An optimal sensors-based simulation method for spatiotemporal event detection

Aug 16, 2022

Yuqin Jiang, Andrey A. Popov, Zhenlong Li, Michael E. Hodgson

Figure 1 for An optimal sensors-based simulation method for spatiotemporal event detection

Figure 2 for An optimal sensors-based simulation method for spatiotemporal event detection

Figure 3 for An optimal sensors-based simulation method for spatiotemporal event detection

Figure 4 for An optimal sensors-based simulation method for spatiotemporal event detection

Abstract:Human movements in urban areas are essential for understanding the human-environment interactions. However, activities and associated movements are full of uncertainties due to the complexity of a city. In this paper, we propose an optimal sensors-based simulation method for spatiotemporal event detection using human activity signals derived from taxi trip data. A sensor here is an abstract concept such that only the true observation data at the sensor location will be treated as known data for the simulation. Specifically, we first identify the optimal number of sensors and their locations that have the strongest correlation with the whole dataset. The observation data points from these sensors are then used to simulate a regular, uneventful scenario using the Discrete Empirical Interpolation Method. By comparing the simulated and observation scenarios, events are extracted both spatially and temporally. We apply this method in New York City with taxi trip records data. Results show that this method is effective in detecting when and where events occur.

Via

Access Paper or Ask Questions

Simulating multi-exit evacuation using deep reinforcement learning

Jul 11, 2020

Dong Xu, Xiao Huang, Joseph Mango, Xiang Li, Zhenlong Li

Figure 1 for Simulating multi-exit evacuation using deep reinforcement learning

Figure 2 for Simulating multi-exit evacuation using deep reinforcement learning

Figure 3 for Simulating multi-exit evacuation using deep reinforcement learning

Figure 4 for Simulating multi-exit evacuation using deep reinforcement learning

Abstract:Conventional simulations on multi-exit indoor evacuation focus primarily on how to determine a reasonable exit based on numerous factors in a changing environment. Results commonly include some congested and other under-utilized exits, especially with massive pedestrians. We propose a multi-exit evacuation simulation based on Deep Reinforcement Learning (DRL), referred to as the MultiExit-DRL, which involves in a Deep Neural Network (DNN) framework to facilitate state-to-action mapping. The DNN framework applies Rainbow Deep Q-Network (DQN), a DRL algorithm that integrates several advanced DQN methods, to improve data utilization and algorithm stability, and further divides the action space into eight isometric directions for possible pedestrian choices. We compare MultiExit-DRL with two conventional multi-exit evacuation simulation models in three separate scenarios: 1) varying pedestrian distribution ratios, 2) varying exit width ratios, and 3) varying open schedules for an exit. The results show that MultiExit-DRL presents great learning efficiency while reducing the total number of evacuation frames in all designed experiments. In addition, the integration of DRL allows pedestrians to explore other potential exits and helps determine optimal directions, leading to the high efficiency of exit utilization.

* 25 pages, 5 figures, submitted to Transactions in GIS

Via

Access Paper or Ask Questions

Translating multispectral imagery to nighttime imagery via conditional generative adversarial networks

Dec 28, 2019

Xiao Huang, Dong Xu, Zhenlong Li, Cuizhen Wang

Figure 1 for Translating multispectral imagery to nighttime imagery via conditional generative adversarial networks

Figure 2 for Translating multispectral imagery to nighttime imagery via conditional generative adversarial networks

Figure 3 for Translating multispectral imagery to nighttime imagery via conditional generative adversarial networks

Figure 4 for Translating multispectral imagery to nighttime imagery via conditional generative adversarial networks

Abstract:Nighttime satellite imagery has been applied in a wide range of fields. However, our limited understanding of how observed light intensity is formed and whether it can be simulated greatly hinders its further application. This study explores the potential of conditional Generative Adversarial Networks (cGAN) in translating multispectral imagery to nighttime imagery. A popular cGAN framework, pix2pix, was adopted and modified to facilitate this translation using gridded training image pairs derived from Landsat 8 and Visible Infrared Imaging Radiometer Suite (VIIRS). The results of this study prove the possibility of multispectral-to-nighttime translation and further indicate that, with the additional social media data, the generated nighttime imagery can be very similar to the ground-truth imagery. This study fills the gap in understanding the composition of satellite observed nighttime light and provides new paradigms to solve the emerging problems in nighttime remote sensing fields, including nighttime series construction, light desaturation, and multi-sensor calibration.

* 4 pages, 3 figures, submitted to the 2020 IEEE International Geoscience and Remote Sensing Symposium

Via

Access Paper or Ask Questions