Stephen Law

MedGNN: Capturing the Links Between Urban Characteristics and Medical Prescriptions

Apr 07, 2025

Minwei Zhao, Sanja Scepanovic, Stephen Law, Daniele Quercia, Ivica Obadic

Abstract:Understanding how urban socio-demographic and environmental factors relate to health is essential for public health and urban planning. However, traditional statistical methods struggle with nonlinear effects, while machine learning models often fail to capture geographical (nearby areas being more similar) and topological (unequal connectivity between places) effects in an interpretable way. To address this, we propose MedGNN, a spatio-topologically explicit framework that constructs a 2-hop spatial graph, integrating positional and locational node embeddings with urban characteristics in a graph neural network. Applied to MEDSAT, a comprehensive dataset covering over 150 environmental and socio-demographic factors and six prescription outcomes (depression, anxiety, diabetes, hypertension, asthma, and opioids) across 4,835 Greater London neighborhoods, MedGNN improved predictions by over 25% on average compared to baseline methods. Using depression prescriptions as a case study, we analyzed graph embeddings via geographical principal component analysis, identifying findings that: align with prior research (e.g., higher antidepressant prescriptions among older and White populations), contribute to ongoing debates (e.g., greenery linked to higher and NO2 to lower prescriptions), and warrant further study (e.g., canopy evaporation correlated with fewer prescriptions). These results demonstrate the potential of MedGNN, and more broadly of carefully applied machine learning, to advance transdisciplinary public health research.

* 12 pages' main content. This is a preprint. Submitted to KDD 2025
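
The abstract mentions a 2-hop spatial graph without saying how it is built. As a rough illustration only, the sketch below assumes neighbourhoods are linked by a plain adjacency list and adds an edge between every pair of areas at most two hops apart. The function name `two_hop_edges` and the toy area labels are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def two_hop_edges(edges):
    """Return undirected edges joining every pair of nodes at most two hops apart."""
    # Build a symmetric neighbour lookup from the 1-hop adjacency list.
    neighbours = defaultdict(set)
    for a, b in edges:
        neighbours[a].add(b)
        neighbours[b].add(a)

    expanded = set()
    for node, first_hop in neighbours.items():
        # Start from direct neighbours, then add the neighbours of each of them.
        reachable = set(first_hop)
        for mid in first_hop:
            reachable |= neighbours[mid]
        reachable.discard(node)  # no self-loops
        for other in reachable:
            # Store each undirected edge once, in sorted order.
            expanded.add((min(node, other), max(node, other)))
    return expanded


# Hypothetical chain of four adjacent areas: A - B - C - D
chain = [("A", "B"), ("B", "C"), ("C", "D")]
print(sorted(two_hop_edges(chain)))
```

In this toy chain, A and D stay unconnected because they are three hops apart, while A and C gain a direct edge.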

Multimodal Contrastive Learning of Urban Space Representations from POI Data

Nov 09, 2024

Xinglei Wang, Tao Cheng, Stephen Law, Zichao Zeng, Lu Yin, Junyuan Liu

Abstract:Existing methods for learning urban space representations from Point-of-Interest (POI) data face several limitations, including issues with geographical delineation, inadequate spatial information modelling, underutilisation of POI semantic attributes, and computational inefficiencies. To address these issues, we propose CaLLiPer (Contrastive Language-Location Pre-training), a novel representation learning model that directly embeds continuous urban spaces into vector representations that can capture the spatial and semantic distribution of the urban environment. This model leverages a multimodal contrastive learning objective, aligning location embeddings with textual POI descriptions, thereby bypassing the need for complex training corpus construction and negative sampling. We validate CaLLiPer's effectiveness by applying it to learning urban space representations in London, UK, where it demonstrates a 5-15% improvement in predictive performance for land use classification and socioeconomic mapping tasks compared to state-of-the-art methods. Visualisations of the learned representations further illustrate our model's advantages in capturing spatial variations in urban semantics with high accuracy and fine resolution. Additionally, CaLLiPer achieves reduced training time, showcasing its efficiency and scalability. This work provides a promising pathway for scalable, semantically rich urban space representation learning that can support the development of geospatial foundation models. The implementation code is available at https://github.com/xlwang233/CaLLiPer.

* 19 pages, 5 figures, 7 tables
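
To make the idea of aligning location embeddings with text embeddings concrete, here is a minimal sketch of a generic symmetric contrastive (InfoNCE-style) loss. It is an assumption that this resembles the objective in the paper; the abstract does not give the formula, and the linked repository is the authoritative source. The function name `symmetric_contrastive_loss` and the `temperature` default are hypothetical. Matched location/text pairs share a row index, and the other rows in the batch serve as contrast cases.

```python
import math


def symmetric_contrastive_loss(loc_emb, text_emb, temperature=0.07):
    """Average of location-to-text and text-to-location cross-entropy losses."""

    def normalise(vec):
        # Scale to unit length so dot products are cosine similarities.
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        return [x / norm for x in vec]

    locs = [normalise(v) for v in loc_emb]
    texts = [normalise(v) for v in text_emb]
    n = len(locs)

    # logits[i][j]: scaled similarity between location i and text j.
    logits = [
        [sum(a * b for a, b in zip(loc, txt)) / temperature for txt in texts]
        for loc in locs
    ]

    def mean_cross_entropy(rows):
        # The correct "class" for row i is column i (the matched pair).
        total = 0.0
        for i, row in enumerate(rows):
            peak = max(row)  # subtract the max for numerical stability
            log_sum = peak + math.log(sum(math.exp(x - peak) for x in row))
            total += log_sum - row[i]
        return total / n

    transposed = [list(col) for col in zip(*logits)]
    return 0.5 * (mean_cross_entropy(logits) + mean_cross_entropy(transposed))


# Toy embeddings (hypothetical): aligned pairs should score a lower loss.
aligned = [[1.0, 0.0], [0.0, 1.0]]
shuffled = [[0.0, 1.0], [1.0, 0.0]]
print(symmetric_contrastive_loss(aligned, aligned))
print(symmetric_contrastive_loss(aligned, shuffled))
```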

Examining the Commitments and Difficulties Inherent in Multimodal Foundation Models for Street View Imagery

Aug 23, 2024

Zhenyuan Yang, Xuhui Lin, Qinyi He, Ziye Huang, Zhengliang Liu, Hanqi Jiang, Peng Shu, Zihao Wu, Yiwei Li, Stephen Law(+3 more)

Abstract:The emergence of Large Language Models (LLMs) and multimodal foundation models (FMs) has generated heightened interest in their applications that integrate vision and language. This paper investigates the capabilities of ChatGPT-4V and Gemini Pro for Street View Imagery, Built Environment, and Interior by evaluating their performance across various tasks. The assessments include street furniture identification, pedestrian and car counts, and road width measurement in Street View Imagery; building function classification, building age analysis, building height analysis, and building structure classification in the Built Environment; and interior room classification, interior design style analysis, interior furniture counts, and interior length measurement in Interior. The results reveal proficiency in length measurement, style analysis, question answering, and basic image understanding, but highlight limitations in detailed recognition and counting tasks. While zero-shot learning shows potential, performance varies depending on the problem domains and image complexities. This study provides new insights into the strengths and weaknesses of multimodal foundation models for practical challenges in Street View Imagery, Built Environment, and Interior. Overall, the findings demonstrate foundational multimodal intelligence, emphasizing the potential of FMs to drive forward interdisciplinary applications at the intersection of computer vision and language.

SMA-Hyper: Spatiotemporal Multi-View Fusion Hypergraph Learning for Traffic Accident Prediction

Jul 24, 2024

Xiaowei Gao, James Haworth, Ilya Ilyankou, Xianghui Zhang, Tao Cheng, Stephen Law, Huanfa Chen

Abstract:Predicting traffic accidents is key to sustainable city management, which requires effectively addressing the dynamic and complex spatiotemporal characteristics of cities. Current data-driven models often struggle with data sparsity and typically overlook the integration of diverse urban data sources and the high-order dependencies within them. Additionally, they frequently rely on predefined topologies or weights, limiting their adaptability in spatiotemporal predictions. To address these issues, we introduce the Spatiotemporal Multiview Adaptive HyperGraph Learning (SMA-Hyper) model, a dynamic deep learning framework designed for traffic accident prediction. Building on previous research, this innovative model incorporates dual adaptive spatiotemporal graph learning mechanisms that enable high-order cross-regional learning through hypergraphs and dynamic adaptation to evolving urban data. It also utilises contrastive learning to enhance global and local data representations in sparse datasets and employs an advanced attention mechanism to fuse multiple views of accident data and urban functional features, thereby enriching the contextual understanding of risk factors. Extensive testing on the London traffic accident dataset demonstrates that the SMA-Hyper model significantly outperforms baseline models across various temporal horizons and multistep outputs, affirming the effectiveness of its multiview fusion and adaptive learning strategies. The interpretability of the results further underscores its potential to improve urban traffic management and safety by leveraging complex spatiotemporal urban data, offering a scalable framework adaptable to diverse urban environments.

Exploring selective image matching methods for zero-shot and few-sample unsupervised domain adaptation of urban canopy prediction

Apr 16, 2024

John Francis, Stephen Law

Abstract:We explore simple methods for adapting a trained multi-task UNet, which predicts canopy cover and height, to a new geographic setting using remotely sensed data, without the need to train a domain-adaptive classifier or perform extensive fine-tuning. Extending previous research, we followed a selective alignment process to identify similar images in the two geographical domains and then tested an array of data-based unsupervised domain adaptation approaches in a zero-shot setting as well as with a small amount of fine-tuning. We find that the selectively aligned data-based image matching methods produce promising results in a zero-shot setting, and even more so with a small amount of fine-tuning. These methods outperform both an untransformed baseline and a popular data-based image-to-image translation model. The best performing methods were pixel distribution adaptation and Fourier domain adaptation on the canopy cover and height tasks respectively.

* ICLR 2024 Machine Learning for Remote Sensing (ML4RS) Workshop

Self-supervised learning unveils change in urban housing from street-level images

Sep 21, 2023

Steven Stalder, Michele Volpi, Nicolas Büttner, Stephen Law, Kenneth Harttgen, Esra Suel

Abstract:Cities around the world face a critical shortage of affordable and decent housing. Despite its critical importance for policy, our ability to effectively monitor and track progress in urban housing is limited. Deep learning-based computer vision methods applied to street-level images have been successful in the measurement of socioeconomic and environmental inequalities but did not fully utilize temporal images to track urban change as time-varying labels are often unavailable. We used self-supervised methods to measure change in London using 15 million street images taken between 2008 and 2021. Our novel adaptation of Barlow Twins, Street2Vec, embeds urban structure while being invariant to seasonal and daily changes without manual annotations. It outperformed generic embeddings, successfully identified point-level change in London's housing supply from street-level images, and distinguished between major and minor change. This capability can provide timely information for urban planning and policy decisions toward more liveable, equitable, and sustainable cities.

* 16 pages, 5 figures
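
Street2Vec is described as an adaptation of Barlow Twins. For readers unfamiliar with that objective, the following is a minimal sketch of the generic Barlow Twins loss on two batches of embeddings; it is not the authors' implementation, and how Street2Vec forms its image pairs is not stated in this listing. The helper names and the `off_diag_weight` default are assumptions made for illustration.

```python
import math


def standardise(batch):
    """Z-score each embedding dimension across the batch (rows are samples)."""
    n, d = len(batch), len(batch[0])
    out = [[0.0] * d for _ in range(n)]
    for j in range(d):
        col = [row[j] for row in batch]
        mean = sum(col) / n
        # Fall back to 1.0 for a constant column to avoid division by zero.
        std = math.sqrt(sum((v - mean) ** 2 for v in col) / n) or 1.0
        for i in range(n):
            out[i][j] = (col[i] - mean) / std
    return out


def barlow_twins_loss(view_a, view_b, off_diag_weight=5e-3):
    """Push the cross-correlation matrix of two embedded views towards identity."""
    za, zb = standardise(view_a), standardise(view_b)
    n, d = len(za), len(za[0])
    loss = 0.0
    for i in range(d):
        for j in range(d):
            # Cross-correlation between dimension i of view A and j of view B.
            c = sum(za[k][i] * zb[k][j] for k in range(n)) / n
            if i == j:
                loss += (1.0 - c) ** 2  # invariance term
            else:
                loss += off_diag_weight * c ** 2  # redundancy-reduction term
    return loss


# Toy embeddings (hypothetical) with two uncorrelated dimensions.
view = [[1.0, 1.0], [1.0, -1.0], [-1.0, 1.0], [-1.0, -1.0]]
swapped = [[b, a] for a, b in view]
print(barlow_twins_loss(view, view))
print(barlow_twins_loss(view, swapped))
```

Identical views with decorrelated dimensions give a loss near zero, while swapping the dimensions of one view moves the correlation off the diagonal and raises the loss.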

Estimating Chicago's tree cover and canopy height using multi-spectral satellite imagery

Dec 09, 2022

John Francis, Stephen Law

Abstract:Information on urban tree canopies is fundamental to mitigating climate change [1] as well as improving quality of life [2]. Urban tree planting initiatives face a lack of up-to-date data about the horizontal and vertical dimensions of the tree canopy in cities. We present a pipeline that utilizes LiDAR data as ground-truth and then trains a multi-task machine learning model to generate reliable estimates of tree cover and canopy height in urban areas using multi-source multi-spectral satellite imagery for the case study of Chicago.

* 4 pages, 4 figures, Submitted to Tackling Climate Change with Machine Learning: workshop at NeurIPS 2022

Jane Jacobs in the Sky: Predicting Urban Vitality with Open Satellite Data

Jan 28, 2021

Sanja Šćepanović, Sagar Joglekar, Stephen Law, Daniele Quercia

Abstract:The presence of people in an urban area throughout the day -- often called 'urban vitality' -- is one of the qualities world-class cities aspire to the most, yet it is one of the hardest to achieve. Back in the 1970s, Jane Jacobs theorized urban vitality and found that there are four conditions required for the promotion of life in cities: diversity of land use, small block sizes, the mix of economic activities, and concentration of people. To build proxies for those four conditions and ultimately test Jane Jacobs's theory at scale, researchers have had to collect both private and public data from a variety of sources, and that took decades. Here we propose the use of one single source of data, which happens to be publicly available: Sentinel-2 satellite imagery. In particular, since the first two conditions (diversity of land use and small block sizes) are visible to the naked eye from satellite imagery, we tested whether we could automatically extract them with a state-of-the-art deep-learning framework and whether, in the end, the extracted features could predict vitality. In six Italian cities for which we had call data records, we found that our framework is able to explain on average 55% of the variance in urban vitality extracted from those records.

Adversarial Perturbations on the Perceptual Ball

Dec 19, 2019

Andrew Elliott, Stephen Law, Chris Russell

Abstract:We present a simple regularisation of Adversarial Perturbations based upon the perceptual loss. While the resulting perturbations remain imperceptible to the human eye, they differ from existing adversarial perturbations in two important regards: (i) our resulting perturbations are semi-sparse, and typically make alterations to objects and regions of interest leaving the background static; (ii) our perturbations do not alter the distribution of data in the image and are undetectable by state-of-the-art methods. As such, this work reinforces the connection between explainable AI and adversarial perturbations. We show the merits of our approach by evaluating on standard explainability benchmarks and by defeating recent tests for detecting adversarial perturbations, substantially decreasing the effectiveness of detecting adversarial perturbations.

* 16 pages, 8 figures

Learning from Discovering: An unsupervised approach to Geographical Knowledge Discovery using street level and street network images

Jun 18, 2019

Stephen Law, Mateo Neira

Abstract:Recent research has shown the increasing use of machine learning methods in geography and urban analytics, primarily to extract features and patterns from spatial and temporal data. Research integrating geographical processes into machine learning models, and leveraging geographical information to better interpret these methods, has been sparse. This research contributes to the latter, where we show how latent variables learned from unsupervised learning methods can be used for geographic knowledge discovery. In particular, we propose a simple and novel approach called Convolutional-PCA (ConvPCA), which is applied to both street-level and street network images to find a set of uncorrelated visual latent responses. The approach allows for meaningful explanations using a combination of geographical and generative visualizations to explore the latent space, and to show how the learned embeddings can be used to predict urban characteristics such as street-level enclosures and street network density.

* 7 pages, 13 figures
