Yijun Lin

Hyper-Local Deformable Transformers for Text Spotting on Historical Maps

Jun 17, 2025

Yijun Lin, Yao-Yi Chiang

Abstract: Text on historical maps contains valuable information providing georeferenced historical, political, and cultural contexts. However, text extraction from historical maps is challenging due to the lack of (1) effective methods and (2) training data. Previous approaches use ad-hoc steps tailored to only specific map styles. Recent machine learning-based text spotters (e.g., for scene images) have the potential to solve these challenges because of their flexibility in supporting various types of text instances. However, these methods still face challenges in extracting precise image features for predicting every sub-component (boundary points and characters) in a text instance. This is critical because map text can be lengthy and highly rotated with complex backgrounds, posing difficulties in detecting relevant image features from a rough text region. This paper proposes PALETTE, an end-to-end text spotter for a wide variety of scanned historical maps. PALETTE introduces a novel hyper-local sampling module to explicitly learn localized image features around the target boundary points and characters of a text instance for detection and recognition. PALETTE also enables hyper-local positional embeddings to learn spatial interactions between boundary points and characters within and across text instances. In addition, this paper presents a novel approach to automatically generate synthetic map images, SynthMap+, for training text spotters for historical maps. The experiments show that PALETTE with SynthMap+ outperforms SOTA text spotters on two new benchmark datasets of historical maps, particularly for long and angled text. We have deployed PALETTE with SynthMap+ to process over 60,000 maps in the David Rumsey Historical Map collection and generated over 100 million text labels to support map searching. The project is released at https://github.com/kartta-foundation/mapkurator-palette-doc.

* Published in KDD2024
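
The abstract above describes sampling localized image features around boundary points and characters. As a rough illustration only, the sketch below reads a feature grid at fractional positions around a reference point using bilinear interpolation, the usual way deformable attention samples off-grid locations. It is not PALETTE's module: the function names, the fixed offsets, and the toy grid are all assumptions, and in a trained model the offsets would be predicted rather than hard-coded.

```python
import math

def bilinear_sample(feature_map, x, y):
    """Sample a 2D grid (list of rows) at fractional (x, y); x is the column, y the row."""
    h, w = len(feature_map), len(feature_map[0])
    # Clamp the query so it stays inside the grid.
    x = min(max(x, 0.0), w - 1.0)
    y = min(max(y, 0.0), h - 1.0)
    x0, y0 = int(math.floor(x)), int(math.floor(y))
    x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
    fx, fy = x - x0, y - y0
    # Interpolate along x on the two neighboring rows, then along y.
    top = feature_map[y0][x0] * (1 - fx) + feature_map[y0][x1] * fx
    bottom = feature_map[y1][x0] * (1 - fx) + feature_map[y1][x1] * fx
    return top * (1 - fy) + bottom * fy

def sample_around(feature_map, point, offsets):
    """Collect features at point + offset for each (dx, dy) offset (hypothetical helper)."""
    px, py = point
    return [bilinear_sample(feature_map, px + dx, py + dy) for dx, dy in offsets]

# Toy single-channel feature grid; values are illustrative only.
grid = [[0.0, 1.0, 2.0],
        [3.0, 4.0, 5.0],
        [6.0, 7.0, 8.0]]
# Fixed offsets stand in for offsets a model would learn.
offsets = [(-0.5, 0.0), (0.0, 0.0), (0.5, 0.0)]
local_features = sample_around(grid, (1.0, 1.0), offsets)
```

Keeping the offsets small relative to the reference point is what makes the sampling local in this sketch; how PALETTE chooses and constrains its sampling locations is described in the paper, not here.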

The mapKurator System: A Complete Pipeline for Extracting and Linking Text from Historical Maps

Jul 03, 2023

Jina Kim, Zekun Li, Yijun Lin, Min Namgung, Leeje Jang, Yao-Yi Chiang

Abstract: Scanned historical maps in libraries and archives are valuable repositories of geographic data that often do not exist elsewhere. Despite the potential of machine learning tools like the Google Vision APIs for automatically transcribing text from these maps into machine-readable formats, they do not work well with large-sized images (e.g., high-resolution scanned documents), cannot infer the relation between the recognized text and other datasets, and are challenging to integrate with post-processing tools. This paper introduces the mapKurator system, an end-to-end system integrating machine learning models with a comprehensive data processing pipeline. mapKurator empowers automated extraction, post-processing, and linkage of text labels from large numbers of large-dimension historical map scans. The output data, comprising bounding polygons and recognized text, is in the standard GeoJSON format, making it easily modifiable within Geographic Information Systems (GIS). The proposed system allows users to quickly generate valuable data from large numbers of historical maps for in-depth analysis of the map content and, in turn, encourages map findability, accessibility, interoperability, and reusability (FAIR principles). We deployed the mapKurator system and enabled the processing of over 60,000 maps and over 100 million text/place names in the David Rumsey Historical Map collection. We also demonstrated a seamless integration of mapKurator with a collaborative web platform that provides access to automated approaches for extracting and linking text labels from historical map scans and supports collective work to improve the results.

* 4 pages, 4 figures
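
The abstract above says the output is GeoJSON containing bounding polygons and recognized text. A minimal sketch of what one such record could look like follows. The property name `text`, the helper name, and the coordinate values are assumptions for illustration; mapKurator's actual schema may differ.

```python
import json

def make_text_feature(polygon, text):
    """Build a GeoJSON Feature for one text label (hypothetical helper).

    polygon: list of (x, y) vertices; the ring is closed automatically.
    """
    ring = [list(p) for p in polygon]
    if ring[0] != ring[-1]:
        ring.append(ring[0])  # a GeoJSON linear ring repeats its first vertex at the end
    return {
        "type": "Feature",
        "geometry": {"type": "Polygon", "coordinates": [ring]},
        "properties": {"text": text},  # assumed property name, not confirmed by the source
    }

# Illustrative values only: one rectangular label outline and its transcription.
features = [make_text_feature([(10, 20), (110, 20), (110, 45), (10, 45)], "SACRAMENTO")]
collection = {"type": "FeatureCollection", "features": features}
geojson_str = json.dumps(collection, indent=2)
```

Because the result is an ordinary FeatureCollection, it could be opened directly in GIS software that reads GeoJSON, which is the kind of editing workflow the abstract points to.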

Building Autocorrelation-Aware Representations for Fine-Scale Spatiotemporal Prediction

Dec 10, 2021

Yijun Lin, Yao-Yi Chiang, Meredith Franklin, Sandrah P. Eckel, José Luis Ambite

Abstract: Many scientific prediction problems have spatiotemporal data- and modeling-related challenges in handling complex variations in space and time using only sparse and unevenly distributed observations. This paper presents a novel deep learning architecture, Deep learning predictions for LocATion-dependent Time-sEries data (DeepLATTE), that explicitly incorporates theories of spatial statistics into neural networks to address these challenges. In addition to a feature selection module and a spatiotemporal learning module, DeepLATTE contains an autocorrelation-guided semi-supervised learning strategy to enforce both local autocorrelation patterns and global autocorrelation trends of the predictions in the learned spatiotemporal embedding space to be consistent with the observed data, overcoming the limitation of sparse and unevenly distributed observations. During the training process, both supervised and semi-supervised losses guide the updates of the entire network to: 1) prevent overfitting, 2) refine feature selection, 3) learn useful spatiotemporal representations, and 4) improve overall prediction. We demonstrate DeepLATTE using publicly available data for an important public health topic, air quality prediction, in a well-studied, complex physical environment: Los Angeles. The experiment demonstrates that the proposed approach provides accurate fine-spatial-scale air quality predictions and reveals the critical environmental factors affecting the results.

* Published in ICDM2020
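
The abstract above refers to global autocorrelation trends without naming a statistic. As background only, the sketch below computes Moran's I, a standard measure of global spatial autocorrelation from spatial statistics. It is not presented as DeepLATTE's loss; the function name, the chain-shaped neighborhood, and the sample values are assumptions for illustration.

```python
def morans_i(values, weights):
    """Global Moran's I for values at n locations, given an n x n spatial weight matrix."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    w_total = sum(sum(row) for row in weights)
    # Cross-products of deviations between every pair of locations, scaled by their weight.
    num = sum(weights[i][j] * dev[i] * dev[j] for i in range(n) for j in range(n))
    den = sum(d * d for d in dev)
    return (n / w_total) * (num / den)

# Four locations on a line; each location neighbors the ones directly next to it.
chain = [[0, 1, 0, 0],
         [1, 0, 1, 0],
         [0, 1, 0, 1],
         [0, 0, 1, 0]]

smooth = morans_i([1.0, 2.0, 3.0, 4.0], chain)        # neighbors have similar values
alternating = morans_i([1.0, 4.0, 1.0, 4.0], chain)   # neighbors have opposite values
```

With these toy inputs the smoothly varying series yields a positive value and the alternating series a negative one, which is the qualitative behavior the statistic is meant to capture.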

A Semi-Supervised Approach for Abnormal Event Prediction on Large Operational Network Time-Series Data

Oct 14, 2021

Yijun Lin, Yao-Yi Chiang

Abstract: Large network logs, recording multivariate time series generated from heterogeneous devices and sensors in a network, can often reveal important information about abnormal activities, such as network intrusions and device malfunctions. Existing machine learning methods for anomaly detection on multivariate time series typically either 1) assume that normal sequences behave consistently, in order to train unsupervised models, or 2) require a large set of labeled normal and abnormal sequences, in order to train supervised models. However, in practice, normal network activities can demonstrate significantly varying sequence patterns (e.g., before and after rerouting partial network traffic). Also, the recorded abnormal events can be sparse. This paper presents a novel semi-supervised method that efficiently captures dependencies between network time series and across time points to generate meaningful representations of network activities for predicting abnormal events. The method can use the limited labeled data to explicitly learn a separable embedding space for normal and abnormal samples and effectively leverage unlabeled data to handle training data scarcity. The experiments demonstrate that our approach significantly outperformed state-of-the-art approaches for event detection on a large real-world network log.

* 9 pages + 3 supplementary pages; submitted to SDM2022

A Comparison and Strategy of Semantic Segmentation on Remote Sensing Images

May 24, 2019

Junxing Hu, Ling Li, Yijun Lin, Fengge Wu, Junsuo Zhao

Abstract: In recent years, with the development of aerospace technology, we use more and more images captured by satellites to obtain information. However, the large number of useless raw images, limited data storage resources, and poor transmission capability on satellites hinder our use of valuable images. Therefore, it is necessary to deploy an on-orbit semantic segmentation model to filter out useless images before data transmission. In this paper, we present a detailed comparison of recent deep learning models. Considering the computing environment of satellites, we compare methods in terms of accuracy, parameter count, and resource consumption on the same public dataset. We also analyze the relation between them. Based on the experimental results, we further propose a viable on-orbit semantic segmentation strategy. It will be deployed on the TianZhi-2 satellite, which supports deep learning methods and will be launched soon.

* 8 pages, 3 figures, ICNC-FSKD 2019
