Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lewis Lehe

TransitGPT: A Generative AI-based framework for interacting with GTFS data using Large Language Models

Dec 07, 2024

Saipraneeth Devunuri, Lewis Lehe

Figure 1 for TransitGPT: A Generative AI-based framework for interacting with GTFS data using Large Language Models

Figure 2 for TransitGPT: A Generative AI-based framework for interacting with GTFS data using Large Language Models

Figure 3 for TransitGPT: A Generative AI-based framework for interacting with GTFS data using Large Language Models

Figure 4 for TransitGPT: A Generative AI-based framework for interacting with GTFS data using Large Language Models

Abstract:This paper introduces a framework that leverages Large Language Models (LLMs) to answer natural language queries about General Transit Feed Specification (GTFS) data. The framework is implemented in a chatbot called TransitGPT with open-source code. TransitGPT works by guiding LLMs to generate Python code that extracts and manipulates GTFS data relevant to a query, which is then executed on a server where the GTFS feed is stored. It can accomplish a wide range of tasks, including data retrieval, calculations, and interactive visualizations, without requiring users to have extensive knowledge of GTFS or programming. The LLMs that produce the code are guided entirely by prompts, without fine-tuning or access to the actual GTFS feeds. We evaluate TransitGPT using GPT-4o and Claude-3.5-Sonnet LLMs on a benchmark dataset of 100 tasks, to demonstrate its effectiveness and versatility. The results show that TransitGPT can significantly enhance the accessibility and usability of transit data.

Via

Access Paper or Ask Questions

ChatGPT for GTFS: From Words to Information

Aug 04, 2023

Saipraneeth Devunuri, Shirin Qiam, Lewis Lehe

Abstract:The General Transit Feed Specification (GTFS) standard for publishing transit data is ubiquitous. GTFS being tabular data, with information spread across different files, necessitates specialized tools or packages to retrieve information. Concurrently, the use of Large Language Models for text and information retrieval is growing. The idea of this research is to see if the current widely adopted LLMs (ChatGPT) are able to retrieve information from GTFS using natural language instructions. We first test whether ChatGPT (GPT-3.5) understands the GTFS specification. GPT-3.5 answers 77% of our multiple-choice questions (MCQ) correctly. Next, we task the LLM with information extractions from a filtered GTFS feed with 4 routes. For information retrieval, we compare zero-shot and program synthesis. Program synthesis works better, achieving ~90% accuracy on simple questions and ~40% accuracy on complex questions.

* 18 pages, 7 figures, 1 table, Transportation Research Board

Via

Access Paper or Ask Questions