Michael Staniek

Early Prediction of Causes (not Effects) in Healthcare by Long-Term Clinical Time Series Forecasting

Aug 07, 2024

Michael Staniek, Marius Fracarolli, Michael Hagmann, Stefan Riezler

Abstract: Machine learning for early syndrome diagnosis aims to solve the intricate task of predicting a ground truth label that most often is the outcome (effect) of a medical consensus definition applied to observed clinical measurements (causes), given clinical measurements observed several hours before. Instead of focusing on the prediction of the future effect, we propose to directly predict the causes via time series forecasting (TSF) of clinical variables and determine the effect by applying the gold standard consensus definition to the forecasted values. This method has the invaluable advantage of being straightforwardly interpretable to clinical practitioners, and because model training no longer relies on a particular label, the forecasted data can be used to predict any consensus-based label. We exemplify our method by means of long-term TSF with Transformer models, with a focus on accurate prediction of sparse clinical variables involved in the SOFA-based Sepsis-3 definition and the new Simplified Acute Physiology Score (SAPS-II) definition. Our experiments are conducted on two datasets and show that, contrary to recent proposals which advocate set function encoders for time series and direct multi-step decoders, the best results are achieved by a combination of standard dense encoders with iterative multi-step decoders. The success of iterative multi-step decoding can be attributed to its ability to capture cross-variate dependencies and to a student forcing training strategy that teaches the model to rely on its own previous time step predictions for the next time step prediction.
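
The "forecast the causes, then apply the definition" idea in this abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: `toy_step` is a hypothetical one-step extrapolator standing in for the Transformer forecaster, and `toy_rule` is a hypothetical threshold check standing in for a consensus definition such as Sepsis-3; neither comes from the paper. The sketch only shows the iterative decoding loop, in which each prediction is fed back as input for the next step, not the student forcing training itself.

```python
from typing import Callable, List, Sequence

Vector = List[float]


def iterative_forecast(step: Callable[[Sequence[Vector]], Vector],
                       history: Sequence[Vector],
                       horizon: int) -> List[Vector]:
    """Roll a one-step model forward, feeding each prediction back as input."""
    window = [list(v) for v in history]
    forecasts: List[Vector] = []
    for _ in range(horizon):
        nxt = step(window)
        forecasts.append(nxt)
        # Slide the window so the next step sees the model's own output.
        window = window[1:] + [nxt]
    return forecasts


def toy_step(window: Sequence[Vector]) -> Vector:
    """Hypothetical one-step model: last value plus the last observed change."""
    last, prev = window[-1], window[-2]
    return [2 * a - b for a, b in zip(last, prev)]


def toy_rule(forecasts: Sequence[Vector], index: int, threshold: float) -> bool:
    """Hypothetical stand-in for a consensus definition applied to forecasts."""
    return any(v[index] >= threshold for v in forecasts)


# Two made-up clinical variables observed at two time steps.
history = [[1.0, 80.0], [1.5, 82.0]]
forecasts = iterative_forecast(toy_step, history, horizon=3)
flagged = toy_rule(forecasts, index=0, threshold=3.0)
```

Because the rule is applied after forecasting, the same `forecasts` could be passed to a different rule without retraining the forecaster, which is the label-independence the abstract describes.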

Text-to-OverpassQL: A Natural Language Interface for Complex Geodata Querying of OpenStreetMap

Aug 30, 2023

Michael Staniek, Raphael Schumann, Maike Züfle, Stefan Riezler

Abstract: We present Text-to-OverpassQL, a task designed to facilitate a natural language interface for querying geodata from OpenStreetMap (OSM). The Overpass Query Language (OverpassQL) allows users to formulate complex database queries and is widely adopted in the OSM ecosystem. Generating Overpass queries from natural language input serves multiple use cases. It enables novice users to utilize OverpassQL without prior knowledge, assists experienced users with crafting advanced queries, and enables tool-augmented large language models to access information stored in the OSM database. In order to assess the performance of current sequence generation models on this task, we propose OverpassNL, a dataset of 8,352 queries with corresponding natural language inputs. We further introduce task-specific evaluation metrics and ground the evaluation of the Text-to-OverpassQL task by executing the queries against the OSM database. We establish strong baselines by fine-tuning sequence-to-sequence models and adapting large language models with in-context examples. The detailed evaluation reveals strengths and weaknesses of the considered learning strategies, laying the foundations for further research into the Text-to-OverpassQL task.
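
To make the task format concrete, the sketch below pairs a natural language input with an Overpass query. The pair is invented for illustration and is not taken from OverpassNL; `bbox_tag_query` is a hypothetical helper that only covers the simplest query shape (one element type, one tag, one bounding box given as south, west, north, east), whereas the task targets far more complex queries.

```python
from typing import Dict, Tuple

BBox = Tuple[float, float, float, float]  # (south, west, north, east)


def bbox_tag_query(element: str, key: str, value: str, bbox: BBox) -> str:
    """Build a minimal Overpass QL query for one tag inside a bounding box."""
    south, west, north, east = bbox
    return (f'[out:json];{element}["{key}"="{value}"]'
            f'({south},{west},{north},{east});out;')


# Hypothetical (text, query) pair in the spirit of the task.
example: Dict[str, str] = {
    "text": "Cafes inside the given bounding box",
    "query": bbox_tag_query("node", "amenity", "cafe",
                            (49.39, 8.66, 49.42, 8.71)),
}
```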

Error-Aware Interactive Semantic Parsing of OpenStreetMap

Jun 22, 2021

Michael Staniek, Stefan Riezler

Abstract: In semantic parsing of geographical queries against real-world databases such as OpenStreetMap (OSM), unique correct answers do not necessarily exist. Instead, the truth may lie in the eye of the user, who needs to enter an interactive setup where ambiguities can be resolved and parsing mistakes can be corrected. Our work presents an approach to interactive semantic parsing where explicit error detection is performed, and a clarification question is generated that pinpoints the suspected source of ambiguity or error and communicates it to the human user. Our experimental results show that a combination of entropy-based uncertainty detection and beam search, together with multi-source training on clarification question, initial parse, and user answer, results in an improvement of 1.2% F1 score on a parser that already performs at 90.26% on the NLMaps dataset for OSM semantic parsing.

* Accepted at SpLU-RoboNLP 2021
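
The entropy-based uncertainty detection mentioned in the abstract above can be sketched roughly as follows. This is an illustration under stated assumptions, not the paper's implementation: the per-step probability distributions, the threshold, and the function name `most_uncertain_step` are all hypothetical. The idea shown is to compute the Shannon entropy of the parser's output distribution at each decoding step and flag the most uncertain step as the candidate for a clarification question.

```python
import math
from typing import List, Optional, Sequence


def entropy(probs: Sequence[float]) -> float:
    """Shannon entropy (in nats) of a discrete probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def most_uncertain_step(step_distributions: Sequence[Sequence[float]],
                        threshold: float) -> Optional[int]:
    """Return the index of the highest-entropy step, or None if below threshold."""
    entropies: List[float] = [entropy(d) for d in step_distributions]
    idx = max(range(len(entropies)), key=entropies.__getitem__)
    return idx if entropies[idx] > threshold else None


# Made-up output distributions for three decoding steps.
steps = [
    [0.97, 0.02, 0.01],  # confident
    [0.40, 0.35, 0.25],  # ambiguous
    [0.90, 0.05, 0.05],  # fairly confident
]
suspect = most_uncertain_step(steps, threshold=0.8)
```

With these numbers the second step (index 1) has the highest entropy and exceeds the threshold, so it would be the position a clarification question asks about.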

Assessing the Difficulty of Classifying ConceptNet Relations in a Multi-Label Classification Setting

May 14, 2019

Maria Becker, Michael Staniek, Vivi Nastase, Anette Frank

Abstract: Commonsense knowledge relations are crucial for advanced NLU tasks. We examine the learnability of such relations as represented in CONCEPTNET, taking into account their specific properties, which can make relation classification difficult: a given concept pair can be linked by multiple relation types, and relations can have multi-word arguments of diverse semantic types. We explore a neural open-world multi-label classification approach that focuses on the evaluation of classification accuracy for individual relations. Based on an in-depth study of the specific properties of the CONCEPTNET resource, we investigate the impact of different relation representations and model variations. Our analysis reveals that the complexity of argument types and relation ambiguity are the most important challenges to address. We design a customized evaluation method to address the incompleteness of the resource that can be expanded in future work.

* RELATIONS - Workshop on meaning relations between phrases and sentences (co-located with IWCS). May 2019, Gothenburg, Sweden
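
The multi-label, open-world setting described in the abstract above can be illustrated with a small sketch. It assumes one common way to realize such a classifier, namely an independent sigmoid score per relation with a decision threshold; the abstract does not state that the paper uses exactly this scheme. The logits are made up, and `predict_relations` is a hypothetical name. A concept pair may receive several relations, or none when no score reaches the threshold.

```python
import math
from typing import Dict, List


def sigmoid(x: float) -> float:
    """Logistic function mapping a raw score to the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def predict_relations(logits: Dict[str, float],
                      threshold: float = 0.5) -> List[str]:
    """Return every relation whose sigmoid score reaches the threshold."""
    return sorted(r for r, z in logits.items() if sigmoid(z) >= threshold)


# Made-up raw scores for one concept pair over three ConceptNet relation types.
multi = predict_relations({"IsA": 2.0, "UsedFor": 0.3, "AtLocation": -1.5})
# Open-world case: no relation is confident enough, so nothing is predicted.
none = predict_relations({"IsA": -2.0, "UsedFor": -0.7, "AtLocation": -1.5})
```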
