Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Andreas Schmidt

Keep your Distance: Determining Sampling and Distance Thresholds in Machine Learning Monitoring

Jul 11, 2022

Al-Harith Farhad, Ioannis Sorokos, Andreas Schmidt, Mohammed Naveed Akram, Koorosh Aslansefat, Daniel Schneider

Figure 1 for Keep your Distance: Determining Sampling and Distance Thresholds in Machine Learning Monitoring

Figure 2 for Keep your Distance: Determining Sampling and Distance Thresholds in Machine Learning Monitoring

Figure 3 for Keep your Distance: Determining Sampling and Distance Thresholds in Machine Learning Monitoring

Figure 4 for Keep your Distance: Determining Sampling and Distance Thresholds in Machine Learning Monitoring

Abstract:Machine Learning~(ML) has provided promising results in recent years across different applications and domains. However, in many cases, qualities such as reliability or even safety need to be ensured. To this end, one important aspect is to determine whether or not ML components are deployed in situations that are appropriate for their application scope. For components whose environments are open and variable, for instance those found in autonomous vehicles, it is therefore important to monitor their operational situation to determine its distance from the ML components' trained scope. If that distance is deemed too great, the application may choose to consider the ML component outcome unreliable and switch to alternatives, e.g. using human operator input instead. SafeML is a model-agnostic approach for performing such monitoring, using distance measures based on statistical testing of the training and operational datasets. Limitations in setting SafeML up properly include the lack of a systematic approach for determining, for a given application, how many operational samples are needed to yield reliable distance information as well as to determine an appropriate distance threshold. In this work, we address these limitations by providing a practical approach and demonstrate its use in a well known traffic sign recognition problem, and on an example using the CARLA open-source automotive simulator.

Via

Access Paper or Ask Questions

It's AI Match: A Two-Step Approach for Schema Matching Using Embeddings

Mar 08, 2022

Benjamin Hättasch, Michael Truong-Ngoc, Andreas Schmidt, Carsten Binnig

Figure 1 for It's AI Match: A Two-Step Approach for Schema Matching Using Embeddings

Figure 2 for It's AI Match: A Two-Step Approach for Schema Matching Using Embeddings

Figure 3 for It's AI Match: A Two-Step Approach for Schema Matching Using Embeddings

Figure 4 for It's AI Match: A Two-Step Approach for Schema Matching Using Embeddings

Abstract:Since data is often stored in different sources, it needs to be integrated to gather a global view that is required in order to create value and derive knowledge from it. A critical step in data integration is schema matching which aims to find semantic correspondences between elements of two schemata. In order to reduce the manual effort involved in schema matching, many solutions for the automatic determination of schema correspondences have already been developed. In this paper, we propose a novel end-to-end approach for schema matching based on neural embeddings. The main idea is to use a two-step approach consisting of a table matching step followed by an attribute matching step. In both steps we use embeddings on different levels either representing the whole table or single attributes. Our results show that our approach is able to determine correspondences in a robust and reliable way and compared to traditional schema matching approaches can find non-trivial correspondences.

* Accepted to the 2nd International Workshop on Applied AI for Database Systems and Applications (AIDB'20), August 31, 2020, Tokyo, Japan

Via

Access Paper or Ask Questions