Miroslaw Staron

Improving Image Data Leakage Detection in Automotive Software

Oct 29, 2024

Md Abu Ahammed Babu, Sushant Kumar Pandey, Darko Durisic, Ashok Chaitanya Koppisetty, Miroslaw Staron

Abstract: Data leakage is a common problem that is often overlooked when splitting data into train and test sets before training an ML/DL model. Leakage artificially inflates model performance during the evaluation phase, which often leads to erroneous predictions in real-time deployment. However, detecting such leakage is challenging, particularly in the object detection context of perception systems, where the model is trained on image data. In this study, we conduct a computational experiment on the Cirrus dataset from our industrial partner Volvo Cars to develop a method for detecting data leakage. We then evaluate the method on a public dataset, Kitti, which is a popular and widely accepted benchmark in the automotive domain. The results show that our proposed method detects previously unknown data leakage in the Kitti dataset.
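The abstract does not describe the detection method itself, so the following is only a minimal sketch of the general idea of train/test leakage in image data, not the authors' approach. It assumes a simple average hash over tiny grayscale images (given as nested lists) and flags train/test pairs whose hashes differ in at most `max_distance` bits. The function names, the threshold, and the toy images are all hypothetical.

```python
from itertools import product


def average_hash(pixels):
    """Return a tuple of bits: 1 where a pixel is brighter than the image mean."""
    flat = [value for row in pixels for value in row]
    mean = sum(flat) / len(flat)
    return tuple(1 if value > mean else 0 for value in flat)


def hamming(a, b):
    """Count the positions at which two equal-length bit tuples differ."""
    return sum(x != y for x, y in zip(a, b))


def find_leaks(train, test, max_distance=1):
    """Return (train_name, test_name, distance) for near-duplicate pairs.

    train and test map an image name to a 2D list of grayscale values.
    max_distance is an illustrative threshold, not a recommended value.
    """
    train_hashes = {name: average_hash(p) for name, p in train.items()}
    test_hashes = {name: average_hash(p) for name, p in test.items()}
    leaks = []
    for (t_name, t_hash), (s_name, s_hash) in product(
        train_hashes.items(), test_hashes.items()
    ):
        distance = hamming(t_hash, s_hash)
        if distance <= max_distance:
            leaks.append((t_name, s_name, distance))
    return leaks


# Toy 2x2 "images": test_x is a slightly perturbed copy of train_a,
# as might happen with consecutive frames from the same drive.
train = {
    "train_a": [[10, 200], [220, 15]],
    "train_b": [[250, 240], [5, 10]],
}
test = {
    "test_x": [[12, 198], [225, 14]],
    "test_y": [[5, 10], [250, 240]],
}

leaks = find_leaks(train, test)
print(leaks)
```

In this toy setting only the `train_a`/`test_x` pair is flagged. Real image data would need a more robust similarity measure and a threshold chosen for the dataset.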

A Multi-model Approach for Video Data Retrieval in Autonomous Vehicle Development

Oct 04, 2024

Jesper Knapp, Klas Moberg, Yuchuan Jin, Simin Sun, Miroslaw Staron

Abstract: Autonomous driving software generates enormous amounts of data every second, which software development organizations save as logs for future analysis and testing. However, given the vast size of this data, locating specific scenarios within a collection of vehicle logs can be challenging. Writing the correct SQL queries to find these scenarios requires engineers to have a strong background in SQL and in the specific databases in question, which further complicates the search. This paper presents and evaluates a pipeline that allows searching for specific scenarios in log collections using natural language descriptions instead of SQL. The generated descriptions were evaluated by engineers working with vehicle logs at Zenseact on a scale from 1 to 5. Our approach achieved a mean score of 3.3, demonstrating the potential of a multi-model architecture to improve the software development workflow. We also present an interface that visualizes both the query process and the results.

Trusting Machine Learning Results from Medical Procedures in the Operating Room

Jan 04, 2022

Ali El-Merhi, Helena Odenstedt Hergés, Linda Block, Mikael Elam, Richard Vithal, Jaquette Liljencrantz, Miroslaw Staron

Abstract: Machine learning can be used to analyse physiological data for several purposes. Detection of cerebral ischemia is an achievement that would have a high impact on patient care. We studied whether continuous physiological data collected from non-invasive monitors and analysed with machine learning could detect cerebral ischemia in two different settings: during carotid endarterectomy (CEA) surgery and during endovascular thrombectomy in acute stroke. We compare the results from the two groups, and from one patient in each group, in detail. While results from CEA patients are consistent, those from thrombectomy patients are not, and they frequently contain extreme values such as an accuracy of 1.0. We conclude that this results from the short duration of the procedure and the abundance of poor-quality data, which together leave only small data sets. These results therefore cannot be trusted.
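To illustrate why an accuracy of 1.0 on a small data set carries little weight, here is a minimal sketch that computes a Wilson score interval for a reported accuracy. It is not taken from the paper: the sample sizes (10 and 1000) and the 95% level (`z = 1.96`) are assumptions chosen for illustration only.

```python
import math


def wilson_interval(correct, total, z=1.96):
    """Return (lower, upper) Wilson score bounds for accuracy correct/total.

    z = 1.96 corresponds to an approximate 95% confidence level.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    p = correct / total
    z2 = z * z
    denominator = 1 + z2 / total
    center = (p + z2 / (2 * total)) / denominator
    half_width = (
        z * math.sqrt(p * (1 - p) / total + z2 / (4 * total * total)) / denominator
    )
    return center - half_width, center + half_width


# Same observed accuracy (1.0), very different amounts of evidence.
small_lower, small_upper = wilson_interval(10, 10)
large_lower, large_upper = wilson_interval(1000, 1000)

print(f"n=10:   [{small_lower:.3f}, {small_upper:.3f}]")
print(f"n=1000: [{large_lower:.3f}, {large_upper:.3f}]")
```

With only ten test samples the lower bound sits far below the observed accuracy, whereas with a thousand samples it stays close to it. The interval also assumes independent test samples, which may not hold for continuous recordings from a single patient.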

* AAAI workshop on Trustworthy AI for Healthcare 2022

Towards Trustworthy Cross-patient Model Development

Dec 20, 2021

Ali El-Merhi, Helena Odenstedt Hergés, Linda Block, Mikael Elam, Richard Vithal, Jaquette Liljencrantz, Miroslaw Staron

Abstract: Machine learning is used in medicine to support physicians in examination, diagnosis, and outcome prediction. One of the most dynamic areas is the use of patient-generated health data from intensive care units. The goal of this paper is to demonstrate how we advance cross-patient ML model development by combining patients' demographic data with their physiological data. We used a population of patients undergoing Carotid Endarterectomy (CEA), where we studied differences in model performance and explainability when models were trained on all patients and on one patient at a time. The results show that patients' demographics have a large impact on performance and explainability, and thus on trustworthiness. We conclude that we can increase trust in ML models in a cross-patient context by carefully selecting models and patients based on their demographics and the surgical procedure.

* AAAI workshop on Trustworthy AI for Healthcare, 2022
