Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Petra Heck

What About the Data? A Mapping Study on Data Engineering for AI Systems

Feb 07, 2024

Petra Heck

Abstract:AI systems cannot exist without data. Now that AI models (data science and AI) have matured and are readily available to apply in practice, most organizations struggle with the data infrastructure to do so. There is a growing need for data engineers that know how to prepare data for AI systems or that can setup enterprise-wide data architectures for analytical projects. But until now, the data engineering part of AI engineering has not been getting much attention, in favor of discussing the modeling part. In this paper we aim to change this by perform a mapping study on data engineering for AI systems, i.e., AI data engineering. We found 25 relevant papers between January 2019 and June 2023, explaining AI data engineering activities. We identify which life cycle phases are covered, which technical solutions or architectures are proposed and which lessons learned are presented. We end by an overall discussion of the papers with implications for practitioners and researchers. This paper creates an overview of the body of knowledge on data engineering for AI. This overview is useful for practitioners to identify solutions and best practices as well as for researchers to identify gaps.

* Preprint, accepted for CAIN24

Via

Access Paper or Ask Questions

Defining Quality Requirements for a Trustworthy AI Wildflower Monitoring Platform

Mar 23, 2023

Petra Heck, Gerard Schouten

Figure 1 for Defining Quality Requirements for a Trustworthy AI Wildflower Monitoring Platform

Figure 2 for Defining Quality Requirements for a Trustworthy AI Wildflower Monitoring Platform

Figure 3 for Defining Quality Requirements for a Trustworthy AI Wildflower Monitoring Platform

Figure 4 for Defining Quality Requirements for a Trustworthy AI Wildflower Monitoring Platform

Abstract:For an AI solution to evolve from a trained machine learning model into a production-ready AI system, many more things need to be considered than just the performance of the machine learning model. A production-ready AI system needs to be trustworthy, i.e. of high quality. But how to determine this in practice? For traditional software, ISO25000 and its predecessors have since long time been used to define and measure quality characteristics. Recently, quality models for AI systems, based on ISO25000, have been introduced. This paper applies one such quality model to a real-life case study: a deep learning platform for monitoring wildflowers. The paper presents three realistic scenarios sketching what it means to respectively use, extend and incrementally improve the deep learning platform for wildflower identification and counting. Next, it is shown how the quality model can be used as a structured dictionary to define quality requirements for data, model and software. Future work remains to extend the quality model with metrics, tools and best practices to aid AI engineering practitioners in implementing trustworthy AI systems.

* Preprint - Paper accepted for CAIN23 - 2nd international conference on AI Engineering

Via

Access Paper or Ask Questions

Lessons Learned from Educating AI Engineers

Mar 19, 2021

Petra Heck, Gerard Schouten

Figure 1 for Lessons Learned from Educating AI Engineers

Figure 2 for Lessons Learned from Educating AI Engineers

Figure 3 for Lessons Learned from Educating AI Engineers

Figure 4 for Lessons Learned from Educating AI Engineers

Abstract:Over the past three years we have built a practice-oriented, bachelor level, educational programme for software engineers to specialize as AI engineers. The experience with this programme and the practical assignments our students execute in industry has given us valuable insights on the profession of AI engineer. In this paper we discuss our programme and the lessons learned for industry and research.

* Acccepted for the 1st International Workshop on AI Engineering (WAIN21)

Via

Access Paper or Ask Questions

Systematic Mapping Study on the Machine Learning Lifecycle

Mar 11, 2021

Yuanhao Xie, Luís Cruz, Petra Heck, Jan S. Rellermeyer

Figure 1 for Systematic Mapping Study on the Machine Learning Lifecycle

Figure 2 for Systematic Mapping Study on the Machine Learning Lifecycle

Figure 3 for Systematic Mapping Study on the Machine Learning Lifecycle

Figure 4 for Systematic Mapping Study on the Machine Learning Lifecycle

Abstract:The development of artificial intelligence (AI) has made various industries eager to explore the benefits of AI. There is an increasing amount of research surrounding AI, most of which is centred on the development of new AI algorithms and techniques. However, the advent of AI is bringing an increasing set of practical problems related to AI model lifecycle management that need to be investigated. We address this gap by conducting a systematic mapping study on the lifecycle of AI model. Through quantitative research, we provide an overview of the field, identify research opportunities, and provide suggestions for future research. Our study yields 405 publications published from 2005 to 2020, mapped in 5 different main research topics, and 31 sub-topics. We observe that only a minority of publications focus on data management and model production problems, and that more studies should address the AI lifecycle from a holistic perspective.

* Accepted at WAIN21: 1st Workshop on AI Engineering - Software Engineering for AI

Via

Access Paper or Ask Questions

Turning Software Engineers into AI Engineers

Nov 03, 2020

Petra Heck, Gerard Schouten

Figure 1 for Turning Software Engineers into AI Engineers

Figure 2 for Turning Software Engineers into AI Engineers

Figure 3 for Turning Software Engineers into AI Engineers

Figure 4 for Turning Software Engineers into AI Engineers

Abstract:In industry as well as education as well as academics we see a growing need for knowledge on how to apply machine learning in software applications. With the educational programme ICT & AI at Fontys UAS we had to find an answer to the question: "How should we educate software engineers to become AI engineers?" This paper describes our educational programme, the open source tools we use, and the literature it is based on. After three years of experience, we present our lessons learned for both educational institutions and software engineers in practice.

* Under submission for ICSE-SEET 2021

Via

Access Paper or Ask Questions