Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Marco Spruit

A Review of Challenges in Speech-based Conversational AI for Elderly Care

Dec 10, 2024

Willemijn Klaassen, Bram van Dijk, Marco Spruit

Abstract:Artificially intelligent systems optimized for speech conversation are appearing at a fast pace. Such models are interesting from a healthcare perspective, as these voice-controlled assistants may support the elderly and enable remote health monitoring. The bottleneck for efficacy, however, is how well these devices work in practice and how the elderly experience them, but research on this topic is scant. We review elderly use of voice-controlled AI and highlight various user- and technology-centered issues, that need to be considered before effective speech-controlled AI for elderly care can be realized.

* Accepted for publication at Medical Informatics Europe 2025 conference, Glasgow. 5 pages, 1 figure, 1 table

Via

Access Paper or Ask Questions

UU-Tax at SemEval-2022 Task 3: Improving the generalizability of language models for taxonomy classification through data augmentation

Oct 07, 2022

Injy Sarhan, Pablo Mosteiro, Marco Spruit

Figure 1 for UU-Tax at SemEval-2022 Task 3: Improving the generalizability of language models for taxonomy classification through data augmentation

Figure 2 for UU-Tax at SemEval-2022 Task 3: Improving the generalizability of language models for taxonomy classification through data augmentation

Figure 3 for UU-Tax at SemEval-2022 Task 3: Improving the generalizability of language models for taxonomy classification through data augmentation

Figure 4 for UU-Tax at SemEval-2022 Task 3: Improving the generalizability of language models for taxonomy classification through data augmentation

Abstract:This paper presents our strategy to address the SemEval-2022 Task 3 PreTENS: Presupposed Taxonomies Evaluating Neural Network Semantics. The goal of the task is to identify if a sentence is deemed acceptable or not, depending on the taxonomic relationship that holds between a noun pair contained in the sentence. For sub-task 1 -- binary classification -- we propose an effective way to enhance the robustness and the generalizability of language models for better classification on this downstream task. We design a two-stage fine-tuning procedure on the ELECTRA language model using data augmentation techniques. Rigorous experiments are carried out using multi-task learning and data-enriched fine-tuning. Experimental results demonstrate that our proposed model, UU-Tax, is indeed able to generalize well for our downstream task. For sub-task 2 -- regression -- we propose a simple classifier that trains on features obtained from Universal Sentence Encoder (USE). In addition to describing the submitted systems, we discuss other experiments that employ pre-trained language models and data augmentation techniques. For both sub-tasks, we perform error analysis to further understand the behaviour of the proposed models. We achieved a global F1_Binary score of 91.25% in sub-task 1 and a rho score of 0.221 in sub-task 2.

* In Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), pages 271-281, Seattle, United States. Association for Computational Linguistics

Via

Access Paper or Ask Questions

Bias Discovery in Machine Learning Models for Mental Health

May 24, 2022

Pablo Mosteiro, Jesse Kuiper, Judith Masthoff, Floortje Scheepers, Marco Spruit

Figure 1 for Bias Discovery in Machine Learning Models for Mental Health

Figure 2 for Bias Discovery in Machine Learning Models for Mental Health

Figure 3 for Bias Discovery in Machine Learning Models for Mental Health

Figure 4 for Bias Discovery in Machine Learning Models for Mental Health

Abstract:Fairness and bias are crucial concepts in artificial intelligence, yet they are relatively ignored in machine learning applications in clinical psychiatry. We computed fairness metrics and present bias mitigation strategies using a model trained on clinical mental health data. We collected structured data related to the admission, diagnosis, and treatment of patients in the psychiatry department of the University Medical Center Utrecht. We trained a machine learning model to predict future administrations of benzodiazepines on the basis of past data. We found that gender plays an unexpected role in the predictions-this constitutes bias. Using the AI Fairness 360 package, we implemented reweighing and discrimination-aware regularization as bias mitigation strategies, and we explored their implications for model performance. This is the first application of bias exploration and mitigation in a machine learning model trained on real clinical psychiatry data.

Via

Access Paper or Ask Questions

Federated learning for violence incident prediction in a simulated cross-institutional psychiatric setting

May 17, 2022

Thomas Borger, Pablo Mosteiro, Heysem Kaya, Emil Rijcken, Albert Ali Salah, Floortje Scheepers, Marco Spruit

Figure 1 for Federated learning for violence incident prediction in a simulated cross-institutional psychiatric setting

Figure 2 for Federated learning for violence incident prediction in a simulated cross-institutional psychiatric setting

Figure 3 for Federated learning for violence incident prediction in a simulated cross-institutional psychiatric setting

Figure 4 for Federated learning for violence incident prediction in a simulated cross-institutional psychiatric setting

Abstract:Inpatient violence is a common and severe problem within psychiatry. Knowing who might become violent can influence staffing levels and mitigate severity. Predictive machine learning models can assess each patient's likelihood of becoming violent based on clinical notes. Yet, while machine learning models benefit from having more data, data availability is limited as hospitals typically do not share their data for privacy preservation. Federated Learning (FL) can overcome the problem of data limitation by training models in a decentralised manner, without disclosing data between collaborators. However, although several FL approaches exist, none of these train Natural Language Processing models on clinical notes. In this work, we investigate the application of Federated Learning to clinical Natural Language Processing, applied to the task of Violence Risk Assessment by simulating a cross-institutional psychiatric setting. We train and compare four models: two local models, a federated model and a data-centralised model. Our results indicate that the federated model outperforms the local models and has similar performance as the data-centralised model. These findings suggest that Federated Learning can be used successfully in a cross-institutional setting and is a step towards new applications of Federated Learning based on clinical notes

* Expert Systems with Applications Volume 199, 1 August 2022, 116720

Via

Access Paper or Ask Questions

Making sense of violence risk predictions using clinical notes

Apr 29, 2022

Pablo Mosteiro, Emil Rijcken, Kalliopi Zervanou, Uzay Kaymak, Floortje Scheepers, Marco Spruit

Figure 1 for Making sense of violence risk predictions using clinical notes

Figure 2 for Making sense of violence risk predictions using clinical notes

Figure 3 for Making sense of violence risk predictions using clinical notes

Figure 4 for Making sense of violence risk predictions using clinical notes

Abstract:Violence risk assessment in psychiatric institutions enables interventions to avoid violence incidents. Clinical notes written by practitioners and available in electronic health records (EHR) are valuable resources that are seldom used to their full potential. Previous studies have attempted to assess violence risk in psychiatric patients using such notes, with acceptable performance. However, they do not explain why classification works and how it can be improved. We explore two methods to better understand the quality of a classifier in the context of clinical note analysis: random forests using topic models, and choice of evaluation metric. These methods allow us to understand both our data and our methodology more profoundly, setting up the groundwork to work on improved models that build upon this understanding. This is particularly important when it comes to the generalizability of evaluated classifiers to new data, a trustworthiness problem that is of great interest due to the increased availability of new data in electronic format.

* In: Huang, Z., Siuly, S., Wang, H., Zhou, R., Zhang, Y. (eds) Health Information Science. HIS 2020. Lecture Notes in Computer Science(), vol 12435. Springer, Cham
* arXiv admin note: substantial text overlap with arXiv:2204.13535

Via

Access Paper or Ask Questions

Machine Learning for Violence Risk Assessment Using Dutch Clinical Notes

Apr 28, 2022

Pablo Mosteiro, Emil Rijcken, Kalliopi Zervanou, Uzay Kaymak, Floortje Scheepers, Marco Spruit

Figure 1 for Machine Learning for Violence Risk Assessment Using Dutch Clinical Notes

Figure 2 for Machine Learning for Violence Risk Assessment Using Dutch Clinical Notes

Figure 3 for Machine Learning for Violence Risk Assessment Using Dutch Clinical Notes

Figure 4 for Machine Learning for Violence Risk Assessment Using Dutch Clinical Notes

Abstract:Violence risk assessment in psychiatric institutions enables interventions to avoid violence incidents. Clinical notes written by practitioners and available in electronic health records are valuable resources capturing unique information, but are seldom used to their full potential. We explore conventional and deep machine learning methods to assess violence risk in psychiatric patients using practitioner notes. The performance of our best models is comparable to the currently used questionnaire-based method, with an area under the Receiver Operating Characteristic curve of approximately 0.8. We find that the deep-learning model BERTje performs worse than conventional machine learning methods. We also evaluate our data and our classifiers to understand the performance of our models better. This is particularly important for the applicability of evaluated classifiers to new data, and is also of great interest to practitioners, due to the increased availability of new data in electronic format.

* Journal of Artificial Intelligence for Medical Sciences Volume 2, Issue 1-2, June 2021, Pages 44 - 54

Via

Access Paper or Ask Questions

Dutch Named Entity Recognition and De-identification Methods for the Human Resource Domain

Jun 04, 2021

Chaïm van Toledo, Friso van Dijk, Marco Spruit

Figure 1 for Dutch Named Entity Recognition and De-identification Methods for the Human Resource Domain

Figure 2 for Dutch Named Entity Recognition and De-identification Methods for the Human Resource Domain

Figure 3 for Dutch Named Entity Recognition and De-identification Methods for the Human Resource Domain

Figure 4 for Dutch Named Entity Recognition and De-identification Methods for the Human Resource Domain

Abstract:The human resource (HR) domain contains various types of privacy-sensitive textual data, such as e-mail correspondence and performance appraisal. Doing research on these documents brings several challenges, one of them anonymisation. In this paper, we evaluate the current Dutch text de-identification methods for the HR domain in four steps. First, by updating one of these methods with the latest named entity recognition (NER) models. The result is that the NER model based on the CoNLL 2002 corpus in combination with the BERTje transformer give the best combination for suppressing persons (recall 0.94) and locations (recall 0.82). For suppressing gender, DEDUCE is performing best (recall 0.53). Second NER evaluation is based on both strict de-identification of entities (a person must be suppressed as a person) and third evaluation on a loose sense of de-identification (no matter what how a person is suppressed, as long it is suppressed). In the fourth and last step a new kind of NER dataset is tested for recognising job titles in texts.

* International Journal on Natural Language Computing (IJNLC) Vol.9, No.6, December 2020

Via

Access Paper or Ask Questions