Marcelo Queiroz

Discriminant audio properties in deep learning based respiratory insufficiency detection in Brazilian Portuguese

May 27, 2024

Marcelo Matheus Gauy, Larissa Cristina Berti, Arnaldo Cândido Jr, Augusto Camargo Neto, Alfredo Goldman, Anna Sara Shafferman Levin, Marcus Martins, Beatriz Raposo de Medeiros, Marcelo Queiroz, Ester Cerdeira Sabino (+2 more)

Abstract: This work investigates Artificial Intelligence (AI) systems that detect respiratory insufficiency (RI) by analyzing speech audio, thus treating speech as an RI biomarker. Previous works collected RI data (P1) from COVID-19 patients during the first phase of the pandemic and trained modern AI models, such as CNNs and Transformers, which achieved $96.5\%$ accuracy, showing the feasibility of RI detection via AI. Here, we collect data (P2) from RI patients with several causes besides COVID-19, aiming to extend AI-based RI detection. We also collect control data from hospital patients without RI. We show that the considered models, when trained on P1, do not generalize to P2, indicating that COVID-19 RI has features that may not be found in all RI types.

* Artificial Intelligence in Medicine Proceedings 2023, pages 271-275
* 5 pages, 2 figures, 1 table. Published in Artificial Intelligence in Medicine (AIME) 2023

Tempo vs. Pitch: understanding self-supervised tempo estimation

Apr 14, 2023

Giovana Morais, Matthew E. P. Davies, Marcelo Queiroz, Magdalena Fuentes

Abstract: Self-supervision methods learn representations by solving pretext tasks that do not require human-generated labels, alleviating the need for time-consuming annotations. These methods have been applied in computer vision, natural language processing, environmental sound analysis, and recently in music information retrieval, e.g., for pitch estimation. Particularly in the context of music, there are few insights about the fragility of these models under different data distributions, and about how that fragility could be mitigated. In this paper, we explore these questions by dissecting a self-supervised model for pitch estimation adapted for tempo estimation via rigorous experimentation with synthetic data. Specifically, we study the relationship between the input representation and data distribution for self-supervised tempo estimation.

* 5 pages, 3 figures, published at the 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing
