Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Longfei Song

ICSD: An Open-source Dataset for Infant Cry and Snoring Detection

Aug 20, 2024

Qingyu Liu, Longfei Song, Dongxing Xu, Yanhua Long

Figure 1 for ICSD: An Open-source Dataset for Infant Cry and Snoring Detection

Figure 2 for ICSD: An Open-source Dataset for Infant Cry and Snoring Detection

Figure 3 for ICSD: An Open-source Dataset for Infant Cry and Snoring Detection

Figure 4 for ICSD: An Open-source Dataset for Infant Cry and Snoring Detection

Abstract:The detection and analysis of infant cry and snoring events are crucial tasks within the field of audio signal processing. While existing datasets for general sound event detection are plentiful, they often fall short in providing sufficient, strongly labeled data specific to infant cries and snoring. To provide a benchmark dataset and thus foster the research of infant cry and snoring detection, this paper introduces the Infant Cry and Snoring Detection (ICSD) dataset, a novel, publicly available dataset specially designed for ICSD tasks. The ICSD comprises three types of subsets: a real strongly labeled subset with event-based labels annotated manually, a weakly labeled subset with only clip-level event annotations, and a synthetic subset generated and labeled with strong annotations. This paper provides a detailed description of the ICSD creation process, including the challenges encountered and the solutions adopted. We offer a comprehensive characterization of the dataset, discussing its limitations and key factors for ICSD usage. Additionally, we conduct extensive experiments on the ICSD dataset to establish baseline systems and offer insights into the main factors when using this dataset for ICSD research. Our goal is to develop a dataset that will be widely adopted by the community as a new open benchmark for future ICSD research.

* 11 pages, 6 figures

Via

Access Paper or Ask Questions