Dimitri Leandro de Oliveira Silva

Microphone Array Based Surveillance Audio Classification

May 22, 2020

Dimitri Leandro de Oliveira Silva, Tito Spadini, Ricardo Suyama

Abstract: This work assessed seven classical classifiers and two beamforming algorithms for detecting surveillance sound events. The tests used additive white Gaussian noise (AWGN) at SNRs from -10 dB to 30 dB. Data augmentation was also employed to improve the algorithms' performance. The results showed that the combination of SVM and Delay-and-Sum (DaS) achieved the best accuracy (up to 86.0%) but had a high computational cost (≈ 402 ms), mainly due to DaS. SGD also appears to be a good alternative, since it achieved good accuracy as well (up to 85.3%) with a shorter processing time (≈ 165 ms).
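
The abstract names two signal-processing steps: corrupting recordings with AWGN at a chosen SNR, and Delay-and-Sum beamforming. The following is a minimal sketch of both, not the authors' implementation. It assumes integer sample delays that are already known for each microphone, and the function names `add_awgn` and `delay_and_sum` are hypothetical, introduced only for illustration.

```python
import math
import random


def add_awgn(signal, snr_db, rng=None):
    """Return a copy of `signal` with white Gaussian noise at `snr_db` dB SNR.

    Hypothetical helper, not taken from the paper.
    """
    rng = rng or random.Random()
    # Average signal power: mean of the squared samples.
    signal_power = sum(x * x for x in signal) / len(signal)
    # SNR_dB = 10 * log10(P_signal / P_noise), solved for P_noise.
    noise_power = signal_power / (10 ** (snr_db / 10))
    sigma = math.sqrt(noise_power)
    return [x + rng.gauss(0.0, sigma) for x in signal]


def delay_and_sum(channels, delays):
    """Align each channel by its integer sample delay, then average.

    Assumes `delays[k]` is the number of samples by which channel k lags
    the reference; a practical array would estimate these delays and may
    need fractional-sample interpolation.
    """
    # Keep only the samples available in every channel after alignment.
    n = min(len(ch) - d for ch, d in zip(channels, delays))
    return [
        sum(ch[d + i] for ch, d in zip(channels, delays)) / len(channels)
        for i in range(n)
    ]
```

Averaging the aligned channels reinforces the source signal while noise that is uncorrelated across microphones partially cancels. The per-channel alignment is also a plausible reason for the extra processing time the abstract attributes to DaS.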

Sound Event Recognition in a Smart City Surveillance Context

Oct 27, 2019

Tito Spadini, Dimitri Leandro de Oliveira Silva, Ricardo Suyama

Abstract: Due to the growing demand for improved surveillance capabilities in smart cities, systems need to be developed that provide better monitoring capabilities to competent authorities, agencies responsible for strategic resource management, and emergency call centers. This work assumes that, as a complementary monitoring solution, a system capable of detecting the occurrence of sound events, that is, performing the Sound Event Recognition (SER) task, is highly convenient. To contribute to the classification of such events, this paper explored several classifiers on the SESA dataset, composed of audio recordings of three hazard classes (gunshots, explosions, and sirens) and a class of casual sounds that could be misinterpreted as one of the other sounds. The best result was obtained by SGD, with an accuracy of 72.13% and a classification time of 6.81 ms, reinforcing the viability of such an approach.
