Bird Audio Diarization with Faster R-CNN
Conference paper
Shrestha, R., Glackin, C., Wall, J. and Cannings, N. 2021. Bird Audio Diarization with Faster R-CNN. 30th International Conference on Artificial Neural Networks (ICANN). Online 14 - 17 Sep 2021 Springer. https://doi.org/10.1007/978-3-030-86362-3_34
Authors | Shrestha, R., Glackin, C., Wall, J. and Cannings, N. |
---|---|
Type | Conference paper |
Abstract | Birds embody particular phonic and visual traits that distinguish them from 10,000 distinct bird species worldwide. Birds are also perceived to be indicators of biodiversity due to their propensity for responding to changes in their environment. An effective, automatic wildlife monitoring system based on bird bioacoustics, which can support manual classification, can be pivotal for the protection of the environment and endangered species. In modern machine learning, real-life bird audio classification is still considered as an esoteric challenge owing to the convoluted patterns present in bird song, and the complications that arise when numerous bird species are present in a common setting. Existing avian bioacoustic monitoring systems struggle when multiple bird species are present in an audio segment. To overcome these challenges, we propose a novel Faster Region-Based Convolutional Neural Network bird audio diarization system that incorporates object detection in the spectral domain and performs diarization of 50 bird species to effectively tackle the `which bird spoke when?' problem. Benchmark results are presented using the Bird Songs from Europe dataset achieving a Diarization Error Rate of 21.81, Jaccard Error Rate of 20.94 and F1, precision and recall values of 0.85, 0.83 and 0.87 respectively. |
Keywords | Deep Neural Networks; Audio Classification; Diarization; Automatic Wildlife Monitoring |
Year | 2021 |
Conference | 30th International Conference on Artificial Neural Networks (ICANN) |
Publisher | Springer |
Accepted author manuscript | License File Access Level Anyone |
Publication dates | |
Online | 07 Sep 2021 |
Publication process dates | |
Accepted | 15 Jun 2021 |
Deposited | 01 Jul 2021 |
Journal citation | pp. 415-426 |
ISSN | 0302-9743 |
Book title | Artificial Neural Networks and Machine Learning – ICANN 2021: 30th International Conference on Artificial Neural Networks, Bratislava, Slovakia, September 14–17, 2021, Proceedings, Part I |
Book editor | Farkaš, I. |
Masulli, P. | |
Otte, S. | |
Wermter, S. | |
ISBN | 978-3-030-86361-6 |
Digital Object Identifier (DOI) | https://doi.org/10.1007/978-3-030-86362-3_34 |
Web address (URL) | https://www.springer.com/gb/book/9783030863616 |
Copyright holder | © Springer Nature Switzerland AG 2021 |
Additional information | The final authenticated version is |
https://repository.uel.ac.uk/item/8986x
Download files
Accepted author manuscript
ICANN2021_BirdAudioDiarization.pdf | ||
License: Springer Nature Terms of Use for accepted manuscripts of subscription articles, books and chapters | ||
File access level: Anyone |
459
total views215
total downloads9
views this month5
downloads this month