Gender-Specific Speech Enhancement Architecture for Improving Deep Neural Networks Learning
Conference paper
Nossier, S. A. and Sharif, S. 2024. Gender-Specific Speech Enhancement Architecture for Improving Deep Neural Networks Learning. 2024 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies. IEEE. https://doi.org/10.1109/3ict64318.2024.10824570
Authors | Nossier, S. A. and Sharif, S. |
---|---|
Type | Conference paper |
Abstract | Deep learning techniques for speech enhancement rely on training a deep neural network to process noisy speech, regardless of the gender of the speaker. However, research shows that male and female speech stimulate different parts of the human brain, and that female speech requires more complex analysis. This implies that speech is processed differently depending on the speaker's gender. In this work, we argue that male and female speech have different features that can help the learning process of speech enhancement deep neural networks if training is performed on male and female speech data independently, using two different deep neural networks, each specifically implemented for enhancing the clean speech signal of the target gender. This work presents a gender-specific speech enhancement architecture, which consists of a front-end binary classifier to detect the speaker's gender. Based on the classifier's decision, the noisy speech is enhanced using either a male or a female speech enhancement model. One-stage and two-stage speech enhancement approaches are used to process male and female speech, respectively. The results reveal that gender-specific speech enhancement has a positive impact on the speech enhanced by deep neural networks. Additionally, the developed architecture achieved a classifier accuracy of 96.9% and a 0.11 increase in the Covl speech quality metric on the test data, in comparison to other best-performing networks. |
Year | 2024 |
Conference | 2024 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies |
Publisher | IEEE |
Accepted author manuscript | License: All rights reserved; File access level: Anyone |
Publication dates | |
Online | 13 Jan 2025 |
Publication process dates | |
Completed | Nov 2024 |
Accepted | 02 Nov 2024 |
Deposited | 20 Dec 2024 |
Journal | 2024 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT) |
Journal citation | pp. 857-862 |
ISSN | 2770-7466, 2770-7458 |
Book title | 2024 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT) |
ISBN | 979-8-3315-3313-7, 979-8-3315-3314-4 |
Digital Object Identifier (DOI) | https://doi.org/10.1109/3ict64318.2024.10824570 |
Web address (URL) of conference proceedings | https://ieeexplore.ieee.org/xpl/conhome/10823647/proceeding |
Copyright holder | © 2024 IEEE |
Additional information | Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
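
The routing described in the abstract (a front-end binary classifier that selects a male or female enhancement model) can be illustrated with a minimal sketch. This is not the authors' implementation: `make_router`, `compose`, `toy_classifier`, `halve` and `clip` are hypothetical names, the toy functions are stand-ins for trained networks, and treating the two-stage approach as two stages applied in sequence is an assumption, since the abstract does not say how the stages are combined.

```python
from typing import Callable, List

Signal = List[float]
Enhancer = Callable[[Signal], Signal]


def compose(*stages: Enhancer) -> Enhancer:
    """Chain stages so each one processes the previous stage's output."""
    def run(signal: Signal) -> Signal:
        for stage in stages:
            signal = stage(signal)
        return signal
    return run


def make_router(classify_gender: Callable[[Signal], str],
                male_enhancer: Enhancer,
                female_enhancer: Enhancer) -> Enhancer:
    """Build an enhancer that routes noisy speech by the classifier's label."""
    def enhance(noisy: Signal) -> Signal:
        label = classify_gender(noisy)
        if label == "male":
            return male_enhancer(noisy)
        if label == "female":
            return female_enhancer(noisy)
        raise ValueError(f"unexpected classifier label: {label!r}")
    return enhance


# Hypothetical stand-ins for trained models, used only to exercise the routing.
def toy_classifier(signal: Signal) -> str:
    return "male" if sum(signal) / len(signal) < 0 else "female"


def halve(signal: Signal) -> Signal:
    return [0.5 * x for x in signal]


def clip(signal: Signal) -> Signal:
    return [max(-1.0, min(1.0, x)) for x in signal]


# One-stage path for male speech, two-stage path (assumed sequential) for female.
enhance = make_router(toy_classifier,
                      male_enhancer=halve,
                      female_enhancer=compose(halve, clip))

male_out = enhance([-2.0, -4.0])
female_out = enhance([4.0, 1.0])
```

Keeping the classifier and the two enhancers as interchangeable callables mirrors the abstract's claim that each gender-specific model is trained independently; only the routing step ties them together.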
https://repository.uel.ac.uk/item/8yvz1
Download files
Accepted author manuscript
Gender-Specific Speech Enhancement Architecture for Improving Deep Neural Networks Learning - AM.pdf
License: All rights reserved
File access level: Anyone