Noise-Robust Detection of Whispering in Telephone Calls Using Deep Neural Networks

Diment, Aleksandr; Virtanen, Tuomas; Parviainen, Mikko; Zelov, Roman; Glasman, Alex
Abstract

Detection of whispered speech in the presence of high levels of background noise has applications in fraudulent behaviour recognition. For instance, it can serve as an indicator of possible insider trading. We propose a deep neural network (DNN)-based whispering detection system, which operates on both magnitude and phase features, including the group delay feature from all-pole models (APGD). We show that the APGD feature outperforms the conventional ones. Trained and evaluated on the collected diverse dataset of whispered and normal speech with emulated phone line distortions and significant amounts of added background noise, the proposed system performs with accuracies as high as 91.8%.

Keywords

whispering; noise robustness; deep neural networks

Year:
2016
Publisher:
IEEE
Month:
8
DOI:
10.1109/EUSIPCO.2016.7760661