Speech and Vision Lab

  • Increase font size
  • Default font size
  • Decrease font size
Home Publications
Exploring subsegmental and suprasegmental features for a text-dependent speaker verification in distant speech signals
Research Area: Speech Analysis Year: 2010
Type of Publication: In Proceedings Keywords: automatic speaker verification, text-dependent, distant speech, signal-to-noise ratio, pitch, duration
Authors: B. Avinash, Guruprasad S., B. Yegnanarayana  
   
Abstract:
Existing automatic speaker verification (ASV) systems perform with high accuracy when the speech signal is collected close to the mouth of the speaker (bjective of this paper is to address some issues in the processing of speech signals collected at a distance from the speaker, for textdependent ASV system. An acoustic feature derived from short segments of speech signals is proposed for the ASV task. The key idea is to exploit the high signal-to-noise nature of short segments of speech in the vicinity of impulse-like excitations. We show that the proposed feature yields better performance of speaker verification than the mel-frequency cepstral coefficients (MFCCs). In addition, regions of high signal-to-reverberation ratio, duration and pitch information are used to improve the performance of the ASV system for distant speech.
Digital version