Speech and Vision Lab

Voiced/nonvoiced detection based on robustness of voiced epochs
Research Area: Speech Analysis Year: 2010
Type of Publication: Article Keywords: Excitation source, glottal activity detection, glottal closure instant, voiced nonvoiced detection, zero frequency filtering
Authors: Dhananjaya N., B. Yegnanarayana  
In this paper, a new method for voiced/nonvoiced detection based on epoch extraction is proposed. The zero-frequency filtered speech signal is used to extract the instants of significant excitation (or epochs). The method extracts epochs in voiced regions robustly, even when a small amount of white noise is added, and this robustness is used to distinguish voiced epochs from the random instants detected in nonvoiced regions. The main feature of the proposed method is that it relies on the strength of glottal activity rather than on the periodicity of the signal. The performance of the proposed algorithm is studied on the TIMIT and CMU ARCTIC databases for two noise types, white noise and vehicle noise from the NOISEX database, at different signal-to-noise ratios (SNRs). The proposed method performs similarly to or better than the popular normalized cross-correlation based voiced/nonvoiced detection used in the open source utility wavesurfer, especially at lower SNRs.
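The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The zero-frequency filter follows the commonly described recipe (difference the signal, pass it through two ideal zero-frequency resonators, then subtract a local mean several times), and epochs are taken as positive-going zero crossings of the result. Everything else is an assumption made for illustration: the function names, the 10 ms trend-removal window, the 20 dB noise level, the 0.5 ms matching tolerance, and the rule that an epoch counts as robust when a matching epoch reappears after white noise is added.

```python
import numpy as np


def zero_frequency_filter(s, fs, win_ms=10.0, passes=3):
    """Zero-frequency filter a signal s sampled at fs Hz (illustrative sketch).

    The signal must be longer than the trend-removal window. Values near the
    two ends are unreliable because the local mean is zero-padded there.
    """
    s = np.asarray(s, dtype=np.float64)
    x = np.concatenate(([0.0], np.diff(s)))  # differencing removes any DC bias
    # Each ideal zero-frequency resonator, y[n] = 2*y[n-1] - y[n-2] + x[n],
    # is a double running sum, so two resonators equal four running sums.
    y = x
    for _ in range(4):
        y = np.cumsum(y)
    # Remove the polynomial trend by repeatedly subtracting a local mean.
    # win_ms is an assumed value, roughly one to two pitch periods.
    half = max(1, int(round(win_ms * 1e-3 * fs / 2)))
    kernel = np.ones(2 * half + 1) / (2 * half + 1)
    for _ in range(passes):
        y = y - np.convolve(y, kernel, mode="same")
    return y


def positive_zero_crossings(y):
    """Indices n where y[n-1] < 0 and y[n] >= 0."""
    y = np.asarray(y)
    return np.flatnonzero((y[:-1] < 0) & (y[1:] >= 0)) + 1


def robust_epoch_mask(epochs, noisy_epochs, tol):
    """True for each epoch that has a noisy-signal epoch within tol samples."""
    epochs = np.asarray(epochs)
    noisy = np.asarray(noisy_epochs)
    if noisy.size == 0:
        return np.zeros(epochs.shape, dtype=bool)
    dist = np.abs(epochs[:, None] - noisy[None, :]).min(axis=1)
    return dist <= tol


def voiced_epoch_candidates(s, fs, snr_db=20.0, tol_ms=0.5, seed=0):
    """Return (epochs, mask); mask marks epochs that survive added white noise.

    snr_db and tol_ms are assumed values for illustration, not the paper's.
    """
    s = np.asarray(s, dtype=np.float64)
    rng = np.random.default_rng(seed)
    noise_power = np.mean(s ** 2) / (10.0 ** (snr_db / 10.0))
    noisy = s + rng.normal(0.0, np.sqrt(noise_power), size=s.shape)
    epochs = positive_zero_crossings(zero_frequency_filter(s, fs))
    noisy_epochs = positive_zero_crossings(zero_frequency_filter(noisy, fs))
    tol = int(round(tol_ms * 1e-3 * fs))
    return epochs, robust_epoch_mask(epochs, noisy_epochs, tol)
```

Calling `voiced_epoch_candidates(signal, fs)` on a mono signal returns the epoch locations in samples together with a boolean mask. Under the assumption above, regions where most epochs are flagged would be treated as voiced. How the robustness measure is turned into a final decision is described in the paper itself, not in this sketch.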