Speech and Vision Lab

  • Increase font size
  • Default font size
  • Decrease font size
Home Publications
Epoch-based analysis of speech signals
Research Area: Speech Analysis Year: 2011
Type of Publication: Article Keywords: Epoch; zero-frequency filtering; fundamental frequency; pitch; impulse-like excitation
Authors: B. Yegnanarayana, Suryakanth V Gangashetty  
Speech analysis is traditionally performed using short-time analysis to extract features in time and frequency domains. The window size for the analysis is fixed somewhat arbitrarily, mainly to account for the time varying vocal tract system during production. However, speech in its primary mode of excitation is produced due to impulse-like excitation in each glottal cycle. Anchoring the speech analysis around the glottal closure instants (epochs) yields significant benefits for speech analysis. Epoch-based analysis of speech helps not only to segment the speech signals based on speech production characteristics, but also helps in accurate analysis of speech. It enables extraction of important acoustic-phonetic features such as glottal vibra- tions, formants, instantaneous fundamental frequency, etc. Epoch sequence is useful to manipulate prosody in speech synthesis applications. Accurate estimation of epochs helps in characterizing voice quality features. Epoch extraction also helps in speech enhancement and multispeaker separation. In this tutorial article, the importance of epochs for speech analysis is discussed, and methods to extract the epoch informa- tion are reviewed. Applications of epoch extraction for some speech applications are demonstrated.
Digital version