Speech and Vision Lab

  • Increase font size
  • Default font size
  • Decrease font size
Home Publications
Effectiveness of a periodic and aperiodic decomposition method for analysis of voice sources
Research Area: Uncategorized Year: 1998
Type of Publication: Article Keywords: jitter, modulation, random noise, spectral analysis, speech processing, speech synthesisadditive pulsed noise, aperiodic decomposition method, continuous random noise, excitation source control, formant synthesis, fundamental frequency variation, jitter,
Authors: C. d'Alessandro, V. Darsinos, B. Yegnanarayana  
Journal: IEEE Trans. Speech and Audio Processing Volume: 6
Number: 1 Pages: 12-23
Month: January
   
Abstract:
Decomposition of speech into periodic and aperiodic components is useful in analyzing and describing the characteristics of voice sources. Such a decomposition is also useful in controlling the excitation source for synthesis. This paper addresses the issue of decomposition of speech into periodic and aperiodic components in the context of speech production. The effectiveness of a recently proposed algorithm for decomposing speech into these components is examined for analysis of voice sources. Synthetic signals are generated using formant synthesis. Different sources of aperiodicity encountered in normal speech production are considered, using a set of parameters to control the synthetic signals. The sources of aperiodicity studied are: (1) additive pulsed or continuous random noise, and (2) modulation aperiodicities due to variation in the fundamental frequency, jitter, and shimmer. Three types of measures are used to characterize these voices: ratio of energies in the periodic and aperiodic components, perceptual spectral distance, and spectrograms. The results demonstrate the effectiveness of the periodic-aperiodic decomposition algorithm for analyzing aperiodicities for a wide variety of voices, and point out the limitations of the algorithm
Digital version