Research Area: | Speech Analysis | Year: | 2011 | ||||
Type of Publication: | In Proceedings | Keywords: | Emotions, ZFF, strength of excitation, instanta- neous pitch, duration | ||||
Authors: | B. Yegnanarayana, D. Govind, S. R. M. Prasanna | ||||||
Abstract: | |||||||
This work uses instantaneous pitch and strength of excitation
along with duration of syllable-like units as the parameters for
emotion conversion. Instantaneous pitch and duration of the
syllable-like units of the neutral speech are modified by the
prosody modification of its linear prediction (LP) residual us-
ing the instants of significant excitation. The strength of excita-
tion is modified by scaling the Hilbert envelope (HE) of the LP
residual. The target emotion speech is then synthesized using
the prosody and strength modified LP residual. The pitch, du-
ration and strength modification factors for emotion conversion
are derived using the syllable-like units of initial, middle and
final regions from an emotion speech database having different
speakers, texts and emotions. The effectiveness of the region
wise modification of source and supra segmental features over
the gross level modification is confirmed by the waveforms,
spectrograms and subjective evaluations. |
|||||||
Digital version |