Research Area: Speech Analysis
Year: 2012
Type of Publication: In Proceedings
Keywords: Emotion analysis, emotion synthesis, emotion conversion, dynamic time warping, zero frequency filtering, prosody modification
Authors: P. Gangamohan, Vinay Kumar Mittal, B. Yegnanarayana
Abstract:
This paper aims to understand the components of speech that
contribute to emotion characteristics in speech. Four components
of speech (vocal tract, excitation, duration and intonation)
are considered in this study. A Flexible Analysis Synthesis Tool
(FAST) is developed to modify the features of an utterance from
neutral to emotion or from emotion to neutral. The key ideas
used in this work are the dynamic time warping algorithm for
alignment of two utterances and a flexible prosody manipulation
for incorporating the desired features. The tool is used for
conversion of neutral to emotion speech. Subjective evaluation
is performed based on listening tests. The tool has potential
to convert neutral to emotion speech and vice-versa, which can
lead to understanding the significance of various components
contributing to emotional content in speech.
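
The abstract names dynamic time warping as the way two utterances are aligned, but it does not give the features, distance measure, or step constraints used in FAST. The following is a minimal textbook sketch under assumed choices (Euclidean frame distance, unit steps in three directions, no windowing). The function name `dtw_align` and its inputs are hypothetical and are not taken from the paper.

```python
import numpy as np


def dtw_align(x, y):
    """Align two feature sequences with classic dynamic time warping.

    x, y: arrays of shape (n,) or (n, d), one row per frame.
    Returns (total_cost, path), where path is a list of (i, j) pairs
    mapping frame i of x to frame j of y.
    Assumption: Euclidean local distance and the basic three-way step
    pattern; the paper may use different choices.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if x.ndim == 1:
        x = x[:, None]
    if y.ndim == 1:
        y = y[:, None]
    n, m = len(x), len(y)

    # Local cost: Euclidean distance between every pair of frames.
    dist = np.linalg.norm(x[:, None, :] - y[None, :, :], axis=2)

    # acc[i, j] = minimum cumulative cost of aligning x[:i] with y[:j].
    acc = np.full((n + 1, m + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            best_prev = min(acc[i - 1, j - 1], acc[i - 1, j], acc[i, j - 1])
            acc[i, j] = dist[i - 1, j - 1] + best_prev

    # Backtrack from the end of both sequences to recover the warping path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        steps = (acc[i - 1, j - 1], acc[i - 1, j], acc[i, j - 1])
        k = int(np.argmin(steps))
        if k == 0:
            i, j = i - 1, j - 1
        elif k == 1:
            i -= 1
        else:
            j -= 1
    path.reverse()
    return acc[n, m], path
```

In a setting like the one described, `x` and `y` might be frame-level features of a neutral and an emotional rendering of the same sentence, and the returned path would indicate which frames correspond, so that duration or intonation values could be transferred between them. That usage is an illustration only, not a description of the tool.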