Speech and Vision Lab

Analysis of acoustic events in speech signals using Bessel series expansion
Research Area: Speech Analysis
Year: 2013
Type of Publication: Article
Keywords: Bessel series expansion; Voice onset time; Glottal closure instant; DESA; AM-FM
Authors: Chetana Prakash, Dhananjaya N., Suryakanth V Gangashetty  
   
Abstract:
In this paper, we propose an approach for the analysis and detection of acoustic events in speech signals using the Bessel series expansion. The acoustic events analyzed are the voice onset time (VOT) and the glottal closure instants (GCIs). The hypothesis is that Bessel functions, whose basis functions resemble damped sinusoids, are better suited to representing speech signals than the sinusoidal basis functions used in the conventional Fourier representation. The speech signal is band-pass filtered by choosing an appropriate range of Bessel coefficients to obtain a narrow-band signal, which is further decomposed into amplitude-modulated (AM) and frequency-modulated (FM) components. The discrete energy separation algorithm (DESA) is used to compute the amplitude envelope (AE) of the narrow-band AM-FM signal. Events such as the consonant and vowel onsets in an unvoiced stop consonant-vowel (SCV) unit, and the GCIs, are derived by processing the AE of the signal. The proposed approach for detecting the VOT using the Bessel expansion is shown to perform better than detection based on the conventional Fourier representation. The performance of the proposed GCI detection method using the Bessel series expansion is compared against some existing methods under various noise environments and signal-to-noise ratios.
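
The amplitude-envelope step described in the abstract can be illustrated with a small sketch. The abstract names only "DESA" and does not say which variant the authors use, so the code below assumes the DESA-2 form built on the Teager-Kaiser energy operator. The function names (`teager_energy`, `desa2`), the `eps` guard, and the test signal are hypothetical choices made for illustration, not the paper's implementation. The sketch also assumes its input is already narrow-band, so the Bessel-coefficient band-pass step is not shown.

```python
import math


def teager_energy(x):
    """Teager-Kaiser energy psi[n] = x[n]^2 - x[n-1]*x[n+1] for interior samples.

    Element i of the result corresponds to sample i + 1 of x.
    """
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


def desa2(x, eps=1e-12):
    """Estimate amplitude envelope and frequency of a narrow-band signal (DESA-2).

    Returns (amp, freq), with freq in radians per sample. Both lists are
    four samples shorter than x: element j corresponds to sample j + 2 of x.
    The estimate is only meaningful for frequencies below pi/2 rad/sample.
    """
    psi_x = teager_energy(x)
    # Symmetric difference y[n] = x[n+1] - x[n-1]; y[i] aligns with x[i + 1].
    y = [x[n + 1] - x[n - 1] for n in range(1, len(x) - 1)]
    psi_y = teager_energy(y)  # psi_y[j] aligns with x[j + 2]

    amp, freq = [], []
    for j, py in enumerate(psi_y):
        px = psi_x[j + 1]  # same time instant as psi_y[j]
        if px <= eps or py <= eps:
            # Assumption: report zero where the energies are too small to divide.
            amp.append(0.0)
            freq.append(0.0)
            continue
        c = max(-1.0, min(1.0, 1.0 - py / (2.0 * px)))
        freq.append(0.5 * math.acos(c))
        amp.append(2.0 * px / math.sqrt(py))
    return amp, freq


if __name__ == "__main__":
    # Hypothetical AM signal: a slowly varying envelope on a 0.3 rad/sample carrier.
    signal = [(1.0 + 0.3 * math.cos(0.01 * n)) * math.cos(0.3 * n) for n in range(400)]
    envelope, _ = desa2(signal)
    print(min(envelope), max(envelope))
```

For a pure sinusoid the DESA-2 estimates are exact up to rounding. For a slowly modulated signal they track the envelope approximately. Event detection as outlined in the abstract would then operate on the returned envelope.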