Speech and Vision Lab

Word boundary hypothesization in Hindi speech
Research Area: Uncategorized
Year: 1991
Type of Publication: Article
Authors: G. V. Ramana Rao, B. Yegnanarayana
Journal: Computer Speech & Language
Volume: 5
Number: 4
Pages: 379-392
   
Note:
http://www.sciencedirect.com/science/article/B6WCW-4D7027Y-5/2/d0724f0327c2d2aa3b7f35a6a605563f
Abstract:
This paper proposes a method for hypothesizing word boundaries in Hindi speech. The method is based on the observation that function words such as case markers, pronouns and conjunctions occur frequently in Hindi text. Spotting these frequently occurring patterns is proposed as a means of hypothesizing word boundaries in a speech-to-text conversion system for Hindi. The idea was first tested on correct text with all word boundaries (except sentence boundaries) removed; nearly 67% of the word boundaries were correctly hypothesized. Later experiments, with input containing errors simulated to represent a speech environment, showed that the proposed method remains effective even at error levels as high as 50%. The implications of these results for the development of a speech-to-text conversion system for Hindi are discussed.
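
The first experiment in the abstract (spotting function words in text whose word boundaries have been removed) can be illustrated with a minimal sketch. This is not the authors' implementation: the romanized sentence, the two-entry function-word list (the case markers "ne" and "ko"), the greedy longest-first matching, and the names hypothesize_boundaries and true_boundaries are all assumptions made for illustration.

```python
def hypothesize_boundaries(text, function_words):
    """Return character offsets in `text` where a word boundary is hypothesized.

    Assumption: a boundary is placed immediately before and after every
    spotted function word. Matching is greedy and tries longer patterns first.
    """
    boundaries = set()
    patterns = sorted(function_words, key=len, reverse=True)
    i = 0
    while i < len(text):
        for p in patterns:
            if text.startswith(p, i):
                boundaries.add(i)            # boundary before the function word
                boundaries.add(i + len(p))   # boundary after the function word
                i += len(p)
                break
        else:
            i += 1  # no function word starts here; move on one character
    # Sentence start and end are given, so they are not counted as hypotheses.
    boundaries.discard(0)
    boundaries.discard(len(text))
    return boundaries


def true_boundaries(sentence):
    """Offsets of the real word boundaries once spaces are removed."""
    offsets, pos = set(), 0
    words = sentence.split()
    for w in words[:-1]:
        pos += len(w)
        offsets.add(pos)
    return offsets


# Hypothetical romanized example; "ne" and "ko" are Hindi case markers.
sentence = "ram ne sita ko phal diya"
unspaced = sentence.replace(" ", "")

hyp = hypothesize_boundaries(unspaced, {"ne", "ko"})
ref = true_boundaries(sentence)

correct = hyp & ref
print(f"correctly hypothesized: {len(correct)} of {len(ref)} boundaries")
print(f"false hypotheses: {len(hyp - ref)}")
```

In this toy sentence the two case markers recover four of the five internal boundaries; the boundary between the two content words "phal" and "diya" is missed because no function word is adjacent to it. A short pattern can also match inside a content word and produce a false hypothesis, which this sketch makes no attempt to filter out.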