论文部分内容阅读
A detection system for American English glides /w y r l/in a knowledge-based automatic speech recognition system is presented.The method uses detection of dips in band-limited energy to total energy ratios,instead of detecting dips along the unmodified band-limited energy contours.By using band-limited energy ratio,the dip detection is applicable in not only intervocalic regions but also in non-intervocalic regions.A Gaussian mixture model(GMM)based classifier is then used to separate the detected vowels and nasals.This approach is tested using the TIMIT corpus and results in an overall detection rate of 69.5%,which is a 4.7% absolute increase in detection rate compared with an hidden Markov model(HMM)based phone recognizer.
A detection system for American English glides / wyrl / in a knowledge-based automatic speech recognition system is presented. The method uses detection of dips in band-limited energy to total energy ratios, instead of detecting dips along the unmodified band-limited energy contours .By using band-limited energy ratio, the dip detection is applicable in not only intervocalic regions but also in non-intervocalic regions. A Gaussian mixture model (GMM) based classifier is then used to separate the detected vows and nasals. This approach is tested using the TIMIT corpus and results in an overall detection rate of 69.5%, which is a 4.7% absolute increase in detection rate compared with an hidden markov model (HMM) based phone recognizer.