Peripheral Nonlinear Time Spectrum Features Algorithm for Large Vocabulary Mandarin Automatic Speech

来源 :Tsinghua Science and Technology | 被引量 : 0次 | 上传用户:horns01
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
This work describes an improved feature extractor algorithm to extract the peripheral features of point x(ti,fj) using a nonlinear algorithm to compute the nonlinear time spectrum (NL-TS) pattern. The algo- rithm observes n×n neighborhoods of the point in all directions, and then incorporates the peripheral fea- tures using the Mel frequency cepstrum components (MFCCs)-based feature extractor of the Tsinghua elec- tronic engineering speech processing (THEESP) for Mandarin automatic speech recognition (MASR) sys- tem as replacements of the dynamic features with different feature combinations. In this algorithm, the or- thogonal bases are extracted directly from the speech data using discrite cosime transformation (DCT) with 3×3 blocks on an NL-TS pattern as the peripheral features. The new primal bases are then selected and simplified in the form of the ?dp- operator in the time direction and the ?dp- operator in the frequency di- t f rection. The algorithm has 23.29% improvements of the relative error rate in comparison with the standard MFCC feature-set and the dynamic features in tests using THEESP with the duration distribution-based hid- den Markov model (DDBHMM) based on MASR system. This work describes an improved feature extractor algorithm to extract the peripheral features of point x (ti, fj) using a nonlinear algorithm to compute the nonlinear time spectrum (NL-TS) pattern. The algo- rithm observes n × n neighborhoods of the point in all directions, and then incorporates the peripheral fea- tures using the Mel frequency cepstrum components (MFCCs) -based feature extractor of the Tsinghua elec- tronic engineering speech processing (THEESP) for Mandarin automatic speech recognition (MASR) sys- tem as replacements of the dynamic features with different feature combinations. In this algorithm, the or-anogonal bases are extracted directly from the speech data using discrite cosine transformation (DCT) with 3 × 3 blocks on an NL-TS pattern as the peripheral features. The new primal bases are then selected and simplified in the form of the? dp-operator in the time direction and the? dp-operator in the frequency di- t The algorithm has 23.29% improvements of the relative error rate in comparison with the standard MFCC feature-set and the dynamic features in tests using THEESP with the duration distribution-based hid-den Markov model (DDBHMM) based on MASR system.
其他文献
由于天津现代城钢结构施工的构件重量超出塔吊起重性能范围,因此采用中联QY80H531汽车吊上栈桥进行吊装工作.考虑栈桥承载力较小,因此分析汽车吊工作状态下几种最不利工况的
通过对掺钢纤维的橡胶混凝土进行弯拉力学性能试验,研究了钢纤维的加入对橡胶混凝土弯拉性能的影响,探讨了钢纤维掺量变化对橡胶混凝土的弯拉强度的影响.试验结果表明:当钢纤
在新专辑《反转地球》里,潘玮柏特别创作了一首叫《谢谢》的抒情慢歌。在歌中他写到,“我会笑着难过,他能给你保护,代替我的照顾,这是我最后的祝福。谢谢你的结束,冷却后的残
首先对煤矸石混凝土在国内的研究现状进行简要阐述.然后进行高性能煤矸石混凝土试验,分为C30、C40、C50三组,着重研究C30、C40,以C50为突破,进行探讨性的试验研究,从而获得其
本文对组合结构进行了基本定义,指出了抗剪连接件在组合结构设计中的重要作用.给出抗剪连接件的基本分类.同时,结合我国设计规范,给出了单个抗剪连接件的设计承载力计算方法,
详细介绍了地下室结构空间受力计算分析的结果与常规设计计算方法的数据差异,以期设计人员在地下室工程设计中提供一定的参考.
叠合构件在现代的建筑工程当中大量被运用,在我国的规范中也有详细的规定,笔者借鉴德国钢筋混凝土设计规范DIN1045-1,并结合试验与单一混凝土梁进行比较,得出在接触面无配置
《电影画刊》创刊20多年了,我们拥有了一大批忠实的铁杆读者,从创刊就开始期期不落的订阅本刊。有的从少年肘期开始订阅伴随着杂志一起成长;有的从青年时期开始订阅和杂志一
本文介绍大底盘双塔高层结构的设计分析过程,采用SATWE和ETABS对整体和单塔模型进行计算,同时补充弹性时程分析和罕遇地震作用下的静力弹塑性分析.结构分析中,连接各塔的裙房
高性能混凝土(简称HPC)是一种新型高技术混凝土,是在大幅度提高普通混凝土性能的基础上采用现代混凝土技术制作的混凝土.它以耐久性作为设计的主要指标,针对不同用途要求,对