Decoding the Structural Keywords in Protein Structure Universe

来源 :计算机科学技术学报(英文版) | 被引量 : 0次 | 上传用户:a41808829739
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Although the protein sequence-structure gap continues to enlarge due to the development of high-throughput sequencing tools, the protein structure universe tends to be complete without proteins with novel structural folds deposited in the protein data bank (PDB) recently. In this work, we identify a protein structural dictionary (Frag-K) composed of a set of backbone fragments ranging from 4 to 20 residues as the structural keywords that can effectively distinguish between major protein folds. We firstly apply randomized spectral clustering and random forest algorithms to construct representative and sensitive protein fragment libraries from a large scale of high-quality, non-homologous protein structures available in PDB. We analyze the impacts of clustering cut-offs on the performance of the fragment libraries. Then, the Frag-K fragments are employed as structural features to classify protein structures in major protein folds defined by SCOP (Structural Classification of Proteins). Our results show that a structural dictionary with ~4004- to 20-residue Frag-K fragments is capable of classifying major SCOP folds with high accuracy.
其他文献
期刊
A tightly secure cryptographic scheme refers to a construction with a tight security reduction to a hardness assumption, where the reduction loss is a small con
请下载后查看,本文暂不支持在线获取查看简介。 Please download to view, this article does not support online access to view profile.
期刊
期刊
日本大同电子公司灰塚弘在2010年7月开发出全球最高性能的低Dy热变形磁环,牌号为ND-43SHR和ND-39SHR,目前可提供样品试用。该法用急冷法制造Nd—Fe—B薄带,
探讨母体铅染毒对其仔鼠大脑皮层组织中IL-1β和TNF-α表达量的影响,揭示铅神经毒性的潜在机制。母鼠采用自由饮水的方式自妊娠1d开始经饮水染铅(0.1%,0.5%和1.0%的浓度溶解
2012年7月31日,省联社通化办事处与柳河县政府在县宾馆举行战略合作协议签字仪式,省联社党委副书记、主任李世杰,中小企业部经理杨明,柳河县委书记王坤、代县长蒋海燕出席签
患者 女,43岁。左上腹部包块3年伴疼痛1月余入院。查体:左上腹侧卧位时,可触及一活动度差、无压痛肿块。肝、脾未触及。实验室检查无异常。B超报告左上腹探及一大小约为9cm