,Biclustering by sparse canonical correlation analysis

来源 :定量生物学(英文版) | 被引量 : 0次 | 上传用户:Z_L_Q
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Developing appropriate computational tools to distill biological insights from large-scale gene expression data has been an important part of systems biology. Considering that gene relationships may change or only exist in a subset of collected samples, biclustering that involves clustering both genes and samples has become in-creasingly important, especially when the samples are pooled from a wide range of experimental conditions. Methods: In this paper, we introduce a new biclustering algorithm to find subsets of genomic expression features (EFs) (e·g·, genes, isoforms, exon inclusion) that show strong "group interactions"under certain subsets of samples. Group interactions are defined by strong partial correlations, or equivalently, conditional dependencies between EFs after removing the influences of a set of other functionally related EFs. Our new biclustering method, named SCCA-BC, extends an existing method for group interaction inference, which is based on sparse canonical correlation analysis(SCCA) coupled with repeated random partitioning of the gene expression data set. Results: SCCA-BC gives sensible results on real data sets and outperforms most existing methods in simulations. Software is available at https://github. com/pimentel/scca-bc. Conclusions: SCCA-BC seems to work in numerous conditions and the results seem promising for future extensions. SCCA-BC has the ability to find different types of bicluster pattes, and it is especially advantageous in identifying a bicluster whose elements share the same progressive and multivariate normal distribution with a dense covariance matrix.
其他文献
磷是植物生长必需的营养元素之一,也是植物体内重要化合物的组成成分,又以多种方式参与植物体内的各种生理和代谢过程,在农业生产中具有不可替代的作用。但是,磷肥是一种不可再生资源,农业中为了提高产量而提高磷肥的施用量,不仅浪费资源,又会带来很多环境问题。有许多研究表明,不同植物或同一植物的不同基因型对磷的吸收利用能力存在差异。因此,利用科学手段筛选和培育对土壤磷利用率高,活化土壤中非有效态磷能力强的新作
Background: Immune evasion is a fundamental hallmark for cancer.At the early stages of tumor development,immune evasion strategies must be implemented by tumors
新时代散文的发展虽然有点缓慢,甚至寂寞,但毫无疑问,散文的艺术形式正在蜕变,全方位探讨散文新生面的热流正在涌动。散文界的艺术视角正在打开,一批在散文内蕴和美学追求上
Background:Human immunodeficiency virus isolates most often use chemokine receptor CCR5 or CXCR4 as a coreceptor to enter target cells.During early stages of HI
<正> 所有20世纪杰出的发明中,电视也许是公认的最有代表性的发明。印刷业的发展、科学知识的爆炸,使我们人类向前跃进了一大步。但是,就对人们日常生活的影响、人们观念和价值的形成以及人们之间相互的交流而言,电视的作用是十分明显的。它的影响遍及各地,对那些生活在偏僻角落、从未接触过现代文明的人来说,电视似乎向他们展示了另一个星球的情况。电视的确是一项伟大的发明。20年代初,无线电波已在欧洲、北美两个大陆之间穿梭往来,世界仿佛变小了。此后,一个名叫贝尔德的苏格兰人利用信号的传递在屏幕上显示出了图象
亲本是甘蔗杂交育种的基础,甘蔗亲本遗传多样性的信息有助于指导亲本选择和组合配置,也有助于扩大育种计划中亲本材料的遗传多样性,同时,分子指纹图谱的构建是甘蔗品种权保护和基因型鉴别的依据,因此,相关研究具有重要的理论和实践意义。本研究采用5对SSR荧光标记引物,对116份甘蔗常用亲本、创新亲本或新亲本进行SSR标记,构建DNA指纹图谱,通过对SSR标记数据进行聚类分析、主成分分析和遗传相似性分析,获得