论文部分内容阅读
为探讨紫背天葵(Gynura bicolor)花青素等物质合成的遗传基础,采用高通量测序技术(Illumina HiSeq 2500)对其嫩叶进行转录组测序,共获得21 387 624个序列读取片段(reads),将测序数据进行序列组装后,获得33 314个单基因簇(Unigene),其中超过1 kb的7 792个Unigene中共检测到2 387个SSR位点。对所得Unigene进行不同数据库注释,22 048和14 417个Unigene分别在Nr和Swiss Prot数据库有同源比对信息,并发现有29个Unigene与花青素合成相关;Pfam功能注释到13 909个Unigene分为5 198类相关蛋白功能区域,有12个Unigene涉及到花青素合成;GO注释到11 613个Unigene分为细胞组分、分子功能及生物学过程等3大类51个功能组,有24个Unigene与花青素合成有关;COG注释到的6 589个Unigene功能系统分为24类,而KOG注释到的13 498个Unigene功能系统分为25类;以KEGG数据库为参考,将4 466个Unigene定位到108个代谢途径分支,其中有47个Unigene与类黄酮生物合成相关。
In order to explore the genetic basis of anthocyanin synthesis in Gynura bicolor, the leaves of young leaves were sequenced by high throughput sequencing (Illumina HiSeq 2500), and 21 387 624 reads (reads). After sequenced data were assembled, 33 314 single gene clusters (Unigene) were obtained, of which 2 387 SSR sites were detected in 7 792 Unigene over 1 kb. Unigene annotation of different databases, 22 048 and 14 417 Unigene homologous alignment information in the Nr and Swiss Prot database respectively, and found that 29 Unigene related anthocyanin synthesis; Pfam function annotated to 13 909 Unigene Divided into 5 198 categories of related protein functional areas, 12 Unigene related to anthocyanin synthesis; GO Notes to 11 613 Unigene divided into cell components, molecular functions and biological processes in three categories 51 functional groups, there 24 Unigene were related to anthocyanin synthesis; 6 589 Unigene functional systems annotated by COG were classified into 24 categories, while 13 498 Unigene functional systems annotated by KOG were classified into 25 categories; with KEGG database as reference, 4 466 Unigene locates in 108 metabolic pathway branches, of which 47 Unigene are associated with flavonoid biosynthesis.