Analysis on transcriptome sequenced for Pinus massoniana | 马尾松转录组测序和分析
2013
Wang Xiaofeng, Nanjing Forestry University, Nanjing (China) | He Weilong, Nanjing Forestry University, Nanjing (China) | Cai Weijia, Nanjing Forestry University, Nanjing (China)
Chinese. 本研究首次构建了马尾松均一化cDNA文库,采用Illumina高通量测序技术对转录组进行了测序,利用生物信息学方法开展基因表达谱的研究、功能基因的预测。EST序列拼接获得83680个contig,其中33772个contig被注释为相应的331669对生物学功能,10647个contig被注释具有酶功能。根据KEGG pathway数据库,对马尾松转录组的contig进行Pathway生物学通路的注释和预测,共识别出10647个contig具有对应的1029种酶功能,并关联到135条生物学通路。SSR查找发现,从83680个contig中找到889个SSR位点,占contig总数的比例为1.06%。其中,三核苷酸重复所占比例最高,达到48.37%,其次是六核苷酸重复,为19.12%,比例最低的是四核苷酸重复,仅为4.72%,二核苷酸重复和五核苷酸重复基本相同,分别为14.62%和13.16%。SSR不同重复基元类型中,出现频率最高的为AT/AT,其次是AGC/CTG和AAG/CTT。
English. The transcriptome of shoots from a seven-year-old Pinus massoniana tree was sequenced with Illumina high-throughput sequencing technology to study expression profiles and predict functional genes. Sequence assembly yielded 83 680 contigs, of which 33 772 were annotated with 331 669 pairs of biological functions and 10 647 were annotated with enzyme functions. Annotation and prediction of biological pathways against the KEGG pathway database identified 10 647 contigs corresponding to 1 029 enzyme functions and associated with 135 biological pathways. A total of 889 SSR loci were found in the 83 680 contigs, equivalent to 1.06% of the total number of contigs. Among the EST-SSRs, trinucleotide repeats were the most frequent (48.37%), followed by hexanucleotide repeats (19.12%); tetranucleotide repeats were the least frequent (4.72%), and dinucleotide and pentanucleotide repeats occurred at similar proportions (14.62% and 13.16%, respectively). Among repeat motif types, AT/AT was the most frequent, followed by AGC/CTG and AAG/CTT.
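The abstract does not say which tool or thresholds were used for the SSR search, so the following is only a minimal Python sketch of how perfect SSRs of the five reported classes (di- to hexanucleotide) could be located in a contig. The function name `find_ssrs` and the minimum repeat counts in `MIN_REPEATS` are assumptions made for illustration, not the authors' settings.

```python
import re

# Assumed minimum number of repeats per motif length (hypothetical values;
# the abstract does not report the thresholds actually used).
MIN_REPEATS = {2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, repeat_count) for each perfect SSR found in seq.

    start is the 0-based position of the first base of the repeat run.
    Results are grouped by motif length (shortest motifs first).
    """
    seq = seq.upper()
    hits = []
    for size, min_rep in sorted(min_repeats.items()):
        # A motif of `size` bases followed by at least (min_rep - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are just a shorter unit repeated (e.g. ATAT, AAA),
            # so each run is reported only under its shortest repeat unit.
            if any(size % k == 0 and motif == motif[:k] * (size // k)
                   for k in range(1, size)):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // size))
    return hits


if __name__ == "__main__":
    # Toy contig containing one dinucleotide and one trinucleotide SSR.
    contig = "GG" + "AT" * 6 + "TT" + "AGC" * 5 + "GG"
    for start, motif, count in find_ssrs(contig):
        print(start, motif, count)
```

Counting the hits by motif length over all contigs would give class proportions of the kind reported in the abstract. Grouping a motif with its reverse complement (as in AGC/CTG) would need an extra normalisation step that this sketch leaves out.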