Genome reannotation aims for complete and accurate characterization of gene models and thus is of critical significance for in-depth exploration of gene function. Although the availability of massive RNA-seq data provides great opportunities for gene model refinement, few efforts have been made to adopt these precious data in rice genome reannotation. Here we reannotate the rice (Oryza sativa L. ssp. japonica) genome based on integration of large-scale RNA-seq data and release a new annotation system IC4R-2.0. In general, IC4R-2.0 significantly improves the completeness of gene structure, identifies a number of novel genes, and integrates a variety of functional annotations. Furthermore, long non-coding RNAs (lncRNAs) and circular RNAs (circRNAs) are systematically characterized in the rice genome. Performance evaluation shows that compared to previous annotation systems, IC4R-2.0 achieves higher integrity and quality, primarily attributable to massive RNA-seq data applied in genome annotation. Consequently, we incorporate the improved annotations into the Information Commons for Rice (IC4R), a database integrating multiple omics data of rice, and accordingly update IC4R by providing more user-friendly web interfaces and implementing a series of practical online tools. Together, the updated IC4R, which is equipped with the improved annotations, bears great promise for comparative and functional genomic studies in rice and other monocotyledonous species. The IC4R-2.0 annotation system and related resources are freely accessible at http://ic4r.org/.
基因组重注释是不断修正基因模型的过程,对模式生物与非模式生物功能基因的深度解析具有重要意义。转录组测序技术由于能有效地识别基因组中的可变剪接位点,敏感地鉴定出低丰度表达基因与组织特异性基因,在基因组重注释研究中有巨大的应用潜力。鉴于目前水稻中已积累了海量转录组测序数据,我们开发了一套以公共RNA-Seq数据大规模整合分析为基础的基因组注释流程,对水稻基因组开展重注释研究,进而获得了一套新的水稻基因组注释系统:IC4R-2.0。结果表明,IC4R-2.0通过外显子/内含子区域矫正,新UTR区域识别,基因融合及新基因挖掘等方式,对原注释系统中蛋白质编码基因的结构进行了更新。同时,我们对水稻基因组中的长链非编码RNA(lncRNA)与环形RNA(circRNA)进行了鉴定。通过整合多个基因组功能注释平台的资源,我们为水稻基因提供了更为丰富的功能注释信息。不同版本水稻基因组注释系统的定量评估与比较分析表明,大规模整合转录组测序数据的确可以使水稻基因模型的完整度与注释质量获得提升。为方便用户获取水稻基因组重注释信息,我们在水稻生物信息门户IC4R (v 1.0)的基础上进行了重新设计及二次开发,不但有效地整合了水稻基因组重注释信息,还提供了更为友好的数据展示界面,提高了数据检索效率,并提供了一系列丰富而实用的在线分析工具。本研究为在水稻和其他单子叶植物中开展大规模基因功能解析等相关工作提供了数据基础。IC4R-2.0注释系统及相关资源可通过http://ic4r.org/ 来获取。
PDF全文下载地址:
http://gpb.big.ac.cn/articles/download/777
删除或更新信息,请邮件至freekaoyan#163.com(#换成@)
IC4R-2.0: Rice Genome Reannotation Using Massive RNA-seq Data
本站小编 Free考研考试/2022-01-03
相关话题/gen
SR4R: An Integrative SNP Resource for Genomic Breeding and Population Research in Rice
Theinformationcommonsforrice(IC4R)databaseisacollectionof18millionsinglenucleotidepolymorphisms(SNPs)identifiedbyresequencingof5152riceaccessions.Alth ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03HybridSucc: A Hybrid-learning Architecture for General and Species-specific Succinylation Site Predi
Asanimportantproteinacylationmodification,lysinesuccinylation(Ksucc)isinvolvedindiversebiologicalprocesses,andparticipatesinhumantumorigenesis.Here,we ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03Multi-omics Analysis of Primary Cell Culture Models Reveals Genetic and Epigenetic Basis of Intratum
Uncoveringthefunctionallyessentialvariationsrelatedtotumorigenesisandtumorprogressionfromcancergenomicsdataisstillchallengingduetothegeneticdiversitya ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03Tung Tree (Vernicia fordii) Genome Provides A Resource for Understanding Genome Evolution and Improv
Tungtree(Verniciafordii)isaneconomicallyimportantwoodyoilplantthatproducestungoilrichineleostearicacid.Here,wereportahigh-qualitychromosome-scalegenom ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03Schizophrenia-associated MicroRNA–Gene Interactions in the Dorsolateral Prefrontal Cortex
Schizophrenia-associatedanomaliesingeneexpressioninpostmortembraincanbeattributedtoacombinationofgeneticandenvironmentalinfluences.Giventhesmalleffect ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03I3: A Self-organising Learning Workflow for Intuitive Integrative Interpretation of Complex Genetic
Weproposeacomputationalworkflow(I3)forintuitiveintegrativeinterpretationofcomplexgeneticdatamainlybuildingontheself-organisingprinciple.Weillustrateth ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03shinyChromosome: An R/Shiny Application for Interactive Creation of Non-circular Plots of Whole Geno
Non-circularplotsofwholegenomesarenaturalrepresentationsofgenomicdataalignedalongallchromosomes.Currently,thereisnospecializedgraphicaluserinterface(G ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03Gclust: A Parallel Clustering Tool for Microbial Genomic Data
Theacceleratinggrowthofthepublicmicrobialgenomicdataimposessubstantialburdenontheresearchcommunitythatusessuchresources.Buildingdatabasesfornon-redund ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03MakeHub: Fully Automated Generation of UCSC Genome Browser Assembly Hubs
Novelgenomesaretodayoftenannotatedbysmallconsortiaorindividualswhosebackgroundisnotfrombioinformatics.Thisaudiencerequirestoolsthatareeasytouse.Suchne ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03Mapping Genome Variants Sheds Light on Genetic and Phenotypic Differentiation in Chinese
遗传变异和人类健康和精准医疗息息相关,因此绘制全人类基因组遗传变异图谱成为全球科学家共同奋斗的目标。近年来,国际千人基因组等多个研究小组纷纷致力于发现世界不同种族人群中基因组变异。我国是个多民族国家,拥有大约20%的世界人口和丰富的遗传多样性。但由于缺乏中国南北方人群特异的参考基因组以及深度测序数据 ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03