Novel genomes are today often annotated by small consortia or individuals who have no background in bioinformatics. This audience requires tools that are easy to use. This need has been addressed by several genome annotation tools and pipelines. Visualizing the resulting annotation is a crucial step of quality control. The UCSC Genome Browser is a powerful and popular genome visualization tool. Assembly Hubs, which can be hosted on any publicly available web server, allow browsing genomes via UCSC Genome Browser servers. The steps for creating custom Assembly Hubs are well documented and the required tools are publicly available. However, creating a novel Assembly Hub takes a large number of steps. In some cases, the format of input files needs to be adapted, which is a difficult task for scientists without a programming background. Here, we describe MakeHub, a novel command line tool that generates Assembly Hubs for the UCSC Genome Browser in a fully automated fashion. The pipeline also allows extending previously created Hubs with additional tracks. MakeHub is freely available for download at https://github.com/Gaius-Augustus/MakeHub.
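As sequencing costs fall, more and more individuals and small research groups can afford to sequence the genomes of non-model organisms of interest. At the same time, tools for annotating protein-coding genes in novel genomes have been developed that scientists of varied backgrounds can use, for example AUGUSTUS, GeneMark-ES/ET, GlimmerHMM, SNAP, and GeMoMa, as well as BRAKER, WebAUGUSTUS, and MAKER. The output of such gene prediction tools is a table-like text file in Gene Transfer Format (GTF) or General Feature Format version 3 (GFF3). In every genome annotation project, visualizing the predicted gene structures is a key step of quality control. Many genome browsers provide this visualization, for example the UCSC Genome Browser, JBrowse, and GBrowse2. Among them, the UCSC Genome Browser is a powerful tool that is widely used by researchers. Assembly Hubs can be hosted on any publicly available web server and allow genomes to be browsed via UCSC Genome Browser servers. The steps for creating custom Assembly Hubs are publicly documented, and the required tools are also publicly available. However, creating a new Assembly Hub involves many steps. In some cases, the format of input files needs to be adapted, which is a difficult task for scientists without a programming background. We therefore developed MakeHub, a new command line tool that generates UCSC Assembly Hubs in a fully automated fashion from the single-species genome annotation output of BRAKER, MAKER, GlimmerHMM, SNAP, and GeMoMa. MakeHub also allows previously created Hubs to be extended with additional tracks. MakeHub is freely available at https://github.com/Gaius-Augustus/MakeHub.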
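To make the "table-like text file" concrete, the following minimal Python sketch parses one GTF record. It is an illustration only and is not taken from MakeHub: the names `GtfRecord` and `parse_gtf_line` and the example record are hypothetical. It assumes the standard nine tab-separated GTF columns and a simplified attribute syntax in which values contain no semicolons.

```python
from typing import Dict, NamedTuple


class GtfRecord(NamedTuple):
    """One feature line of a GTF file (hypothetical helper type)."""
    seqname: str
    source: str
    feature: str
    start: int   # 1-based, inclusive
    end: int     # 1-based, inclusive
    score: str
    strand: str
    frame: str
    attributes: Dict[str, str]


def parse_gtf_line(line: str) -> GtfRecord:
    """Split a GTF line into its nine tab-separated columns."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 9:
        raise ValueError(f"expected 9 columns, found {len(fields)}")

    # Column 9 holds 'key "value";' pairs. This simplified parser
    # assumes that no value contains a semicolon.
    attributes: Dict[str, str] = {}
    for item in fields[8].split(";"):
        item = item.strip()
        if not item:
            continue
        key, _, value = item.partition(" ")
        attributes[key] = value.strip().strip('"')

    return GtfRecord(
        fields[0], fields[1], fields[2],
        int(fields[3]), int(fields[4]),
        fields[5], fields[6], fields[7],
        attributes,
    )


# Made-up example record, for illustration only.
example = 'chr1\tAUGUSTUS\tCDS\t1000\t1200\t0.98\t+\t0\ttranscript_id "g1.t1"; gene_id "g1";'
record = parse_gtf_line(example)
feature_length = record.end - record.start + 1
```

Because GTF coordinates are 1-based and inclusive, the feature length is `end - start + 1`. Adapting files that deviate from this layout is the kind of format adjustment the abstract describes as difficult for users without a programming background.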
PDF full text download:
http://gpb.big.ac.cn/articles/download/738
MakeHub: Fully Automated Generation of UCSC Genome Browser Assembly Hubs