Identification of genetic variants via high-throughput sequencing (HTS) technologies has been essential for both fundamental and clinical studies. However, to what extent the genome sequence composition affects variant calling remains unclear. In this study, we identified 63,897 multi-copy sequences (MCSs) with a minimum length of 300 bp, each of which occurs at least twice in the human genome. The 151,749 genomic loci (multi-copy regions, or MCRs) harboring these MCSs account for 1.98% of the genome and are distributed unevenly across chromosomes. MCRs containing the same MCS tend to be located on the same chromosome. Gene Ontology (GO) analyses revealed that 3800 genes whose UTRs or exons overlap with MCRs are enriched for Golgi-related cellular component terms and various enzymatic activities in the GO biological function category. MCRs are also enriched for loci that are sensitive to neocarzinostatin-induced double-strand breaks. Moreover, genetic variants discovered by genome-wide association studies and recorded in dbSNP are significantly underrepresented in MCRs. Using simulated HTS datasets, we show that false variant discovery rates are significantly higher in MCRs than in other genomic regions. These results suggest that extra caution must be taken when identifying genetic variants in the MCRs via HTS technologies.
                                
                                
                            
                        
PDF全文下载地址:
http://gpb.big.ac.cn/articles/download/810
删除或更新信息,请邮件至freekaoyan#163.com(#换成@) 
The Biological Significance of Multi-copy Regions and Their Impact on Variant Discovery
本站小编 Free考研考试/2022-01-03
相关话题/gen
- Pooled Plasmid Sequencing Reveals the Relationship Between Mobile Genetic Elements and AntimicrobialPlasmidsremainimportantmicrobialcomponentsmediatingthehorizontalgenetransfer(HGT)anddisseminationofantimicrobialresistance.Tosystematicallyexplorether ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03
- SeSaMe PS Function: Functional Analysis of the Whole Metagenome Sequencing Data of the Arbuscular MyInthisstudy,weintroduceanovelbioinformaticsprogram,Spore-associatedSymbioticMicrobesPosition-specificFunction(SeSaMePSFunction),forposition-specificfu ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03
- SeSaMe: Metagenome Sequence Classification of Arbuscular Mycorrhizal Fungi-associated MicroorganismsArbuscularmycorrhizalfungi(AMF)areplantrootsymbiontsthatplaykeyrolesinplantgrowthandsoilfertility.Theyareobligatebiotrophicfungithatformcoenocyticmult ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03
- m6A Regulates Liver Metabolic Disorders and Hepatogenous DiabetesN6-methyladenosine(m6A)isoneofthemostabundantmodificationsonmRNAsandplaysimportantrolesinvariousbiologicalprocesses.Theformationofm6Aiscatalyzedbyamet ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03
- Corrigendum to “Antibiotic Treatment Drives the Diversification of the Human Gut Resistome” [GenomicPDF全文下载地址:/articles/download/818 ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03
- Tissue-specific Gene Expression Changes Are Associated with Aging in MiceAgingisacomplexprocessthatcanbecharacterizedbyfunctionalandcognitivedeclineinanindividual.Agingcanbeassessedbasedonthefunctionalcapacityofvitalorgansa ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03
- Transcriptomic and Proteomic Analysis of Mannitol-metabolism-associated Genes in Saccharina japonicaAsacarbon-storagecompoundandosmoprotectantinbrownalgae,mannitolissynthesizedandthenaccumulatedathighlevelsinSaccharinajaponica(Sja);however,theunderly ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03
- The Wolfiporia cocos Genome and Transcriptome Shed Light on the Formation of Its Edible and MedicinaWolfiporiacocos(F.A.Wolf)hasbeenpraisedasafooddelicacyandmedicineforcenturiesinChina.Here,wepresentthegenomeandtranscriptomeoftheChinesestrainCGMCC5.7 ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03
- Global Analysis of Gene Expression Profiles Provides Novel Insights into the Development and EvolutiChinesemittencrab(Eriocheirsinensis)isanimportantaquaculturespeciesinCrustacea.Functionalanalysis,althoughessential,hasbeenhinderedduetothelackofsuffi ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03
- mrMLM v4.0.2: An R Platform for Multi-locus Genome-wide Association StudiesPreviousstudieshavereportedthatsomeimportantlociaremissedinsingle-locusgenome-wideassociationstudies(GWAS),especiallybecauseofthelargephenotypicerrori ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03
