Clustering is a prevalent analytical means to analyze single cell RNA sequencing (scRNA-seq) data but the rapidly expanding data volume can make this process computationally challenging. New methods for both accurate and efficient clustering are of pressing need. Here we proposed Spearman subsampling-clustering-classification (SSCC), a new clustering framework based on random projection and feature construction, for large-scale scRNA-seq data. SSCC greatly improves clustering accuracy, robustness, and computational efficacy for various state-of-the-art algorithms benchmarked on multiple real datasets. On a dataset with 68,578 human blood cells, SSCC achieved 20% improvement for clustering accuracy and 50-fold acceleration, but only consumed 66% memory usage, compared to the widelyused software package SC3. Compared to k-means, the accuracy improvement of SSCC can reach 3-fold. An R implementation of SSCC is available at https://github.com/Japrin/sscClust.
PDF全文下载地址:
http://gpb.big.ac.cn/articles/download/701
删除或更新信息,请邮件至freekaoyan#163.com(#换成@)
SSCC: A Novel Computational Framework for Rapid and Accurate Clustering Large-scale Single Cell RNA-
本站小编 Free考研考试/2022-01-03
相关话题/gen
SeqSQC: A Bioconductor Package for Evaluating the Sample Quality of Next-generation Sequencing Data
Asnext-generationsequencing(NGS)technologyhasbecomewidelyusedtoidentifygeneticcausalvariantsforvariousdiseasesandtraits,anumberofpackagesforcheckingNG ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03How Microbes Shape Their Communities? A Microbial Community Model Based on Functional Genes
Exploringthemechanismsofmaintainingmicrobialcommunitystructureisimportanttounderstandbiofilmdevelopmentormicrobiotadysbiosis.Inthispaper,weproposeafun ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03GPA: A Microbial Genetic Polymorphisms Assignments Tool in Metagenomic Analysis by Bayesian Estimati
Identifyingantimicrobialresistant(AMR)bacteriainmetagenomicssamplesisessentialforpublichealthandfoodsafety.Next-generationsequencing(NGS)technologyhas ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03Rice Genomics: over the Past Two Decades and into the Future
Domesticrice(OryzasativaL.)isoneofthemostimportantcerealcrops,feedingalargenumberofworldwidepopulations.Alongwithvarioushigh-throughputgenomesequencin ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03Development of the “Third-Generation” Hybrid Rice in China
RiceisamajorcerealcropforChina.Thedevelopmentofthe“three-line”hybridricesystembasedoncytoplasmicmalesterilityinthe1970s(first-generation)andthe“two-li ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03Discovery of Novel Androgen Receptor Ligands by Structure-based Virtual Screening and Bioassays
Androgenreceptor(AR)isaligand-activatedtranscriptionfactorthatplaysapivotalroleinthedevelopmentandprogressionofmanyseverediseasessuchasprostatecancer, ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03Recent Advances in Function-based Metagenomic Screening
Metagenomesfromunculturedmicroorganismsarerichresourcesfornovelenzymegenes.Themethodsusedtoscreenthemetagenomiclibrariesfallintotwocategories,whichare ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03An Exome-seq Based Tool for Mapping and Selection of Candidate Genes in Maize Deletion Mutants
Despitethelargenumberofgenomicandtranscriptomicresourcesinmaize,thereisstillmuchtolearnaboutthefunctionofgenesindevelopmentalandbiochemicalprocesses.S ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03The Genome of Opium Poppy Reveals Evolutionary History of Morphinan Pathway
PDF全文下载地址:/articles/download/681 ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03Polyphyly in 16S rRNA-based LVTree Versus Monophyly in Whole-genome-based CVTree
Wereportanimportantbutlong-overlookedmanifestationoflow-resolutionpowerof16SrRNAsequenceanalysisatthespecieslevel,namely,in16SrRNA-basedphylogenetictr ...中科院北京基因组研究所 本站小编 Free考研考试 2022-01-03