Examining the practical limits of batch effect-correction algorithms: When should you care about bat

删除或更新信息，请邮件至freekaoyan#163.com(#换成@)

本站小编 Free考研考试/2022-01-01

Longjian Zhoua,
Andrew Chi-Hau Suea,
Wilson Wen Bin Gohb
aSchool of Pharmaceutical Science and Technology, Tianjin University, Tianjin, 30072, China
bSchool of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, 637551, Singapore

More InformationCorresponding author: E-mail address: goh.informatics@gmail.com (Wilson Wen Bin Goh)
Received Date: 2019-05-11
Accepted Date:2019-08-04
Rev Recd Date:2019-08-02
Available Online: 2019-09-20 Publish Date:2019-09-20

Abstract

Abstract

Batch effects are technical sources of variation and can confound analysis. While many performance ranking exercises have been conducted to establish the best batch effect-correction algorithm (BECA), we hold the viewpoint that the notion of best is context-dependent. Moreover, alternative questions beyond the simplistic notion of “best” are also interesting: are BECAs robust against various degrees of confounding and if so, what is the limit? Using two different methods for simulating class (phenotype) and batch effects and taking various representative datasets across both genomics (RNA-Seq) and proteomics platforms, we demonstrate that under situations where sample classes and batch factors are moderately confounded, most BECAs are remarkably robust and only weakly affected by upstream normalization procedures. This observation is consistently supported across the multitude of test datasets. BECAs do have limits: When sample classes and batch factors are strongly confounded, BECA performance declines, with variable performance in precision, recall and also batch correction. We also report that while conventional normalization methods have minimal impact on batch effect correction, they do not affect downstream statistical feature selection, and in strongly confounded scenarios, may even outperform BECAs. In other words, removing batch effects is no guarantee of optimal functional analysis. Overall, this study suggests that simplistic performance ranking exercises are quite trivial, and all BECAs are compromises in some context or another.
Keywords: Batch effects,
Bioinformatics,
Feature selection,
Normalization,
Statistics

PDF全文下载地址:

http://www.jgenetgenomics.org/article/exportPdf?id=ade2f73c-5f50-4bfb-93c4-910dad8786fe&language=en

相关话题/Examining practical limits

领限时大额优惠券,享本站正版考研考试资料!
优惠券领取后72小时内有效，10万种最新考研考试考证类电子打印资料任你选。涵盖全国500余所院校考研专业课、200多种职业资格考试、1100多种经典教材，产品类型包含电子书、题库、全套资料以及视频，无论您是考研复习、考证刷题，还是考前冲刺等，不同类型的产品可满足您学习上的不同需求。 ...
考试优惠券本站小编 Free壹佰分学习网 2022-09-19
Replication Protein A large Subunit (RPA1a) Limits Chiasma Formation During Rice Meiosis
Yongjie Miao, Wenqing Shi, Hongjun Wang, Zhihui Xue, Hanli You, Fanfan Zhang, Guijie Du, Ding Tang, Yafei Li, Yi Shen, Zhukuan Cheng Plant Ph ...
中科院遗传与发育生物学研究所本站小编 Free考研考试 2022-01-01
A Practical Guide to Amplicon and Metagenomic Analysis of Microbiome Data
Yong-Xin Liu, Yuan Qin, Tong Chen, Meiping Lu, Xubo Qian, Xiaoxuan Guo, Yang Bai Protein & Cell Abstract Advances ...
中科院遗传与发育生物学研究所本站小编 Free考研考试 2022-01-01
PandaX limits on the light dark matter with a light mediator in the singlet extension of MSSM
WenyuWang1,,Jia-JunWu1,,Zhao-HuaXiong1,,JunZhao2,3,,1.InstituteofTheoreticalPhysics,CollegeofAppliedScience,BeijingUniversityofTechnology,Beijing10012 ...
中科院高能物理研究所本站小编 Free考研考试 2022-01-01
王健教授：Homogenization of jump processes: limits and convergence rates
Academy of Mathematics and Systems Science, CAS Colloquia & Seminars ...
中科院数学与系统科学研究院本站小编 Free考研考试 2021-12-26
Prof. Lei Wu：Hydrodynamic Limits in Kinetic Equations IV: More on Boundary Layer in General Domains
Academy of Mathematics and Systems Science, CAS Colloquia & Seminars ...
中科院数学与系统科学研究院本站小编 Free考研考试 2021-12-26
Prof. Lei Wu:Hydrodynamic Limits in Kinetic Equations III: Boundary Layer in Disk Domains
Academy of Mathematics and Systems Science, CAS Colloquia & Seminars ...
中科院数学与系统科学研究院本站小编 Free考研考试 2021-12-26
Prof. Lei Wu:Hydrodynamic Limits in Kinetic Theory II: Boundary Layer in Flat Domains
Academy of Mathematics and Systems Science, CAS Colloquia & Seminars ...
中科院数学与系统科学研究院本站小编 Free考研考试 2021-12-26
Prof. Lei Wu:Hydrodynamic Limits in Kinetic Equations Part I: Asymptotic Analysis
Academy of Mathematics and Systems Science, CAS Colloquia & Seminars ...
中科院数学与系统科学研究院本站小编 Free考研考试 2021-12-26
Decode-seq: a Practical Approach to Improve Differential Gene Expression Analysis
Yingshu Li, Hang Yang, Hujun Zhang, Yongjie Liu, Hanqiao Shang, Herong Zhao, Ting Zhang and Qiang Tu Genome Biology Abstra ...
中科院遗传与发育生物学研究所本站小编 Free考研 2020-05-26
Towards scalable and practical Oblivious RAM
时间：2019年6月10日（周一）上午10:00　　地点：计算所446会议室　　报告人：Ass. Prof. Rujia Wang, Illinois Institute of Technology　　摘要：Oblivious RAM (ORAM) is a security primitive ...
中科院计算技术研究所本站小编 Free考研 2020-05-26