删除或更新信息,请邮件至freekaoyan#163.com(#换成@)

49AISNP:东亚北方三个族群遗传来源推断

本站小编 Free考研考试/2022-01-01

郭晓媛,1,2, 孙昌春1,2, 薛思瑶1,2, 赵慧2, 江丽,2, 李彩霞,1,2 1. 山西医科大学,太原 030001
2. 公安部物证鉴定中心,法医遗传学公安部重点实验室,现场物证溯源技术国家工程实验室,北京 100038

49AISNP: a study on the ancestry inference of the three ethnic groups in the north of East Asia

Xiaoyuan Guo,1,2, Changchun Sun1,2, Siyao Xue1,2, Hui Zhao2, Li Jiang,2, Caixia Li,1,2 1. Shanxi Medical University, Taiyuan 030001, China
2. Key Laboratory of Forensic Genetics, Beijing Engineering Research Center of Crime Scene Evidence Examination, National Engineering Laboratory for Forensic Science, Institute of Forensic Science, Beijing 100038, China

通讯作者: 江丽,博士,副主任法医师,研究方向:法医遗传学。E-mail:jl@mail.bnu.edu.cn;李彩霞,博士,主任法医师,研究方向:法医遗传学。E-mail:licaixia@tsinghua.org.cn

编委: 朱波峰
收稿日期:2021-02-25修回日期:2021-05-31
基金资助:国家自然科学基金项目资助编号.81772027


Received:2021-02-25Revised:2021-05-31
Fund supported: Supported by the National Natural Science Foundation of China No.81772027

作者简介 About authors
郭晓媛,在读硕士研究生,专业方向:法医学。E-mail:827633812@qq.com








摘要
样本的族群来源推断在法医调查中可发挥重要作用,一个理想的推断体系是用一组较少的遗传标记实现较高的族群推断准确性。本研究调研搜集了区分东亚北方三个族群北方汉族、日本人和韩国人的428个祖先信息SNP (ancestry informative SNP, AISNP),获取了其在三个族群307份样本中的分型,通过位点Fst值及等位基因频率聚类等信息进一步精简位点,最终得到了一组49AISNP组合。基于307份样本利用留一法对49AISNP进行推断准确性验证,结果表明其在北方汉族、日本和韩国族群中的推断准确性均高于99%。49AISNP组合将有助于东亚地区亚族群的进一步区分。
关键词: 法医遗传学;祖先信息SNP;族群推断;Fst;东亚北方族群

Abstract
The ancestry inference of unknown samples plays an important role in forensic investigations. An ideal panel is a set of few markers with high ancestry inference accuracy. We collected 428 AISNP (ancestry informative SNP) that can distinguish the three ethnic groups in north of East Asia, including northern Han, Japanese and Korean. The genotypes of 428 AISNP in 307 samples from these three ethnic groups were obtained. Based on the information of Fst value and clustering by allele frequency, the panel was further refined into 49AISNP smart panel. Inference accuracy of the 49AISNP was verified by the leave-one-out method with 307 samples, and the results showed that its accuracy was higher than 99% in the northern Han, Japanese and Korean ethnic groups. This panel can also be helpful to further distinguish the ethnic sub-groups in East Asia.
Keywords:forensic genetics;AISNP;ancestry inference;Fst;ethnic groups in north of East Asia


PDF (1129KB)元数据多维度评价相关文章导出EndNote|Ris|Bibtex收藏本文
本文引用格式
郭晓媛, 孙昌春, 薛思瑶, 赵慧, 江丽, 李彩霞. 49AISNP:东亚北方三个族群遗传来源推断. 遗传[J], 2021, 43(9): 880-889 doi:10.16288/j.yczz.21-073
Xiaoyuan Guo. 49AISNP: a study on the ancestry inference of the three ethnic groups in the north of East Asia. Hereditas(Beijing)[J], 2021, 43(9): 880-889 doi:10.16288/j.yczz.21-073


单核苷酸多态性(single nucleotide polymorphisms, SNPs)是指基因组中单个碱基的变异形成的多态性,多数SNP表现为基因座上二等位基因的变化[1]。SNP在人类基因组中分布广泛,突变率低,具有较高的遗传稳定性。在不同族群之间频率分布差异大的SNP标记可用于预测族群来源,被称为祖先信息SNP (ancestry informative SNP, AISNP)[2,3,4]。利用一组AISNP即可判断个体的族源信息。目前已报道了大量洲际族群区分的AISNP体系[5,6,7,8],在洲际内部进行进一步细分的二级族群推断体系也有报道[9,10,11],例如可区分东亚、南亚、北亚的74-plex SNP等[9]。然而对东亚内部的区分推断仍有待进一步深入研究。

东亚人口众多[12,13],有研究表明东亚地区存在从南到北的遗传差异[14,15],且已发表不同的AISNP体系对其内部不同族群进行区分[11,16,17],这为研究东亚内部族群进化和遗传多样性奠定了基础。在东亚北方地区,北方汉族、日本人和韩国人有着相似的外貌特征,如黄皮肤、黑眼睛、鼻子短而扁平等;语言也有相似之处,如韩语和日语在结构、元音以及缺乏连词方面有着显著的共同特点[18]、汉字早在公元5世纪传入日本[19]。基于线粒体DNA (mitochondrial DNA, mtDNA)[20,21]、Y染色体[18]和短串联重复序列[22]的研究也表明三个族群间存在密切的遗传关系。最近有研究发现基于全基因组水平可区分汉族、日本和韩国族群[17,23],也有研究建立了包含142、98、59及36个AISNP的四组嵌套体系,可实现汉族、日本和韩国族群的区分,但是随着位点数目的减少,区分准确率会下降至90%[16]。对法医学应用而言,需筛选一组精简高效的AISNP,用于目标族群的推断区分,同时又可满足现场微量样本的检验需求。本研究从已发表文献中收集了东亚北方三个族群的428个AISNP及307份样本中的分型数据集,筛选了一组适用于法医学应用的AISNP,使其在这些族群中具有良好的区分度,以进一步提高族群推断技术在东亚北方的分辨率。

1 材料与方法

1.1 样本

参考数据库、测试样本群体信息见表1。参考数据库307份样本包括:来自千人基因组计划的103份北方汉族样本和104份日本样本[24],以及亚洲多样性计划 (分型基于affymetrix genome-wide human SNP array 6.0芯片)[17]的100份韩国样本。测试样本包括来自2个族群的38份样本:来自人类基因组多样性计划的10份北方汉族样本以及来自人类基因组多样性计划和西蒙斯基因组多样性计划的28份日本族群样本[25]

1.2 SNP位点来源

通过文献调研,从以下几个方面收集了428个AISNP位点:(1)中国科学院上海生命科学研究院徐书华课题组的203个AISNP[17],用于北方汉族、日本和韩国族群的两两区分;(2)中国科学院北京基因组研究所陈华课题组的142个AISNP[16],用于区分中国、日本和韩国族群的体系;(3)本课题组前期筛选的88个AISNP[26,27,28,29],针对北方汉族与日本族群以及东亚南北方族群区分的位点。具体信息见附表1

Table 1
表1
表1样本信息
Table 1Sample information
族群简称族群信息样本量样本来源
CHB中国北京的汉族人103千人基因组计划
JPT东京的日本人104千人基因组计划
KOR朝鲜人100亚洲多样性计划
CHB-C中国北方汉族人10人类基因组多样性计划
JPT-C日本人28人类基因组多样性计划、西蒙斯基因组多样性计划
测试样本以斜体表示,其余为参考数据库样本信息。

新窗口打开|下载CSV

1.3 质量控制

去除分型缺失率超过10%的样本和AISNP位点。使用Haploview v4.1软件计算总族群中AISNP的连锁不平衡值[30],去除连锁不平衡(r2>0.2)的位点[31,32]

1.4 位点筛选及评估

在法医学应用中,具有相似信息量同时包含较少位点的组合最为实用。为了实现这个目标,需要剔除信息量较少或冗余的SNP以获得更为精简的体系。Fst值是群体遗传学中衡量群体间分化程度的一个重要指标,其范围是0~1,值越大表明群体间的遗传分化程度越大[33]。在族群推断应用方面,可以衡量一组AISNP位点的信息含量。使用Genepop v4.7.5软件计算每个位点在三个族群中总的Fst值以及等位基因频率[34],基于Fst值,以0.01为单位取不同范围的AISNP组合进行分析,确保优化后的AISNP组合提供与全部AISNP相似的族群推断信息量。

使用R v3.3.2软件基于等位基因频率绘制热图并进行主成分分析(principal components analysis, PCA),使用R v4.0.2软件基于每个AISNP位点对PCA中样本聚类结果的贡献度值(Cos2值)进行可视化展示。基于热图聚类结果删除Fst值较小的位点来减少冗余信息[9]。使用Snipper网站(http://mathgene.usc.es/snipper/)的交叉验证选项计算人群特异分化值(population-specific divergence, PSD)[35],通过比较其数值大小可以衡量这组位点在不同人群之间的区分平衡度[5]

1.5 推断方法准确性验证

使用DNA族群推断系统软件(DNA ancestry analyzer version1.0, DAA v1.0)[36]计算三个族群307份样本的群体匹配概率(population assignment match probability, AMP)和似然比(likelihood ratio, LR),并进行族群聚类分析,参数设置为K=3,运行20次,统计每个样本的AMPLR值。通过样本数据库的留一法验证,对筛选的AISNP进行准确性评估,即依次从数据库取出一份样本作为未知族群来源的样本,其余作为参考数据库进行分析,以此类推,直到所有样本均被测试一次为止。基于留一法验证的族群推断结果有3种情况:(1)一致性结论,AMP排名第一位的人族群与样本信息来源一致且LR大于10;(2)不排除结论,LR小于或等于10表明不排除AMP排名前两位的族群;(3)错误结论,AMP排名第一位的族群与样本信息来源不一致且LR大于10。根据样本推断族群来源结果与其实际所属族群的一致与否,统计推断结果的准确性。

测试样本准确性验证:以307份样本为参考数据库,38份测试集样本为未知族群来源的样本,计算每个测试集的AMPLR值,并对推断结果的准确性进行统计。

1.6 位点的对比评估

为了验证本筛选方法的有效性,使用R v4.0.2软件中的sample函数进行随机抽样,从质控后的位点组合中随机抽取两组与最终组合数目相同的位点组合,并对其进行留一法准确性验证,与本研究所筛选的位点组合的准确性进行对比。

2 结果与分析

2.1 位点筛选及评估

307份样本的分型缺失率均小于10%,在428AISNP中25个位点的分型缺失率大于10%,去除后得到403个AISNP。基于三个族群307份样本在403个AISNP上的分型,在三个族群中总Fst值见附表1。连锁不平衡分析结果如附图1所示,共有70个SNP位点分别位于24个单体型块中,每个单体型块代表着位点之间存在高度连锁。附表2列出了相互连锁位点间的r2值,统计各个单体型块中位点的Fst值,仅留下每个单体型块中Fst值最大的AISNP位点,共得到357个AISNP。

Table 2
表2
表249AISNP位点信息
Table 2Information of the 49AISNP
序号rs号染色体第37版序列位置等位基因Fst序号rs号染色体第37版序列位置等位基因Fst
1rs10088365810097398G/A0.270926rs26226378106505971G/A/T0.0951
2rs104897441165380623G/A0.069027rs26420661765639014G/T0.0910
3rs105210769108878562C/T0.077528rs28384082145246422A/G0.0750
4rs1089472411133547942T/A/G0.081229rs29763968143764001G/A0.1521
5rs10973829938446392C/T0.078130rs3111745321827159G/T0.0751
6rs110347091138428289A/G0.108331rs3217805124388084C/A/G/T0.0844
7rs118415891373814891G/T0.086232rs3747222147839830G/A0.0740
8rs118467101458343352G/A0.076833rs39237367155060730A/C/G/T0.0714
9rs119590125167668843C/T0.069434rs4353835332446775C/T0.0849
10rs12006467935090720T/C0.099335rs4372441117657259C/T0.0787
11rs120397151242801261G/C0.172636rs4718412766286867T/C0.0751
12rs124837692247790072T/C0.087037rs50220791876478188A/G0.0931
13rs12565191465736324G/A0.102738rs61237232037082145C/T0.0891
14rs13229441353683163A/G0.074039rs64369712231582797T/C0.0815
15rs13330991373691236G/A0.071540rs657612714106194763C/T0.1076
16rs16785371257900341G/A0.088541rs76558494161867415T/A0.0759
17rs1712180010108710873C/T0.100742rs9288442137999799C/T0.0846
18rs172076815138168527A/G0.082343rs9291151171591189T/G0.0757
19rs174517395144030993C/T0.073244rs93211806130102959C/T0.0866
20rs1745201488006776T/G0.089945rs95492121341022093C/T0.0708
21rs17599827589518433A/C0.081046rs98964431747125982T/C0.0738
22rs17631488518777746A/G0.103047rs2770310670165296C/A/T0.0924
23rs19813701874070815A/G0.072248rs3027238178135061T/C0.9803
24rs2269275583261405A/G0.068849rs457839711131832284G/T0.0737
25rs2295622237599065G/T0.0827

新窗口打开|下载CSV

基于357个AISNP在三个族群中等位基因频率绘制的热图见附图2。在热图中,颜色的深浅代表了AISNP位点的基因频率在不同群体中的相似性和差异性,颜色越红表示基因频率越高,越浅表示基因频率越低,且具有相似频率分布的AISNP通常处于聚类树的同一分枝中。去除信息含量较少及冗余的位点以获得具有相似信息量的较小的体系,例如图中红色标记位点处,rs11104947与rs229562处于同一聚类小枝中且颜色相似,而rs11104947的Fst值(0.0782)低于rs229562的Fst值(0.0827),所以可删除rs11104947位点。经过多轮测试,最终获得了一组49个AISNP(以下简称49AISNP),表2列出了49AISNP的详细信息,三个族群的总Fst值介于0.0688~0.9803,平均值为0.1083。图1显示基于49AISNP在三个族群中的等位基因频率热图。

图1

新窗口打开|下载原图ZIP|生成PPT
图1基于49AISNP的三个族群等位基因频率热图

Fig. 1Heatmap of 49AISNP based on the allele frequencies in three ethnic groups



表3显示了49个位点在各族群中的累计PSD值。可以看到三个族群之间的累计PSD值差异在1.4左右,说明该49AISNP在这三个族群之间的区分平衡度较好。

Table 3
表3
表349AISNP在每个族群中的累计PSD
Table 3The cumulative PSD value of 49AISNP in each ethnic group
49AISNP北方汉族日本韩国
累计PSD3.245663.969982.53635

新窗口打开|下载CSV

2.2 基于49AISNP的群体遗传结构

基于49AISNP的三个族群STRUCTURE聚类分析在K=3时的结果见图2,可观察到北方汉族、日本和韩国三个族群间明显的遗传差异。群体水平的祖先成分见饼状图,各颜色的比例代表该族群的祖先成分比例。如图所示,黄色、绿色和蓝色分别代表韩国、北方汉族和日本的祖先成分,其中,北方汉族族群中绿色成分占0.95;日本族群中蓝色成分占0.95;韩国族群中黄色成分占0.96。个体水平祖先成分见柱状图,其中每个柱形表示为一个样本,每种颜色代表一个祖先成分,每个条中各颜色的长度对应各样本的祖先成分比例。各颜色代表的祖先成分与饼状图相同。其中似然值最高的聚类结果显示:北方汉族族群样本的北方汉族成分中位数为0.98 (95%置信区间为0.94~0.97);日本族群样本的日本成分中位数为0.98 (95%置信区间为0.93~0.97);韩国族群样本的韩国成分中位数为0.98 (95%置信区间为0.95~0.98)。

主成分分析(PCA)可反映个体水平上的族群结构,不同的颜色代表不同的族群,样本的聚类情况可反映族群间的遗传关系,同时根据不同的主成分可进行族群的区分。图3A展示了基于49AISNP的主成分分析图,前2个主成分占变异的23.36%。第一主成分(PC1)占变异的12.96%,可将北方汉族与日本和韩国族群区分开;第二主成分(PC2)占变异的10.40%,可将日本族群与其他两族群区分开。图3B展示了基于每个AISNP位点对PCA中样本聚类结果的Cos2值分析,图中颜色由绿色到红色以及箭头的长短均代表Cos2值的大小,图中只展示Cos2值排名前18个AISNP的名称,其余位点的Cos2值参照箭头长短及颜色变化。结合Fst值来看,每个位点对样本聚类的贡献值大小排序也对应该位点的Fst值大小排序。

图2

新窗口打开|下载原图ZIP|生成PPT
图249AISNP在K=3时的三个族群STRUCTURE分析结果(Ln P(D)=-12,331.7)

Fig. 2The STRUCTURE analysis for three ethnic groups at K = 3 of 49AISNP(Ln P(D)= -12,331.7)



图3

新窗口打开|下载原图ZIP|生成PPT
图3主成分分析及Cos2值分析

A:基于三个族群在49AISNP的等位基因频率进行主成分分析;B:基于每个AISNP位点对PCA中样本聚类结果的Cos2值分析。
Fig. 3Principal component analysis and Cos2 value analysis



2.3 留一法准确性验证

基于49AISNP的留一法准确性验证结果见表4,北方汉族、韩国样本族群推断结果一致性为100%;日本样本族群推断结果一致性为99.04%,错误率为0.96%,1例错判样本信息见表5

Table 4
表4
表4基于49AISNP的参考数据库族群推断结果统计
Table 4Statistic of sample assignment in reference ethnic groups using 49AISNP
族群北方汉族日本韩国总计
北方汉族103(0)103
日本1(0)103(0)104
韩国100(0)100
每格数字代表归到该组的样本数目,括号中的数字代表其中LR值小于10的样本数目。

新窗口打开|下载CSV

Table 5
表5
表5基于参考数据库的错判样本信息
Table 5Information of misjudged sample in reference ethnic groups
样本来源样本编号族群群体匹配概率似然比祖先成分
日本NA18976北方汉族1.96E-150.75
日本8.10E-172.43E+010.16
韩国1.83E-191.07E+040.09

新窗口打开|下载CSV

2.4 测试样本验证

基于49AISNP的测试集准确性验证结果见表6,北方汉族族群推断结果一致性为80%,不排除结论为20%,错误率为0;日本样本族群推断结果一致性占82.15%,不排除结论占10.71%,错误率占7.14%。

Table 6
表6
表6基于49AISNP的测试集族群推断结果统计
Table 6Statistic of sample assignment in test ethnic groups using 49AISNP
族群北方汉族日本韩国总计
北方汉族9(1)1(1)10
日本2(2)24(1)2(0)28
每格数字代表归到该组的样本数目,括号中的数字代表其中LR值小于10的样本数目。

新窗口打开|下载CSV

2.5 49AISNP对比评估

图3显示了基于三个族群307份样本在357AISNP下的STRUCTURE聚类结果,在K=3时,可观察到三个族群明显的遗传差异,基于群体水平的祖先成分图显示北方汉族族群中绿色成分占0.97;日本族群中蓝色成分占0.96;韩国族群中黄色成分占0.98。基于个体水平祖先成分柱状图显示北方汉族族群样本的北方汉族成分中位数为0.99(95%置信区间为0.94~0.97);日本族群样本的日本成分中位数为0.99 (95%置信区间为0.93~0.97);韩国族群样本的韩国成分中位数为0.99 (95%置信区间为0.95~ 0.98)。同时利用主成分分析来评估357AISNP在个体水平上的族群结构,结果如附图4所示,前2个主成分占变异的9.03%。第一主成分(PC1)占变异的5.42%,可区分北方汉族和其他两族群;第二主成分(PC2)占变异的3.61%,可将韩国族群与其他两族群区分开。对比49AISNP与357AISNP的结果,两组AISNP均表现出良好的族群聚类能力,具有相似的族群推断准确性,且49AISNP的PC1和PC2总比例相较于357AISNP有所上升,证明通过一系列的筛选,我们成功地获得了一组具有相似信息量且位点数较少的组合。

图4

新窗口打开|下载原图ZIP|生成PPT
图449AISNP与两组不同的、随机49AISNP在三个族群中的族群推断一致性和错误率比较

A:49AISNP与两组不同的、随机49AISNP在三个族群中的族群推断一致性比较;B:49AISNP与两组不同的、随机49AISNP在三个族群中的族群推断错误率比较。
Fig. 4Comparison of ancestry inference accuracy and error rate for the 49 AISNP with two different, random samplings of 49AISNP in the three ethnic groups



从357AISNP中用随机抽样的方法抽取的两组49AISNP,分别编为49AISNP-Random1和49AISNP- Random2。这两组的族群推断一致性和错误率与49AISNP的结果对比分别见图4A图4B。结果显示两组随机49AISNP在三大族群中的族群推断一致性均比通过Fst值以及热图筛选出的49AISNP的低,且推断错误率较高,进一步说明我们的位点筛选方法是有效的。

3 讨论

汉族人口约占中国总人口的91.6%[37],是世界上最大的遗传群体,有研究表明汉族与韩国人或日本人间的遗传关系比与中国的少数民族间的关系更为密切[22]。现代日本人有双重血统,源于绳纹文化的狩猎采集者与弥生文化的移民之间的混合,他们从中国南部的长江口通过韩国带来了稻谷农业[38]。这种历史迁徙造成了如今汉族、日本与韩国之间的遗传差异,而近年来随着中、日、韩三国之间社会经济的频繁交流,大量人口离开其出生地,旅居他国,融入当地社会。使用遗传标记进行样本的族群来源推断在法医实践应用中起着关键作用。随着测序技术的发展,高密度遗传标记越来越多地被应用在族群推断上,因其能够获到高分辨率的区分度[39],然而,在实际应用中,高密度遗传标记并不适用于微量或降解检材的分型检测,位点数目少但信息量高的族群推断位点组合,对于法医学应用是最实用的。因此,我们需要在高密度遗传标记中剔除冗余的标记,本研究利用较高的Fst值筛选出了一组具有较高信息量且位点数较少的位点组合,为后续的体系构建及进一步验证奠定基础。

在体系的测试评估方面,一般用以下几种方法评估族群推断的准确性:一是用随机抽样的方式,将现有数据库以一定比例划分为训练集和测试集,用训练集内样本数据进行位点筛选,用测试集内样本对筛选结果进行准确性验证[40];二是在应用现有数据库进行位点筛选后,新采集一部分样本,用新的测试集进行验证[41];三是用交叉验证的方法评估数据库的准确性[40],当样本量较大时,用十折交叉验证的方法,当样本量较少时,选择留一法评估数据库的准确性。但是将所有数据均用于训练集和测试集,容易造成筛选体系的过拟合,即最终模型在训练集上效果好;在测试集上效果差。为了避免过拟合的出现,一般可以重采样来评价模型效能,或者采用多次交叉验证的方法,求其均值估计算法准确性。由于本文中样本量较少,故未采用随机抽样的方法划分训练集和测试集,而是采用了留一法来逐一抽取测试样本,可以最大程度避免由于训练集样本量过少造成的偏差,同时另收集一部分样本做为测试集,以验证筛选体系的准确性。

在分析样本的族群来源时,应综合分析AMPLR和族群成分[7,32,41],AMP值是在三个族群中AISNP组合的等位基因频率计算所得,AMP值最高的族群并不一定是真正的来源族群,当两个族群间AMP值的比值即似然比LR大于10时,可以认为AMP值第一位的族群是该样本来源族群,当似然比小于10时,要结合族群成分进行综合分析[41]。在49AISNP中,参考数据库族群来源推断错误的样本来自于日本族群中的NA18976,该样本的北方汉族和日本的似然比LR值计算结果为2.43E+01,其北方汉族、日本和韩国的族群成分分别为:0.75、0.16和0.09,该样本被推断为北方汉族。分析可能的原因如下:一是有可能推断错误,未来还需增加针对日本族群的特异性位点以达到更高的族群推断准确性;二是样本采样标签错误,或者由于历史或社会等原因,样本本身可能也不知道自己的族群来源[9]

从测试集的验证结果来看,利用49AISNP基于307份参考数据库上的测试集族群推断一致性结论占81.58%,不排除结论占13.16%,错误率仅5.26%。证明我们筛选的49AISNP可以有效地区分东亚的这三个族群。综上所述,本研究获得了一组49AISNP可以进行东亚北方三个族群的区分,且具有高鉴别能力和区分平衡度。该组合将有助于东亚地区二级族群推断体系的发展,进一步提升东亚族群的内部区分力。但是本研究中涉及的样本量较少,未来可增加样本量,进一步验证该组合在北方汉族、日本和韩国族群中的推断能力。

附录:

附加材料详见文章电子版 www.chinagene.cn

Supplementary Table 1
附表1
附表1428AISNP位点信息
Supplementary Table 1Information of the 428AISNP
序号rs号染色体第37版序列位置等位基因Fst序号rs号染色体第37版序列位置等位基因Fst
1rs100184324189437086C/A0.046040rs10957985881789921G/A0.0645
2rs100587395144074365G/A0.029741rs1097200693430684T/C0.0266
3rs10088365810097398G/A0.270942rs10973829938446392C/T0.0781
4rs10095442242239348G/C0.043043rs10991718993716118T/C0.0048
5rs10107492880568935T/A/C0.050344rs110127161021762267A/G0.0302
6rs101101948119479124G/C/T0.058545rs110234631115209506A/C0.0052
7rs101323361474869017A/G0.032746rs110323311133678161G/T0.0519
8rs1013248314102299420T/C0.051947rs110347091138428289A/G0.1083
9rs101349031495229777G/A0.036048rs110429111110678785T/C0.0432
10rs101395751457087118A/C/G0.034249rs1105896112127544998G/A0.0438
11rs10166397244372909G/T0.049050rs110860121916032643A/C0.0537
12rs101718912195971878C/T0.043151rs111049471288942980G/A0.0782
13rs101772732122239287C/G0.078252rs11112122159312733C/G/T-0.0007
14rs10269754113971374T/G0.022953rs11128125369389614T/A/C/G0.0035
15rs10277413755238464T/A/G0.029454rs111331444177229719A/C/T0.0446
16rs1043107911116440678A/G0.072855rs111515271867347133A/G0.0506
17rs104596641535064934C/T0.050556rs111598821489525994G/T0.0479
18rs104839911488682616G/A0.060757rs11161614185898160T/G0.0299
19rs1048626078859616C/T0.021058rs111643541102439329G/A0.0436
20rs104897441165380623G/A0.069059rs111742611262362843C/T0.0473
21rs105062941251070554T/C0.001960rs111776981269906287A/G0.0534
22rs105067251277449514T/C0.044461rs1122285111131831335G/A0.0683
23rs105210769108878562C/T0.077562rs1122355011133532372A/G0.0527
24rs10756234911316481C/T-0.000463rs1122476511101310590C/T0.0981
25rs107680171133681025G/T0.057164rs11257349106260717T/G0.0065
26rs107719231232320912C/T0.033265rs11258321652421140T/A0.0337
27rs1077812512101962634C/A0.051366rs11466640438778903G/A/T0.0090
28rs10806975623766367A/G/T0.037267rs116110331263771136C/T0.0488
29rs108190669101356091C/G0.026268rs116211211465822493C/T0.0990
30rs108330711119209050T/C0.032869rs116254851480242132T/C0.0538
31rs108360921133676946G/A/C0.051570rs116258761435248467C/T0.0559
32rs108360931133678526A/G0.053071rs11680024249535856A/C-0.0040
33rs1084761612128910954A/C/T0.030872rs1178148718778113A/C0.0795
34rs1084918112638166C/T0.052573rs118415891373814891G/T0.0862
35rs108588831289893348T/C-0.000474rs118467101458343352G/A0.0768
36rs1088373610104320029G/T0.036575rs118472631465775695T/A/G0.0635
37rs10890510148334490A/C0.049576rs11874960181948608A/C/G/T0.0049
38rs1089403411129255618C/T0.052177rs119590125167668843C/T0.0694
39rs1089472411133547942T/A/G0.081278rs12006467935090720T/C0.0993
79rs120397151242801261G/C0.1726119rs13910994143373910A/G/T0.0025
80rs12045644140084488G/T0.0393120rs14202881654477881T/C0.0519
81rs120955031171653329C/T0.0346121rs14205281652417241C/G0.0374
82rs121340131164459913C/T0.0586122rs14206383181497374G/A0.0407
83rs12141436168463212G/A0.0418123rs14229315167397540C/T0.0462
84rs121517672198274929G/A0.0548124rs14281505132490807A/G0.0666
85rs1223161712124930127C/G0.0458125rs14425021166548317G/A0.0277
86rs12351269916806521T/C0.0558126rs14530542181359907T/C/G0.0119
87rs124599411910666112G/A0.0631127rs1465306766699784T/C0.0672
88rs124837692247790072T/C0.0870128rs14657592142576915T/C0.0365
89rs124886903150490463G/A0.0502129rs148848537951364T/C0.0125
90rs12522710565051804G/A0.0125130rs1522340370369504G/A0.0320
91rs12565191465736324G/A0.1027131rs15249302140968460A/G0.0649
92rs12565201465737193C/A0.0958132rs153752393649097A/C0.0512
93rs125679901207681685C/T-0.0038133rs15383741072718336G/A0.0663
94rs1257194210127954919G/A0.0056134rs1551653834324240A/G0.0084
95rs125869121434023334G/T0.0422135rs15587231454063C/T0.0227
96rs125880611450481926T/C0.0703136rs15612011838205375G/A0.0330
97rs125898351430848932C/T0.0615137rs1566857446048880A/T0.0468
98rs125988521613329469G/A0.0500138rs1605391663353953C/G0.0641
99rs12630087380935171T/C0.0429139rs16097635142018424T/G0.0232
100rs12632233394033744G/T0.0334140rs1613215377738317T/C0.0509
101rs126766848118857704G/C/T0.0584141rs16525191558612113G/A/C0.0219
102rs126915572141469911G/A/T0.0461142rs16785371257900341G/A0.0885
103rs127681451017610277G/A0.0532143rs168347052193385209A/G0.0168
104rs127764421059297413A/G0.0075144rs168509132166421763A/G0.0450
105rs12846051257921188C/T0.0863145rs168626273150114701G/A-0.0022
106rs13076655380920612C/T0.0412146rs16893074524074021T/A0.0042
107rs13094402380710455A/G0.0467147rs16895073565596821T/C0.0302
108rs131603995165711426G/A0.0550148rs169441491710938783T/G0.0223
109rs13175481012245520G/A0.0575149rs169514721397073977C/T0.0503
110rs1318642247271217T/G0.0450150rs169868502041048928T/C0.0307
111rs13229441353683163A/G0.0740151rs169911802010148468G/A0.0655
112rs1324958489710943A/G0.0786152rs169977704124693346T/C0.0611
113rs13251921199244248G/C/T0.0625153rs170027372242282012C/T0.0454
114rs13330991373691236G/A0.0715154rs170161754143391299A/G0.0018
115rs1341969527347211A/T0.0808155rs17029405332451697C/A0.0508
116rs13625531652438954C/G0.0329156rs17034099311220006C/A0.0412
117rs13715662119738187T/G0.0337157rs170729841862051788G/A0.0312
118rs1378365461775645G/A0.0587158rs170763285173143508G/A0.0164
159rs1712180010108710873C/T0.1007199rs2175743170295901C/T0.0413
160rs17129041166996037T/C0.0529200rs2194757221706029G/A0.0108
161rs172076815138168527A/G0.0823201rs21958561187922876A/C0.0548
162rs172392584183263055A/G0.0500202rs22539742123231832C/T0.0187
163rs173726954155060317A/G-0.0030203rs22559572242241372G/A/T0.0440
164rs173776432242152988A/C0.0565204rs2269275583261405A/G0.0688
165rs174517395144030993C/T0.0732205rs22696582242280618T/C0.0375
166rs1745201488006776T/G0.0899206rs22783391850866195T/G0.0298
167rs1745341161549458A/G-0.0023207rs2295622237599065G/T0.0827
168rs1745831161609750C/T-0.0046208rs22957561135241229T/C0.0000
169rs17469810380807568G/A0.0390209rs2313427168815000G/A0.0470
170rs17582830438867427A/G0.0004210rs23300152222536956C/T0.0678
171rs17583068438908616G/C/T0.0558211rs23336561758184540T/C0.0576
172rs17599827589518433A/C0.0810212rs234909312128520953T/A0.0004
173rs17631488518777746A/G0.1030213rs23521811087653766G/A0.0257
174rs17717717371404405G/A0.0320214rs23536861686354605T/C0.0566
175rs177666211471262932T/C0.0349215rs2377962189101722G/T0.0522
176rs178237951463384260G/A0.0411216rs24301841187786661G/T0.0482
177rs17940721161303803C/T0.0446217rs25263711756443545G/A0.0501
178rs17957041258739693C/A0.0502218rs25473531958582797G/C/T0.0666
179rs183529811112057620G/A0.0428219rs2556305127677188T/C0.0133
180rs18413051850163950A/T0.0556220rs258728781663275C/A0.0337
181rs186169312107603157C/T0.0471221rs25876942120213497A/G0.0026
182rs18643071857332158T/G0.0440222rs2589787538260303G/A0.0374
183rs1875174102971488C/G/T0.0806223rs26226378106505971G/A/T0.0951
184rs187769689712822C/G/T0.0956224rs26420661765639014G/T0.0910
185rs1890622137349278G/A0.0429225rs2645645477859067A/G/T0.0469
186rs18982271663282080A/G/T0.0546226rs26461511202649C/T0.0211
187rs19017863130021945G/A0.0411227rs2706151362793T/C0.0761
188rs19449751857276748T/C0.0369228rs2770310670165296C/A/T0.0924
189rs19813701874070815A/G0.0722229rs2793438624461427G/C0.0382
190rs1984641161521621G/A-0.0050230rs28367492140287238A/C/T0.0548
191rs2008819739662268C/T0.0459231rs28373522141363778C/T0.0312
192rs2011071588741077A/G0.0583232rs28384082145246422A/G0.0750
193rs20350234137753683T/C0.0549233rs285366851300025G/T0.0580
194rs20472974121511851G/C/T0.0271234rs29011422124764392C/T0.0727
195rs20687461711389480T/C0.0464235rs29457338134615750T/G0.0602
196rs2099563380740366G/A0.0447236rs2972336576504150G/C-0.0028
197rs2160913188716090A/C/G0.0216237rs29763968143764001G/A0.1521
198rs2174739679659170A/G0.0042238rs3027238178135061T/C0.9803
239rs31045171527940339A/G0.0288279rs46905084178295766G/T0.0504
240rs3111745321827159G/T0.0751280rs4718412766286867T/C0.0751
241rs3158085169750158G/T0.0626281rs4752351810029988G/A/T0.0289
242rs3217805124388084C/A/G/T0.0844282rs47563051136292065T/A/C0.0532
243rs321967778314549G/A/C0.0454283rs4790409171243941C/T0.0600
244rs34052939380765431C/G/T0.0447284rs4806316138406328C/A/T0.0620
245rs342542864164411734A/G0285rs48204282241537589A/G0.0762
246rs3747222147839830G/A0.0740286rs48342264128986874C/T-0.0008
247rs3757419766029429C/T0.0718287rs48395231116800016T/C0.0284
248rs37574257150067640A/C0.0706288rs48469921230137518A/G0.0426
249rs3758229995078852C/A/G0.0574289rs4865142457549583C/T0.0372
250rs3775539499304600A/G0.0452290rs4865290453278689G/A0.0307
251rs39101422012921966T/C0.0613291rs48839261373843655T/C0.0583
252rs39230842227948726A/G/T0.0586292rs49023911465815979G/T0.0929
253rs39237367155060730A/C/G/T0.0714293rs49037541440865770G/A0.0492
254rs39243089130185385C/A0.0129294rs49129335143038150G/A/C/T-0.0034
255rs4133446586155113T/C0.0645295rs49180001094837743C/T0.0399
256rs4144800438006916G/A0.0228296rs4922234816639271G/A/C/T0.0433
257rs42546433792207G/A0.0307297rs493828511116440011T/C0.0697
258rs42630261873309989G/C0.0466298rs50220791876478188A/G0.0931
259rs42802781629439227C/G0.0841299rs50308831070480868A/C/T0.0759
260rs43256221728526475T/C0.3617300rs5206051184795779C/T0.0581
261rs43287002023709545A/G0.0344301rs5364301079001643C/T0.0107
262rs4353835332446775C/T0.0849302rs57700182249595560C/T0.0411
263rs4355871929653331G/A/C0.0665303rs59960642242172080C/T0.0476
264rs4372441117657259C/T0.0787304rs60004012237149336C/T0.0426
265rs43936691867296561T/C0.0330305rs60077562248282309G/A/C0.0571
266rs44140691200391561T/C0.0478306rs60162262038661387C/T0.0181
267rs4466495917496114C/A/T0.0489307rs60309322042146320T/G0.0659
268rs44868871650677571C/T0.0579308rs60315792043015108C/G0.0654
269rs44911751190956214T/C0.0485309rs60490642023727145C/T0.0360
270rs45330761256069231C/A0.0410310rs60745202012905188T/A/C0.0559
271rs4543050374954560A/C/T0.0194311rs611756220753310G/A-0.0011
272rs457839711131832284G/T0.0737312rs61237232037082145C/T0.0891
273rs46037822102735809T/A0.0642313rs6137010202090118C/T0.0110
274rs4668680210529961A/G0.0199314rs61380462023704388G/C/T0.0342
275rs46747592224068666G/A0.0059315rs616022380711349G/C/T0.0253
276rs4677399374519384T/C0.0307316rs64369712231582797T/C0.0815
277rs46781693124543103C/A0.0557317rs6450876531925423A/G-0.0024
278rs4681869358573534C/T0.0583318rs6465469795179593G/A0.0611
319rs64789669101776125G/A0.0192359rs7608732045393072T/G0.0582
320rs6502840174989857C/T0.0515360rs76301113114165901C/A0.0499
321rs6549596374507710T/C0.0271361rs7642488384688723C/T0.0406
322rs6549601374525187C/T0.0307362rs76558494161867415T/A0.0759
323rs6559935989140339G/A0.0425363rs7666030424728480G/A0.0492
324rs65612941346695350T/C0.0215364rs7674135484577025T/C0.0399
325rs657612714106194763C/T0.1076365rs7694112171693639C/G/T0.0537
326rs658907211109322914C/G0.0501366rs7698051460278484A/G0.0445
327rs65993904956047A/G0.0584367rs7705761199884272A/G0.0565
328rs67077732191344132C/T0.0279368rs7802058781697666A/C0.0512
329rs67174062187743228T/G0.0326369rs7845383888289450G/T0.0361
330rs6738590265059603A/G0.0429370rs7893783154079645C/T0.0205
331rs67439982196676489T/A/C/G0.0612371rs80144751465811537T/C0.0937
332rs676096724669324C/T0.0400372rs80155941440942142C/G0.0688
333rs6767410380699208C/T0.0505373rs80496601689857700A/G0.0398
334rs6768622374508075C/T0.0271374rs80509321650687673C/T0.0545
335rs6833150461659236A/G0.0235375rs80602071682273372G/A0.0287
336rs68986535115975656A/G0.0654376rs8079591922115835A/G0.0561
337rs6924957696853579G/T0.0527377rs80848841875513922A/T0.0597
338rs69550187125115418A/G0.0474378rs81070111930841000A/C0.0341
339rs700644389290753T/C0.0383379rs8272871072708276A/T0.0435
340rs7022178929780195G/A0.0720380rs8834331939206288C/T0.0580
341rs70322319117025771C/A0.0706381rs9288442137999799C/T0.0846
342rs70389649104423907C/T0.0419382rs9291151171591189T/G0.0757
343rs70976171077504287A/G0.0447383rs93025501652405521C/T0.0374
344rs711744711101351383G/A0.1118384rs93036601731420730C/T0.0614
345rs7126455110585627C/T0.0604385rs93211806130102959C/T0.0866
346rs71327811133563491G/A0.0619386rs933199626112893T/C0.0107
347rs71737161570123826T/G0.0723387rs93849816116803328T/C0.0259
348rs71739821536685718C/A/T-0.0013388rs93984346116787017T/G0.0229
349rs71935051652401477C/G0.0394389rs9433279113599422T/C0.0695
350rs71954771629288634T/C0.0020390rs95320801338126537G/A0.0395
351rs72114261753654548G/A/C0.0037391rs95492121341022093C/T0.0708
352rs7268940209961532C/T0.0461392rs95902161395905305T/C-0.0035
353rs72719132039263421C/A0.0155393rs96633671019776910A/C/T0.0370
354rs731764313108536028C/T0.1665394rs9786051094903693A/G0.0208
355rs7387452248246398A/G0.0255395rs98257133100936248C/A0.0450
356rs741245124227248G/A0.0535396rs98261483114464858C/A/T0.0677
357rs7555405131183486C/T0.0420397rs9852677350291617G/A/T0.0412
358rs7582191940294953T/C0.0461398rs98577733149344615C/A0.0547
399rs98604833138730737C/T0.0985414rs1991955629884066A/C0.0938
400rs98964431747125982T/C0.0738415rs218226813113170683G/A0.0573
401rs994142618487281T/G0.0689416rs2261033631603591A/G0.1538
402rs994773718455128A/G0.0477417rs2524095631266117A/C0.0456
403rs99797242122447717A/G0.0383418rs2847844100335874C/A0.0255
404rs109012449136079463C/T0.0077419rs3015224972728388A/G0.1419
405rs110789511738831415T/A/G0.1249420rs3130688631210216C/A/T0.0991
406rs116732761955347963C/T0.0264421rs3135402633024654A/C0.1028
407rs128963991492773663G/T0.0615422rs376877633024606C/G0.0902
408rs1414241921744392G/A/T0.0756423rs48001051819651982C/G/T0.0906
409rs1431403633047031T/C-0.0048424rs57586492242605548C/T0.0059
410rs1496279518680420A/T-425rs60622752061146909C/G/T0.1420
411rs15001275165739982C/T0.0416426rs72073461775668619C/A/G/T0.0105
412rs1745921161618608A/G-0.0049427rs80741241736531565C/T0.0457
413rs19275681399740117T/C0.0137428rs901134384688173G/A0.0574

新窗口打开|下载CSV

Supplementary Table 2
附表2
附表2成对连锁不平衡位点间的r2
Supplementary Table 2r2 of AISNP between paired LD
Blockrs号-1rs号-2r2Blockrs号-1rs号-2r2
Block1rs2853668rs270610.243rs5996064rs22559570.869
Block2rs13249584rs18776960.885rs5996064rs22696580.695
Block3rs6074520rs39101420.771rs5996064rs170027370.574
Block4rs6138046rs43287001rs1009544rs22559570.991
rs6138046rs60490640.993rs1009544rs22696580.795
rs4328700rs60490640.993rs1009544rs170027370.679
Block5rs4355871rs70221780.221rs2255957rs22696580.802
Block6rs10836092rs110323310.98rs2255957rs170027370.686
rs10836092rs108360930.987rs2269658rs170027370.816
rs10836092rs107680170.906Block10rs4486887rs80509320.371
rs11032331rs108360930.981Block11rs7193505rs93025500.959
rs11032331rs107680170.899rs7193505rs14205280.959
rs10836093rs107680170.919rs7193505rs11258320.93
Block7rs11466640rs175828300.449rs7193505rs13625530.911
Block8rs4903754rs80155940.285rs9302550rs14205281
Block9rs17377643rs59960640.849rs9302550rs11258320.95
rs17377643rs10095440.735rs9302550rs13625530.912
rs17377643rs22559570.742rs1420528rs11258320.95
rs17377643rs22696580.62rs1420528rs13625530.912
rs17377643rs170027370.66rs1125832rs13625530.961
rs5996064rs10095440.861Block12rs1678537rs12846050.965
Block13rs198464rs1745340.388rs6767410rs340529390.96
Block14rs6833150rs13783650.278rs6767410rs174698100.944
Block15rs1898227rs1605390.326rs6767410rs130766550.898
Block16rs1256519rs12565200.898rs6767410rs126300870.905
rs1256519rs118472630.565rs13094402rs6160220.468
rs1256519rs80144750.61rs13094402rs20995630.992
rs1256519rs49023910.61rs13094402rs340529390.992
rs1256519rs116211210.606rs13094402rs174698100.976
rs1256520rs118472630.528rs13094402rs130766550.929
rs1256520rs80144750.596rs13094402rs126300870.937
rs1256520rs49023910.575rs616022rs20995630.472
rs1256520rs116211210.572rs616022rs340529390.472
rs11847263rs80144750.814rs616022rs174698100.466
rs11847263rs49023910.791rs616022rs130766550.468
rs11847263rs116211210.791rs616022rs126300870.476
rs8014475rs49023910.95rs2099563rs340529391
rs8014475rs116211210.925rs2099563rs174698100.984
rs4902391rs116211210.975rs2099563rs130766550.937
Block17rs3757419rs47184120.312rs2099563rs126300870.944
Block18rs827287rs15383740.258rs34052939rs174698100.984
Block19rs6549596rs67686221rs34052939rs130766550.937
rs6549596rs46773990.923rs34052939rs126300870.944
rs6549596rs65496010.975rs17469810rs130766550.921
rs6768622rs46773990.923rs17469810rs126300870.929
rs6768622rs65496010.975rs13076655rs126300870.992
rs4677399rs65496010.916Block21rs11224765rs71174470.442
Block20rs6767410rs130944020.952Block22rs4938285rs104310790.876
rs6767410rs6160220.468Block23rs9398434rs93849810.949
rs6767410rs20995630.96Block24rs1391099rs170161750.593

新窗口打开|下载CSV

附图1

新窗口打开|下载原图ZIP|生成PPT
附图1403AISNP连锁不平衡图

Supplementary Fig. 1Linkage disequilibrium diagram of 403AISNP



附图2

新窗口打开|下载原图ZIP|生成PPT
附图2基于357AISNP的3个族群等位基因频率热图

Supplementary Fig. 2Heatmap of 357AISNP based on the allele frequencies in 3 ethnic groups



附图3

新窗口打开|下载原图ZIP|生成PPT
附图3357AISNP在K=3时的3个族群STRUCTURE分析结果(Ln P(D)=-107,854.6)

Supplementary Fig. 3The STRUCTURE analysis for 3 ethnic groups at K=3 of 357AISNP(Ln P(D)= -107,854.6)



附图4

新窗口打开|下载原图ZIP|生成PPT
附图4基于3个族群在357AISNP的等位基因频率进行主成分分析

Supplementary Fig. 4Principal component analysis of 3 ethnic groups based on allele frequencies of 357AISNP



致谢:

感谢中国科学院上海生命科学研究院计算生物学研究所的徐书华老师在文章数据方面给予的帮助。

(责任编委: 朱波峰)

参考文献 原文顺序
文献年度倒序
文中引用次数倒序
被引期刊影响因子

Budowle B, van Daal A. Forensically relevant SNP classes
Biotechniques, 2008, 44(5):603-608, 610.

DOI:10.2144/000112806PMID:18474034 [本文引用: 1]
Forensic samples that contain too little template DNA or are too degraded require alternate genetic marker analyses or approaches to what is currently used for routine casework. Single nucleotide polymorphisms (SNPs) offer promise to support forensic DNA analyses because of an abundance of potential markers, amenability to automation, and potential reduction in required fragment length to only 60-80 bp. The SNP markers will serve an important role in analyzing challenging forensic samples, such as those that are very degraded, for augmenting the power of kinship analyses and family reconstructions for missing persons and unidentified human remains, as well as for providing investigative lead value in some cases without a suspect (and no genetic profile match in CODIS). The SNPs for forensic analyses can be divided into four categories: identity-testing SNPs; lineage informative SNPs; ancestry informative SNPs; and phenotype informative SNPs. In addition to discussing the applications of these different types of SNPs, this article provides some discussion on privacy issues so that society and policymakers can be more informed.

Phillips C. Forensic genetic analysis of bio-geographical ancestry
Forensic Sci Int Genet, 2015, 18:49-65.

DOI:10.1016/j.fsigen.2015.05.012URL [本文引用: 1]

Phillips C. Ancestry informative markers
Encycl Forensic Sci, 2013: 323-331.

[本文引用: 1]

Kidd KK, Speed WC, Pakstis AJ, Furtado MR, Fang RX, Madbouly A, Maiers M, Middha M, Friedlaender FR, Kidd JR. Progress toward an efficient panel of SNPs for ancestry inference
Forensic Sci Int Genet, 2014, 10:23-32.

DOI:10.1016/j.fsigen.2014.01.002URL [本文引用: 1]

Phillips C, Salas A, Sánchez JJ, Fondevila M, Gómez-Tato A, Alvarez-Dios J, Calaza M, de Cal MC, Ballard D, Lareu MV, Carracedo A, SNPfor ID Consortium. Inferring ancestral origin using a single multiplex assay of ancestry-informative marker SNPs
Forensic Sci Int Genet, 2007, 1(3-4):273-280.

DOI:10.1016/j.fsigen.2007.06.008URL [本文引用: 2]

Kidd JR, Friedlaender FR, Speed WC, Pakstis AJ, De La Vega FM, Kidd KK. Analyses of a set of 128 ancestry informative single-nucleotide polymorphisms in a global set of 119 population samples
Investig Genet, 2011, 2(1):1.

DOI:10.1186/2041-2223-2-1URL [本文引用: 1]

Wei L, Wei YL, Jiang L, Sun QF, Wang YY, Li CX. The development of a 27-plex SNP multiplex system
Chin J Foren Med, 2016, 31(1):13-17.

[本文引用: 2]

魏丽, 魏以梁, 江丽, 孙启凡, 王英元, 李彩霞. 27-plex SNPs 复合扩增检测体系构建与应用评价
中国法医学杂志, 2016, 31(1):13-17.

[本文引用: 2]

Hao WQ, Liu J, Jiang L, Huang MS, Li JL, Ma Q, Liu C, Li CX, Wang HJ. The study of a SNP-multiplex for the ancestry inference of five continental populations
Acta Univ Med Nanjing, 2018, 38(3):331-337.

[本文引用: 1]

郝伟琪, 刘京, 江丽, 黄美莎, 李玖玲, 马泉, 刘超, 李彩霞, 王慧君. 用于五大洲际人群区分的SNP体系研究
南京医科大学学报(自然科学版), 2018, 38(3):331-337.

[本文引用: 1]

Li CX, Pakstis AJ, Jiang L, Wei YL, Sun QF, Wu H, Bulbul O, Wang P, Kang LL, Kidd JR, Kidd KK. A panel of 74 AISNPs: Improved ancestry inference within Eastern Asia
Forensic Sci Int Genet, 2016, 23:101-110.

DOI:10.1016/j.fsigen.2016.04.002URL [本文引用: 4]

Qu SQ, Zhu J, Wang YJ, Yin L, Lv ML, Wang L, Jian H, Tan Y, Zhang RR, Liu YQ, Li F, Huang SC, Liang WB, Zhang L. Establishing a second-tier panel of 18 ancestry informative markers to improve ancestry distinctions among Asian populations
Forensic Sci Int Genet, 2019, 41:159-167.

DOI:10.1016/j.fsigen.2019.05.001URL [本文引用: 1]

Hwa HL, Wu MY, Lin CP, Hsieh WH, Yin HI, Lee TT, Lee JC. A single nucleotide polymorphism panel for individual identification and ancestry assignment in Caucasians and four East and Southeast Asian populations using a machine learning classifier
Forensic Sci Med Pathol, 2019, 15(1):67-74.

DOI:10.1007/s12024-018-0071-yURL [本文引用: 2]

Zhang F, Su B, Zhang YP, Jin L. Genetic studies of human diversity in East Asia
Philos Trans R Soc Lond B Biol Sci, 2007, 362(1482):987-995.

DOI:10.1098/rstb.2007.2028URL [本文引用: 1]

Tian JY, Li YC, Kong QP, Zhang YP. The origin and evolution history of East Asian populations from genetic perspectives
Hereditas(Beijing), 2018, 40(10):814-824.

[本文引用: 1]

田娇阳, 李玉春, 孔庆鹏, 张亚平. 遗传学视角下东亚人群的起源和演化
遗传, 2018, 40(10):814-824.

[本文引用: 1]

Suo C, Xu HY, Khor CC, Ong RT, Sim XL, Chen JM, Tay WT, Sim KS, Zeng YX, Zhang XJ, Liu JJ, Tai ES, Wong TY, Chia KS, Teo YY. Natural positive selection and north-south genetic diversity in East Asia
Eur J Hum Genet, 2012, 20(1):102-110.

DOI:10.1038/ejhg.2011.139PMID:21792231 [本文引用: 1]
Recent reports have identified a north-south cline in genetic variation in East and South-East Asia, but these studies have not formally explored the basis of these clinical differences. Understanding the origins of these variations may provide valuable insights in tracking down the functional variants in genomic regions identified by genetic association studies. Here we investigate the genetic basis of these differences with genome-wide data from the HapMap, the Human Genome Diversity Project and the Singapore Genome Variation Project. We implemented four bioinformatic measures to discover genomic regions that are considerably differentiated either between two Han Chinese populations in the north and south of China, or across 22 populations in East and South-East Asia. These measures prioritized genomic stretches with: (i) regional differences in the allelic spectrum for SNPs common to the two Han Chinese populations; (ii) differential evidence of positive selection between the two populations as quantified by integrated haplotype score (iHS) and cross-population extended haplotype homozygosity (XP-EHH); (iii) significant correlation between allele frequencies and geographical latitudes of the 22 populations. We also explored the extent of linkage disequilibrium variations in these regions, which is important in combining genetic association studies from North and South Chinese. Two of the regions that emerged are found in HLA class I and II, suggesting that the HLA imputation panel from the HapMap may not be directly applicable to every Chinese sample. This has important implications to autoimmune studies that plan to impute the classical HLA alleles to fine map the SNP association signals.

HUGO Pan-Asian SNP Consortium, Abdulla MA, Ahmed I, Assawamakin A, Bhak J, Brahmachari SK, Calacal GC, Chaurasia A, Chen CH, Chen JM, Chen YT, Chu JY, Cutiongco-de la Paz EMC, De Ungria MCA, Delfin FC, Edo J, Fuchareon S, Ghang H, Gojobori T, Han JS, Ho SF, Hoh BP, Huang W, Inoko H, Jha P, Jinam TA, Jin L, Jung JS, Kangwanpong D, Kampuansai J, Kennedy GC, Khurana P, Kim HL, Kim K, Kim S, Kim WY, Kimm K, Kimura R, Koike T, Kulawonganunchai S, Kumar V, Lai PS, Lee JY, Lee S, Liu ET, Majumder PP, Mandapati KK, Marzuki S, Mitchell W, Mukerji M, Naritomi K, Ngamphiw C, Niikawa N, Nishida N, Oh B, Oh S, Ohashi J, Oka A, Ong R, Padilla CD, Palittapongarnpim P, Perdigon HB, Phipps ME, Png E, Sakaki Y, Salvador JM, Sandraling Y, Scaria V, Seielstad M, Sidek MR, Sinha A, Srikummool M, Sudoyo H, Sugano S, Suryadi H, Suzuki Y, Tabbada KA, Tan A, Tokunaga K, Tongsima S, Villamor LP, Wang E, Wang Y, Wang HF, Wu JY, Xiao HS, Xu SH, Yang JO, Shugart YY, Yoo HS, Yuan WT, Zhao GP, Zilfalil BA; Indian Genome Variation Consortium. Mapping human genetic diversity in Asia
Science, 2009, 326(5959):1541-1545.

DOI:10.1126/science.1177074URL [本文引用: 1]

Shi CM, Liu Q, Zhao SL, Chen H. Ancestry informative SNP panels for discriminating the major East Asian populations: Han Chinese, Japanese and Korean
Ann Hum Genet, 2019, 83(5):348-354.

DOI:10.1111/ahg.v83.5URL [本文引用: 3]

Wang YC, Lu DS, Chung YJ, Xu SH. Genetic structure, divergence and admixture of Han Chinese, Japanese and Korean populations
Hereditas, 2018, 155:19.

DOI:10.1186/s41065-018-0057-5URL [本文引用: 4]

Kim W, Shin DJ, Harihara S, Kim YJ. Y chromosomal DNA variation in east Asian populations and its potential for inferring the peopling of Korea
J Hum Genet, 2000, 45(2):76-83.

PMID:10721667 [本文引用: 2]
We have examined variations of five polymorphic loci (DYS287, DXYS5Y, SRY465, DYS19, and DXYS156Y) on the Y chromosome in samples from a total of 1260 males in eight ethnic groups of East Asia. We found four unique haplotypes constructed from three biallelic markers in these samples of East Asians. The Japanese population was characterized by a relatively high frequency of either the haplotype I-2b (-/Y2/T) or II-1 (+/Y1/C). These dual patterns of the distribution of Y chromosomes (I-2b/II-1) were also found in Korea, although they were present at relatively low frequencies. The haplotype II-1 was present in Northeast Asian populations (Chinese, Japanese, Koreans, and Mongolians) only, except for one male from the Thai population among the Southeast Asian populations (Indonesians, Philippines, Thais, and Vietnamese). The Japanese were revealed to have the highest frequency of this haplotype (27.5%), followed by Koreans (2.9%), Mongolians (2.6%), and mainland Chinese (2.2%). In contrast, the frequency of the haplotype I-2b was found to be 17.1% in the Japanese, 9.5% in Indonesian, 6.3% in Korean, 3.8% in Vietnamese, and 2.7% in Thai samples. These findings suggested that the chromosomes of haplotype I-2b were likely derived from certain areas of Northeast Asia, the region closest to Southeast Asia. Phylogenetic analysis using the neighbor-joining tree also reflected a general distinction between Southeast and Northeast Asian populations. The phylogeny revealed a closer genetic relationship between Japanese and Koreans than to the other surveyed Asian populations. Based on the result of the dual patterns of the haplotype distribution, it is more likely that the population structure of Koreans may not have evolved from a single ancient population derived from Northeast Asians, but through dual infusions of Y chromosomes entering Korea from two different waves of East Asians.

Ding QL, Wang CC, Farina SE, Li H. Mapping human genetic diversity on the Japanese archipelago
Advances in Anthropology, 2011, 1(2):19-25.

DOI:10.4236/aa.2011.12004URL [本文引用: 1]

Harihara S, Saitou N, Hirai M, Gojobori T, Park KS, Misawa S, Ellepola SB, Ishida T, Omoto K. Mitochondrial DNA polymorphism among five Asian populations
Am J Hum Genet, 1988, 43(2):134-143.

PMID:2840820 [本文引用: 1]
Mitochondrial DNA (mtDNA) polymorphisms were detected using 13 restriction enzymes on the total DNA obtained from blood samples of five Asian populations: Japanese and Ainu of northern Japan, Korean, Negrito (Aeta) of the Philippines, and Vedda of Sri Lanka. Of a total of 28 restriction-enzyme morphs detected, eight had not been reported previously. By combining the morphs, we were able to classify mtDNAs of 243 individuals into 20 mtDNA types. Phylogenetic analyses using maximum parsimony and genetic distance methods both showed that the Japanese, Ainu, and Korean populations were closely related to each other. Aeta was found to show a relatively close relationship to these three populations, confirming the conclusion from previous studies of blood markers. In contrast, Vedda was quite different from the other four populations.

Horai S, Murayama K, Hayasaka K, Matsubayashi S, Hattori Y, Fucharoen G, Harihara S, Park KS, Omoto K, Pan IH. mtDNA polymorphism in East Asian populations, with special reference to the peopling of Japan
Am J Hum Genet, 1996, 59(3):579-590.

PMID:8751859 [本文引用: 1]
Nucleotide sequences of the major noncoding (D-loop) region of human mtDNA from five East Asian populations including mainland Japanese, Ainu, Ryukyuans, Koreans, and Chinese were analyzed. On the basis of a comparison of 482-bp sequences in 293 East Asians, 207 different sequence types were observed. Of these, 189 were unique to their respective populations, whereas 18 were shared between two or three populations. Among the shared types, eight were found in common between the mainland Japanese and Koreans, which is the largest number in the comparison. The intergenic COII/tRNA(Lys) 9-bp deletion was observed in every East Asian population with varying frequencies. The D-loop sequence variation suggests that the deletion event occurred only once in the ancestry of East Asians. Phylogenetic analysis revealed that East Asian lineages were classified into at least 18 monophyletic clusters, though lineages from the five populations were completely intermingled in the phylogenetic tree. However, we assigned 14 of the 18 clusters for their specificity on the basis of the population from which the maximum number of individuals in each cluster was derived. Of note is the finding that 50% of the mainland Japanese had continental specificity in which Chinese or Koreans were dominant, while < 20% of either Ryukyuans or Ainu possessed continental specificity. Phylogenetic analysis of the entire human population revealed the closest genetic affinity between the mainland Japanese and Koreans. Thus, the results of this study are compatible with the hybridization model on the origin of modern Japanese. It is suggested that approximately 65% of the gene pool in mainland Japanese was derived from the continental gene flow after the Yayoi Age.

Rolf B, Horst B, Eigel A, Sagansermsri T, Brinkmann B, Horst J. Microsatellite profiles reveal an unexpected genetic relationship between Asian populations
Hum Genet, 1998, 102(6):647-652.

PMID:9703426 [本文引用: 2]
Allelic frequencies for up to five short tandem repeat systems (HumTH01. HumVWA, HumF13B, HumCD4, HumD2111) were analyzed in seven population samples from Asia using the polymerase chain reaction and gel electrophoresis. No deviations from Hardy-Weinberg equilibrium were observed. Two new alleles of the CD4 and TH01 loci were detected, and sequenced and their molecular structure is presented. A phylogenetic tree based on Thai, Han Chinese (from the northeast of China), Japanese, German and Ovambo allelic frequencies was constructed and demonstrates the close relationship of the Asian populations. Additionally, allelic frequency data for the VWA and TH01 systems were determined for the south Chinese minorities Bai, Dai and Qiang and for Koreans and compared with the above data. The Bai and Dai populations were clear outliers of the cluster of all other Asians, indicating an unexpected pattern of genetic heterogeneity of the Chinese nation. Two clusters of Asian populations could be established: the Koreans and Japanese together with the Han and Qiang Chinese, and, forming a separate cluster, the Bai and Dai populations.

Jeon S, Bhak Y, Choi Y, Jeon Y, Kim S, Jang J, Jang J, Blazyte A, Kim C, Kim Y, Shim J, Kim N, Kim YJ, Park SG, Kim J, Cho YS, Park Y, Kim HM, Kim BC, Park NH, Shin ES, Kim BC, Bolser D, Manica A, Edwards JS, Church G, Lee S, Bhak J. Korean Genome Project: 1094 Korean personal genomes with clinical information
Sci Adv, 2020, 6(22): eaaz7835.

DOI:10.1126/sciadv.aaz7835URL [本文引用: 1]

1000 Genomes Project Consortium, Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, Marchini JL, McCarthy S, McVean GA, Abecasis GR. A global reference for human genetic variation
Nature, 2015, 526(7571):68-74.

DOI:10.1038/nature15393URL [本文引用: 1]

Clarke L, Fairley S, Zheng-Bradley XQ, Streeter I, Perry E, Lowy E, Tassé AM, Flicek P. The international Genome sample resource (IGSR): A worldwide collection of genome variation incorporating the 1000 Genomes Project data
Nucleic Acids Res, 2017, 45(D1):D854-D859.

DOI:10.1093/nar/gkw829URL [本文引用: 1]

Xu SH, Yin XY, Li SL, Jin WF, Lou HY, Yang L, Gong XH, Wang HY, Shen YP, Pan XD, He YG, Yang YJ, Wang Y, Fu WQ, An Y, Wang JC, Tan JZ, Qian J, Chen XL, Zhang X, Sun YF, Zhang XJ, Wu BL, Jin L. Genomic dissection of population substructure of Han Chinese and its implication in association studies
Am J Hum Genet, 2009, 85(6):762-774.

DOI:10.1016/j.ajhg.2009.10.015URL [本文引用: 1]

Jinam TA, Kanzawa-Kiriyama H, Inoue I, Tokunaga K, Omoto K, Saitou N. Unique characteristics of the Ainu population in Northern Japan
J Hum Genet, 2015, 60(10):565-571.

DOI:10.1038/jhg.2015.79URL [本文引用: 1]

Kim JJ, Verdu P, Pakstis AJ, Speed WC, Kidd JR, Kidd KK. Use of autosomal loci for clustering individuals and populations of East Asian origin
Hum Genet, 2005, 117(6):511-519.

DOI:10.1007/s00439-005-1334-8URL [本文引用: 1]

Qin PF, Li ZQ, Jin WF, Lu DS, Lou HY, Shen JW, Jin L, Shi YY, Xu SH. A panel of ancestry informative markers to estimate and correct potential effects of population stratification in Han Chinese
Eur J Hum Genet, 2014, 22(2):248-253.

DOI:10.1038/ejhg.2013.111URL [本文引用: 1]

Barrett JC, Fry B, Maller J, Daly MJ. Haploview: analysis and visualization of LD and haplotype maps
Bioinformatics, 2005, 21(2):263-265.

PMID:15297300 [本文引用: 1]
Research over the last few years has revealed significant haplotype structure in the human genome. The characterization of these patterns, particularly in the context of medical genetic association studies, is becoming a routine research activity. Haploview is a software package that provides computation of linkage disequilibrium statistics and population haplotype patterns from primary genotype data in a visually appealing and interactive interface.http://www.broad.mit.edu/mpg/haploview/jcbarret@broad.mit.edu

Carlson CS, Eberle MA, Rieder MJ, Yi Q, Kruglyak L, Nickerson DA. Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium
Am J Hum Genet, 2004, 74(1):106-120.

DOI:10.1086/381000URL [本文引用: 1]

Wei YL, Wei L, Zhao L, Sun QF, Jiang L, Zhang T, Liu HB, Chen JG, Ye J, Hu L, Li CX. A single-tube 27-plex SNP assay for estimating individual ancestry and admixture from three continents
Int J Legal Med, 2015, 130(1):27-37.

DOI:10.1007/s00414-015-1183-5URL [本文引用: 2]

Nei M. Analysis of gene diversity in subdivided populations
Proc Natl Acad Sci USA, 1973, 70(12):3321-3323.

DOI:10.1073/pnas.70.12.3321URL [本文引用: 1]

Rousset F. genepop'007: a complete re-implementation of the genepop software for Windows and Linux
Mol Ecol Resour, 2008, 8(1):103-106.

DOI:10.1111/j.1471-8286.2007.01931.xURL [本文引用: 1]

de la Puente M, Santos C, Fondevila M, Manzo L, EUROFORGEN-NoE Consortium, Carracedo Á, Lareu MV, Phillips C. The Global AIMs Nano set: A 31-plex SNaPshot assay of ancestry-informative SNPs
Forensic Sci Int Genet, 2016, 22:81-88.

DOI:10.1016/j.fsigen.2016.01.015URL [本文引用: 1]

Liu J, Li S, Jiang L, Zhao L, Zhao WT, Feng L, Liu HB, Ji AQ, Li CX. DNA Ancestry Analyzer: an automatic program for ancestry inference of unknown individuals
Life Sci Res, 2018, 22(1):3-7, 41.

[本文引用: 1]

刘京, 李盛, 江丽, 赵蕾, 赵雯婷, 丰蕾, 刘海渤, 季安全, 李彩霞. 对于未知来源个体进行族群推断的自动分析系统
生命科学研究, 2018, 22(1):3-7, 41.

[本文引用: 1]

Geck C. The world factbook
Charleston Adv, 2017, 19(1):58-60.

DOI:10.5260/chara.19.1.58URL [本文引用: 1]

Siska V, Jones ER, Jeon S, Bhak Y, Kim HM, Cho YS, Kim H, Lee K, Veselovskaya E, Balueva T, Gallego- Llorente M, Hofreiter M, Bradley DG, Eriksson A, Pinhasi R, Bhak J, Manica A. Genome-wide data from two early Neolithic East Asian individuals dating to 7700 years ago
Sci Adv, 2017, 3(2):e1601877.

DOI:10.1126/sciadv.1601877URL [本文引用: 1]

Pan ZQ, Xu SH. Population genomics of East Asian ethnic groups
Hereditas, 2020, 157(1):49.

DOI:10.1186/s41065-020-00162-wURL [本文引用: 1]

Wen H, Wei YL, Guo XY, Sun CC, Xue SY, Liu J, Fan H, Jiang L. High-resolution SNP ancestry inference model and efficiency evaluation in three East Asian populations
Progress in Biochemistry and Biophysics, 2021, 1-11.

[本文引用: 2]

文豪, 魏以梁, 郭晓媛, 孙昌春, 薛思瑶, 刘京, 范虹, 江丽. 东亚三族群SNP高分辨推断模型构建与效能评估
生物化学与生物物理进展, 2021, 1-11.

[本文引用: 2]

Jiang L, Sun QF, Ma Q, Zhao WT, Liu J, Zhao L, Ji AQ, Li CX. Optimization and validation of analysis method based on 27-plex SNP panel for ancestry inference
Hereditas (Beijing), 2017, 39(2):166-173.

[本文引用: 3]

江丽, 孙启凡, 马泉, 赵雯婷, 刘京, 赵蕾, 季安全, 李彩霞. 27-plex SNP 种族推断方法的优化及验证
遗传, 2017, 39(2):166-173.

[本文引用: 3]

相关话题/日本 遗传 测试 信息 数据库