Genetic admixture of Chinese Tajik people inferred from genome-wide array genotyping and mitochondrial genome sequencing
Jing Zhao1,2,3, Qiao Wu1,3, Xin-Hong Bai4, Edward Allen5, Meng-Ge Wang2, Guang-Lin He2, Jian-Xin Guo2, Xiao-Min Yang2, Jian-Xue Xiong5,6, Zi-Xi Jiang5,6, Xiao-Yan Ji5,6, Hui Wang5,6, Jing-Ze Tan1(), Shao-Qing Wen1,5,6(), Chuan-Chao Wang2,7,8,9()
Genetic admixture of Chinese Tajik people inferred from genome-wide array genotyping and mitochondrial genome sequencing
Chinese Tajiks are an Indo-Iranian-speaking population in Xinjiang, northwest China. Although the complex demographic history has been characterized, the ancestral sources and genetic admixture of Indo-Iranian-speaking groups in this region remain poorly understood. We here provide the genome-wide genotyping data for over 700 000 single-nucleotide polymorphisms (SNPs) and mtDNA multiplex sequencing data in 64 Chinese male Tajik individuals from two dialect groups, Wakhi and Selekur. We applied principal component analysis (PCA), ADMIXTURE, f-statistics, treemix, qpWave/qpAdm, Admixture-induced Linkage Disequilibrium for Evolutionary Relationships (ALDER), and Fst analyses to infer a fine-scale population genetic structure and admixture history. Our results reveal that Chinese Tajiks showed the closest affinity and similar genetic admixture pattern with ancient Xinjiang populations, especially Xinjiang samples in the historical era. Chinese Tajiks also have gene flow from European and Neolithic Iran farmers-related populations. We observed a genetic substructure in the two Tajik dialect groups. The Selekur-speaking group who lived in the county had more gene flow from East Asians than Wakhi-speaking people who inhabited the village. These results document the population movements contributed to the influx of diverse ancestries in the Xinjiang region.
Chinese Tajiks / East Asia / genetic structure / population admixture / population history
/
〈 | 〉 |