The impact of estimator choice: Disagreement in clustering solutions across K estimators for Bayesian analysis of population genetic structure across a wide range of empirical data sets
2022
Stankiewicz, Kathryn H. | Vasquez Kuntz, Kate L. | Coral Microsatellite Group | Ledoux, J. B. | Garrabou, Joaquim | Baums, Iliana B. | National Science Foundation (US) | Pennsylvania State University | European Commission | Agencia Estatal de Investigación (España)
14 pages, 5 figures, 2 tables, 1 appendix, supporting information https://doi.org/10.1111/1755-0998.13522.-- Data Availability Statement: All Supporting Information figures and their corresponding raw data can be accessed on Dryad (https://doi.org/10.5061/dryad.zgmsbccck)
اظهر المزيد [+] اقل [-]The software program STRUCTURE is one of the most cited tools for determining population structure. To infer the optimal number of clusters from STRUCTURE output, the ΔK method is often applied. However, a recent study relying on simulated microsatellite data suggested that this method has a downward bias in its estimation of K and is sensitive to uneven sampling. If this finding holds for empirical data sets, conclusions about the scale of gene flow may have to be revised for a large number of studies. To determine the impact of method choice, we applied recently described estimators of K to re-estimate genetic structure in 41 empirical microsatellite data sets; 15 from a broad range of taxa and 26 from one phylogenetic group, coral. We compared alternative estimates of K (Puechmaille statistics) with traditional (ΔK and posterior probability) estimates and found widespread disagreement of estimators across data sets. Thus, one estimator alone is insufficient for determining the optimal number of clusters; this was regardless of study organism or evenness of sampling scheme. Subsequent analysis of molecular variance (AMOVA) did not necessarily clarify which clustering solution was best. To better infer population structure, we suggest a combination of visual inspection of STRUCTURE plots and calculation of the alternative estimators at various thresholds in addition to ΔK. Disagreement between traditional and recent estimators may have important biological implications, such as previously unrecognized population structure, as was the case for many studies reanalysed here
اظهر المزيد [+] اقل [-]This work was made possible by NSF grant OCE-1537959 to IBB, NIH grant T32: Computation, Bioinformatics, and Statistics (CBIOS) Training Program to KHS, a Bunton-Waller fellowship to KLVK, the strategic Funding UIDB/04423/2020 and UIDP/04423/2020 to JBL, and the Pennsylvania State University Biology Department. The project leading to this publication has received funding from European FEDER Fund under project 1166-39417 to DA. We acknowledge the funding of the Spanish government through the “Severo Ochoa Centre of Excellence” accreditation (CEX2019-000928-S)
اظهر المزيد [+] اقل [-]Peer reviewed
اظهر المزيد [+] اقل [-]الكلمات المفتاحية الخاصة بالمكنز الزراعي (أجروفوك)
المعلومات البيبليوغرافية
تم تزويد هذا السجل من قبل Institut de Ciències del Mar