Assessing model adequacy for Bayesian Skyline

Full text

Turn on search term navigation

Introduction

Posterior predictive simulation (PPS) is a commonly used technique for assessing model adequacy in a Bayesian framework [1]. PPS samples parameter values from the posterior distribution of an empirical analysis and simulates data that match the underlying assumptions of the model used to analyze the data. The probability distribution of the simulated data given the model is then compared to the actual data in order to assess model adequacy, either directly or via the use of proxy summary statistics. In either case, empirical data that are consistent with the assumptions of the model used to analyze the data will have probabilities distribution that are similar to the simulated data, and such a result would strengthen the confidence of the researcher in the inferences that result from the analysis of the empirical data. On the other hand, finding that the posterior distribution of the empirical data differs substantially from the posterior predictive distribution provides strong evidence that one or more of the underlying model assumptions has been violated. In sum, PPS may allow researchers to learn how a model does not fit the data [2, 3] providing perhaps the best approach to evaluating model adequacy for complex models [4].

Posterior predictive checks were introduced to molecular systematics by Huelsenbeck et al. [5] in the context of assessing the adequacy of models of sequence evolution, which are essential to the calculation of the posterior distribution in Bayesian inference. Work on assessing the fit of sequence evolution models has continued, with recent authors introducing posterior predictive approaches to evaluating the fit of models of sequence evolution in Bayesian phylogenetic inference [6], and the development of new statistics for detecting cases where model misspecification negatively influences phylogeny estimation [7]. PPS is gaining popularity for analyses conducted at the species level; for example it has been implemented to show that two common population genetic models perform poorly in describing the history of a duck species [8], has been used to identify instances of introgressive hybridization [9], and has been used to explore the accuracy of DNA barcoding efforts [10]. Recently, Duchene et al. [11] introduced a new software to assess the adequacy of phylodynamic models in infectious diseases investigations. Thus, posterior predictive assessments of model adequacy have the potential to improve investigations by verifying that the data collected from empirical systems are adequate to address the research questions. This can be particularly important when data are recycled or repurposed, that is, downloaded from public databases and used to address new questions (e.g., [12–14]).

There are millions of sequences available from ‘first generation’ phylogeographic investigations (e.g., BOLD, GenBank). These mitochondrial or chloroplast phylogeographic data sets are also still being collected by researchers, often as a first pass at data analysis in empirical systems (e.g., [15]). Such data are often used in multispecies comparative analyses, such as those investigating simultaneous divergence (e.g., [16, 17]) or expansion [13, 18] using hierarchical ABC. Similarly, these data can be used in automated phylogeography [19] and predictive phylogeography [20, 21]. However, each of these analyses makes certain assumptions about these data that may be difficult to assess. Hence, a practical limit on the repurposing of phylogeographic data is present when researchers cannot easily assess model adequacy.

Ideally, any phylogeographic data should be assessed by scientists in a manner that considers the model assumptions of the analyses that they plan to conduct. One example is the requirement of some analytical methods that the samples are free of population genetic structure, which can confound hierarchical tests of population expansion. To address this question, researchers have applied a species delimitation approach to identify structure within nominal species [18, 22]. Recently, Fonseca et al. [23] introduced a posterior predictive test of the data given the Generalized Mixed Yule Coalescent (GMYC) model in order to assess model adequacy, showing that large population sizes are likely to confound GMYC analysis. Here, we develop a posterior predictive test of model adequacy for population size changes using PPS on Bayesian Skyline plots (BSP; [24]).

Bayesian Skyline plots are a class of skyline-plot methods devised to infer the demographic history from DNA sequences using coalescent theory and co-estimation of genealogies and nucleotide substitution-parameters [25]. Introduced by Drummond et al. [24], BSPs are an extension of earlier Skyline-plot methods [26, 27] that enable phylogenetic uncertainty to be incorporated. To reconstruct the effective population size through time, BSPs estimate gene genealogies from a DNA alignment and simultaneously infer the demographic history from the gene genealogies. The coalescent model used in BSPs contains inherent assumptions about DNA sequences used in the analysis, notably that these were randomly sampled from a panmictic population and that the sequences are orthologous, nonrecombining, and neutrally evolving [25]. Because many datasets can potentially violate the first assumption (i.e., absence of genetic structure), we built an R package to assess the model adequacy for BSPs using PPS that can easily be incorporated into analysis pipelines.

Demographic history in empirical datasets

Duchene et al. [11] advocated that empiricists explore the model adequacy of the skyline plot model as part of the inference process. To support this suggestion, we include an investigation into two species of amphibians. Both species (Leptodactylus troglodytes and Rhinella granulosa) occur throughout northeastern Brazil in the xeric Caatinga biome, which is bordered by savannahs to the west (the Cerrado biome) and rainforest to the east (the Atlantic Forest biome). While the individual distributions of these species differ slightly, with L. troglodytes widespread across the Caatinga and enclaves of this vegetation within the Cerrado and R. granulosa spanning the Caatinga and the Atlantic Forest, both species have been the subject of recent investigation with genomic data, with approximately 15,000 SNP for L. troglodytes [28] and approximately 7,000 SNP for R. granulosa [29]. In both species demographic model selection was used to detect changes in population sizes (either instantaneous or exponential) as an important component of the demographic history of each species. These systems were chosen because there are existing SNP data that provide evidence for the importance of population size change in the demographic history of each species and because mitochondrial DNA were not included in the original investigations. In addition to analyzing data from these species, we include analysis of published data from eight other taxa.

Material and methods

P2C2M.Skyline package

P2C2M.Skyline is an open-source R package designed to assess the statistical fit of the BSP model. The package is available at: https://github.com/P2C2M. User input to P2C2M.Skyline includes a phylogenetic tree and the log file resulting from a BSP analysis. The package check model fit to BSP model using PPS following Lewis et al. (2014). The general workflow is shown in Fig 1.

[Figure omitted. See PDF.]

Arrows represent the path of the data from step 1 to 6. See P2C2M.Skyline package section on Material and Methods for more information.

1. P2C2M.Skyline requires two input from users:

1. A ultrametric phylogenetic tree

2. The log file resulting from a Bayesian Skyline analyzed in Tracer [30].

2. P2C2M.Skyline calculates a theta value from the phylogenetic tree using the function theta.tree implemented in the pegas R package [31].

3. P2C2M.Skyline calculates the magnitude of the population size change by sampling a random value for ancestral and current population size from the credible interval from the posterior distribution of the Bayesian Skyline analysis (i.e., by choosing a value from the confidence interval). While the ancestral population represents the value of population size in the credible interval from the posterior distribution associate with the oldest coalescent time, the current population represents the value associate with the most recent time. Then, by dividing the ancestral population size by the current population size, the package calculates the population size ratio. This value is used in downstream analyses to model the population size trajectory through time (i.e., constant, expansion, or bottleneck).

4. P2C2M. Skyline uses the software ms [32] to simulate gene trees under a coalescent model. Three parameters are used to simulate the data: (i) the number of individuals from the empirical dataset; (ii) the q value calculated in step 2; (iii) the magnitude of the population size change calculated in step 3. The coalescent simulations are replicated “n” times, with the default value set to 100.

5. P2C2M.Skyline calculates for each simulated gene tree and the user supplied ultrametric tree a summary statistic: the sum of the cumulative coalescent interval divided by the reverse number of elements:(1)

where n is the number of divergence events, i is ordered from the oldest (1) to youngest (n) divergence event, and t_i is the time of the ith divergence event. The summary statistic is calculated from all simulated datasets to construct the null distribution. Next, the summary statistic is calculated for the empirical dataset.

6. P2C2M.Skyline assesses the statistical fit of the Bayesian Skyline plot by calculating the number of simulated summary statistic values falling above and below the empirical value and then, multiplying the lesser value by two, which is equivalent to a two-tailed test. Next, a p-value is calculated by dividing this number by the total number of elements in the null distribution. A poor fit to the Bayesian Skyline is inferred if the p-value is lower than a user-defined threshold α (see Results).

Simulation testing

To evaluate the performance of our package, we simulated datasets under different demographic histories and sampling schemes. Specifically, we used ms to simulate five evolutionary histories: (i) constant population size through time, (ii) population expansion, (ii) population bottleneck, (iv) two populations with shallow divergence, and (v) two populations with deep divergence. Because populations in (iv) and (v) are analyzed as a single population, these last two evolutionary scenarios represent models that violate the assumptions of the Bayesian Skyline model. Each evolutionary scenario was simulated under two sampling schemes: (i) 10 individuals and (ii) 50 individuals. For the two-population model, the number of individuals was equally distributed between the populations (i.e., each population had 5 or 25 individuals). We assumed a generation time between 3–5 years and an effective population size between 10,000–100,000 individuals, which is in the range of that observed in empirical systems (e.g., [20]). For the two-population models, we assumed a divergence time of 4N (shallow) and 8N (deep) generations in the past. We simulated datasets assuming a total of 200 segregating sites on a gene 1,000 bp long, totaling 1,000 datasets (100 replicates for each model under two distinct sampling schemes). DNA sequences of 1,000 bp were generated for each gene tree using Seq‐Gen [33] under the HKY model. We reconstruct for each dataset changes in population size through time using the Bayesian Skyline implemented in Beast2 (source code version; [34]). We used a strict molecular clock and ran the chain for 10⁷ generations, sampling every 10³ generations. We evaluated convergence using Tracer v1.7.1, ensuring the effective sample size was higher than 200 for all parameters. Then, Bayesian Skyline log files were analyzed using Tracer [30]. Gene trees were summarized using the maximum clade credibility tree in TreeAnnotator 1.8.0 [35]. Finally, we used P2C2M.Skyline assess the statistical fit of the BSP model to each simulated dataset. We evaluated the performance of P2C2M.Skyline under four significance values (1%, 2.5%, 5%, and 10%) using the Mathews Correlation Coefficient (MCC; [36]) implemented in the R package mltools [37].

Summary statistics

We assessed the effectiveness of two additional summary statistics: (i) interval lengths [38] and (ii) summed branching times. While the former summary statistic is defined as the summed differences between time-interval lengths, the latter is the sum of the distance from each node to the tips. We used the simulated datasets to test the performance of both summary statistics in comparison to the summary statistic proposed previously.

Applying P2C2M.Skyline to empirical data

We further assessed the utility of P2C2M.Skyline in 10 empirical datasets, consisting of mitochondrial DNA (Anura (4), Squamata (2), Passeriformes (3), and Araneae (1)). While most of the sequences were download from Genbank, we generated fragments of mitochondrial DNA for two of these empirical systems: Rhinella granulosa and Leptodactylus troglodytes. All sequences for both species are deposited in GenBank (accession numbers: numbers will be included upon acceptance). For these species, we extracted total DNA from liver and muscle preserved in ethanol using DNeasy Blood & Tissue kits (Qiagen, Venlo, Netherlands) and sampled the mitochondrial genome by sequencing the cytochrome oxidase subunit one (CO1) gene using the protocol and primers described in Lyra et al. [39]. Recent investigations (Thomé et al., [28]; Thomé et al., [29]), using genomic data, detected two populations for R. granulosa and three populations for L. troglodytes, respectively. Recent demographic changes were also detected for populations of both species based on phylogeographic model selection. In particular, population 3 of L. troglodytes showed a recent bottleneck, and the remaining populations showed signals of recent expansions. Because of this dynamic evolutionary history, both species represent excellent candidates to test P2C2M.skyline.

For all empirical datasets, we first selected the best model of nucleotide substitution using the Bayesian information criterion (BIC) implemented in JModelTest 2.0 [40]. Next, input files for P2C2M.Skyline were generated running BSP analysis as described for the simulated datasets. Empirical datasets were analyzed by grouping all the samples and by splitting them into different lineages as recovered in the original papers. We used a α value of 5% (see the Results of the simulations). We then compared the results of our analyses to those reported in the papers that described these data, where applicable. Our goal here was to assess the extent to which potentially misleading inferences result from cases of poor model fit.

Results

Simulation testing

When there were no model violations (constant, bottleneck, and expansion datasets), P2C2M.Skyline failed to reject the Bayesian Skyline model for nearly all simulated datasets. (Fig 2 and S1 Fig in S1 File for α = 0.025 and 0.1). In contrast, for the two-population model (shallow and deep divergence), our package showed that many of the simulated datasets violate the Bayesian Skyline model (Fig 2 and S1 Fig in S1 File). The Matthews correlation coefficient (MCC) showed that the α value of 2.5% produced better results when compared to thresholds of 1%, 5%, and 10% (Table 1). However, this threshold also produced a high rate of false positives (i.e., datasets simulated under the correct premises that P2C2M.Skyline classified as a model violation). We advise users to use a threshold of 5% in their investigations because the lower rate of false negatives and reasonable values of MCC.

[Figure omitted. See PDF.]

In each chart, the Y‐axis shows the percentage of replicates where the statistical fit of the Bayesian Skyline model is rejected or not under two sampling schemes (10 and 50 individuals).

[Figure omitted. See PDF.]

False positives represent datasets simulated under the Bayesian Skyline model premises (i.e., constant, expansion, and bottleneck) that P2C2M.Skyline classified as a model violation. In contrast, false negatives represent datasets not simulated under Bayesian Skyline model premises (i.e., two-population models) that P2C2M.Skyline classified as not a model violation.

Summary statistics

We used MCC to assess the performance of interval lengths and summed branching times in comparison to the summary statistic proposed in step 5 (see P2C2M.Skyline package section). MCC under significance value of 5%, which showed to be the best significance value, showed that both summary statistics performed poorly compared to our proposed summary statistic. We recovered a MCC value of -0.14, 0.36, and 0.45 for interval lengths, summed branching times, and our proposed summary statistic, respectively. Given these results, which show that the proposed summary statistic maximizes true positive and true negative while minimizes false positives and false negatives, only this summary statistic is discussed further in the text. Information on each summary statistic is presented in S1–S3 Tables in S1 File.

Applying P2C2M.Skyline to empirical data

Overall, our package performed well when applied to empirical datasets. For example, we identified model violations in five of eight systems when all samples were considered to belong to a single population (Table 2). In four of these cases, previous work identified population structure. This lumping of samples, which is a clear violation of the P2C2M.Skyline model, is obvious in retrospect in these cases due to other analyses conducted by the researchers, but since the characterization of population genetic structure is often a goal of investigations into empirical systems this may not be known during the initial stages of data analysis. For three of the eight datasets, despite evidence of population structure based on other molecular markers in previous studies, P2C2M.Skyline did not detect a model violation when all populations were lumped together.

[Figure omitted. See PDF.]

Asterisk indicates datasets with p-value < 0.05.

When data from systems were analyzed on a population basis, P2C2M.skyline was not able to reject the Bayesian Skyline model for six of the eight of the datasets (Table 2). However, some empirical datasets (e.g., population 1 of Pleurodema diplolister) violated the Bayesian Skyline model even after samples were divided into populations (see Discussion for putative explanations). Interestingly, while P2C2M.Skyline did not detect a model violation when data from Sicarius cariri were analyzed as a single population, it did detect a model violation when data were analyzed separately for the two populations. P2C2M.Skyline took less than 1 minute to run in an average laptop (2.6 GHz Intel Core i5, 8 GB RAM) with a dataset composed of 100 DNA sequences. The skyline plots and the phylogenetic tree for each species are shown in S2–S10 Figs in S1 File.

Discussion

P2C2M.Skyline as a useful tool for empiricists

Bayesian Skyline plots are a commonly applied method for inferring the demographic history of populations. For example, they have been applied to characterize the trajectory of population size change in phylogeographic investigations (e.g., Table 2) and are also commonly used in epidemiology (e.g., [11, 46]). However, they make implicit assumptions about the conditions under which the data were sampled, and previous results have demonstrated that the inferences drawn from Bayesian Skyline plot analyses may be incorrect when the underlying assumption of an idealized Wright-Fisher population is violated (e.g., [47]). We developed a fast and friendly approach that applies PPS to evaluate model adequacy under the Bayesian Skyline model. Because it uses information that is available to all researchers who conduct Bayesian Skyline analyses, P2C2M.Skyline can be easily incorporated into the research pipeline and provide some assurance in the inferences that are drawn from Bayesian Skyline analyses. We follow Duchene et al. [11] in advocating that empiricists explore the model adequacy of the skyline plot model as part of the inference process. P2C2M.Skyline is easily incorporated into R analytical pipelines, making it suitable for automated phylogeographic analyses of thousands of datasets.

Our results demonstrate that P2C2M.Skyline is a powerful and fast tool for detecting model violations in Bayesian Skyline analyses. In general, we observed low numbers of false positives across various sample sizes and demographic histories. The ability to detect model violations under population structure scenarios was dependent on both the divergence time between the populations and the number of samples analyzed, with shallow divergence times and smaller sample sizes leading to more false negatives. Since deeper levels of population divergence are expected to have a larger effect on inferences drawn from Bayesian Skyline analyses [47], we believe this result to be ideal; the more extreme the model violation, the more likely P2C2M.Skyline is to detect the violation. We recommend using a conservative α of 0.05 to reduce false negatives, so that researchers are more likely to detect less-extreme model violations that may nevertheless mislead inference, particularly if a small number of samples is available. When a model violation is detected, we recommend that researchers analyze population structure and subsequently divide their dataset by population before performing skyline analyses again. Our empirical analyses show this to be an effective strategy for overcoming model violations (Table 2). Of course, our method may detect violations other than population substructure that were not evaluated here, such as migration. Thus, if dividing datasets into subpopulations still results in model violations, users may want to consider using tools other than BSPs to infer the demographic histories of populations. Overall, our results indicate that users of BSPs would benefit from incorporating P2C2M.Skyline into their workflow due to its fast run times and ability to detect model violations that may otherwise mislead inferences of population size changes.

Our tests of the P2C2M.Skyline pipeline used the cumulative coalescent interval as the summary statistic to determine if empirical datasets significantly differed from simulated ones, indicating model violations. This summary statistic is effective because it relies on distortions in branch lengths caused by population structure. Although we only examined population structure as a model violation, other model violations like selection or migration could also result in distorted branch lengths. Therefore, it is possible that our method could detect model violations other than just population structure. Further, other summary statistics may be powerful at detecting other, untested, model violations. Future research examining other model violations and other summary statistics could further improve the P2C2M.Skyline framework.

Demographic history in the empirical datasets

Our analysis of the empirical data sets illustrates how P2C2M.Skyline can be of use when applied to empirical systems. For L. troglodytes, populations were analyzed individually by Thomé et al. [28], and findings included two populations that expanded in the late Pleistocene, and one population that endured an intense late Holocene bottleneck. Results from our BSP differed from this previous work, which can be attributed in large part due ot the mitochondrial data having less signal than the SNPs used by the previous study. However, the results of P2C2M.Skyline are clearly consistent with previous results, and had the mitochondrial data been analyzed first these results would have served as a useful guide to additional data collection in this system. For R. granulosa, the P2C2M.Skyline results demonstrate that the underlying model is not appropriate for the data when all samples are (incorrectly) combined into a single population. When samples are analyzed in the populations used by Thomé et al. [29], P2C2M.Skyline results are consistent with the findings from the analysis of SNP data, where the best model included some variation in population sizes related to an (smaller) ancestral population.

Results from the analysis of empirical data collected for other species highlight the utility of P2C2M.Skyline. As in Leptodactylus troglodytes, we did not detect a model violation when populations of Polychrus acutirostris were analysed as a single population. In both cases, population structure was determined based on nuclear and mitochondrial data, while the BSP analysis only considers mitochondrial data, perhaps explaining the inconsistencies. On the other hand, for Pleurodema diplolister, population structure was determined based on nuclear DNA, but our analyses still detected a model violation despite only considering mitochondrial data. Although use of only a mitochondrial marker may reduce the ability to reconstruct complex evolutionary scenarios because of the high stochastic variance associated with only one marker [25, 48], mtDNA markers are still commonly applied as a first pass for inference into the drivers of intraspecific diversification, especially if analyzed at the community-level [13, 18, 49]. Thus, attention should be drawn to population structure at this particular type of marker.

Finally, in two cases, we detect model violations even when population structure is taken into account. First, in population 1 of P. diplolister, we detect a model violation. Thomé et al. [41] did report P. alium mitochondria introgressing into population 1 of P. diplolister, which could explain the violation of the BSP model in this case. Similarly, for Sicarius cariri, we did not detect a model violation when populations were analyzed together but did detect model violations when the two populations were analyzed separately. This could be another case in which introgression from another population leads to a model violation and highlights the nuances inherent to determining what units to use when performing BSP analyses as well as the advantages of applying PPS to this problem.

Conclusions

Here we develop a R package for assessing model adequacy for Bayesian Skyline plots using posterior predictive simulation. The package was successfully tested on simulated and empirical datasets. P2C2M.Skyline can be a useful tool for researchers interested in repurposing single locus phylogeographic data to address new questions using hierarchical ABC [13, 18], automated phylogeography (e.g., [20]), or predictive phylogeography (e.g., [21]).

Supporting information

S1 File.

https://doi.org/10.1371/journal.pone.0269438.s001

(DOCX)

Citation: Fonseca EM, Duckett DJ, Almeida FG, Smith ML, Thomé MTC, Carstens BC (2022) Assessing model adequacy for Bayesian Skyline plots using posterior predictive simulation. PLoS ONE 17(7): e0269438. https://doi.org/10.1371/journal.pone.0269438

About the Authors:

Emanuel M. Fonseca

Roles: Conceptualization, Formal analysis, Investigation, Methodology, Writing – original draft

Affiliations Department of Evolution, Ecology and Organismal Biology, The Ohio State University, Columbus, OH, United States of America, Museum of Biological Diversity, The Ohio State University, Columbus, OH, United States of America

https://orcid.org/0000-0002-2952-8816

Drew J. Duckett

Roles: Conceptualization, Investigation, Methodology, Visualization, Writing – original draft, Writing – review & editing

Filipe G. Almeida

Roles: Formal analysis, Investigation, Writing – original draft, Writing – review & editing

Affiliation: Department of Zoology, Federal University at Juiz de Fora, Juiz de Fora, Minas Gerais, Brazil

Megan L. Smith

Roles: Conceptualization, Investigation, Methodology, Visualization, Writing – original draft, Writing – review & editing

Affiliation: Department of Biology and Department of Computer Science, Indiana University, Bloomington, IN, United States of America

https://orcid.org/0000-0002-6362-9354

Maria Tereza C. Thomé

Roles: Conceptualization, Investigation, Methodology, Visualization, Writing – original draft, Writing – review & editing

Bryan C. Carstens

Roles: Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Visualization, Writing – original draft, Writing – review & editing

E-mail: [email protected]

References

1. Gelman A. A Bayesian formulation of exploratory data analysis and goodness-of-fit testing. Int Stat Rev. 2003;71(2):369–82.

2. Gelman A, Shalizi CR. Philosophy and the practice of Bayesian statistics. Br J Math Stat Psychol. 2013;66(1):8–38. pmid:22364575

3. Carstens BC, Smith ML, Duckett DJ, Fonseca EM, Thomé MTC. Assessing model adequacy leads to more robust phylogeographic inference. Trends Ecol Evol [Internet]. 2022 Jan; Available from: https://linkinghub.elsevier.com/retrieve/pii/S0169534721003426. pmid:35027224

4. Kruschke JK. Posterior predictive checks can and should be Bayesian: Comment on Gelman and Shalizi, “Philosophy and the practice of Bayesian statistics.” Br J Math Stat Psychol. 2013;66(1):45–56. pmid:23003325

5. Huelsenbeck JP, Ronquist F, Nielsen R, Bollback JP. Bayesian inference of phylogeny and its impact on evolutionary biology. Science (80-). 2001;294(5550):2310–4. pmid:11743192

6. Lewis PO, Xie W, Chen MH, Fan Y, Kuo L. Posterior predictive bayesian phylogenetic model selection. Syst Biol. 2014;63(3):309–21. pmid:24193892

7. Brown JM. Detection of implausible phylogenetic inferences using posterior predictive assessment of model fit. Syst Biol. 2014;63(3):334–48. pmid:24415681

8. Peters JL, Bolender KA, Pearce JM. Behavioural vs. molecular sources of conflict between nuclear and mitochondrial DNA: The role of male-biased dispersal in a Holarctic sea duck. Mol Ecol. 2012;21(14):3562–75. pmid:22582867

9. Joly S. JML: Testing hybridization from species trees. Mol Ecol Resour. 2012;12(1):179–84. pmid:21899723

10. Barley AJ, Thomson RC. Assessing the performance of DNA barcoding using posterior predictive simulations. Mol Ecol. 2016;25(9):1944–57. pmid:26915049

11. Duchene S, Bouckaert R, Duchene DA, Stadler T, Drummond AJ. Phylodynamic Model Adequacy Using Posterior Predictive Simulations. Syst Biol. 2019;68(2):358–64. pmid:29945220

12. Sidlauskas B, Ganapathy G, Hazkani-Covo E, Jenkins KP, Lapp H, McCall LW, et al. linking big: The continuing promise of evolutionary synthesis. Evolution (N Y). 2010;64(4):871–80. pmid:19895550

13. Burbrink FT, Chan YL, Myers EA, Ruane S, Smith BT, Hickerson MJ. Asynchronous demographic responses to Pleistocene climate change in Eastern Nearctic vertebrates. Ecol Lett. 2016;19(12):1457–67. pmid:27781365

14. Wieringa JG, Boot MR, Dantas-Queiroz M V., Duckett D, Fonseca EM, Glon H, et al. Does habitat stability structure intraspecific genetic diversity? It’s complicated … Front Biogeogr. 2020;12(2).

15. Vasconcellos MM, Colli GR, Weber JN, Ortiz EM, Rodrigues MT, Cannatella DC. Isolation by instability: historical climate change shapes population structure and genomic divergence of treefrogs in the Neotropical Cerrado savanna. Mol Ecol. 2019;28(7):1748–64. pmid:30742734

16. Hickerson MJ, Stahl E, Takebayashi N. msBayes: Pipeline for testing comparative phylogeographic histories using hierarchical approximate Bayesian computation. BMC Bioinformatics. 2007;8:1–7.

17. Oaks JR, Sukumaran J, Esselstyn JA, Linkem CW, Siler CD, Holder MT, et al. Evidence for Climate-Driven Diversification? a Caution for Interpreting Abc Inferences of Simultaneous Historical Events. Evolution (N Y). 2012;67(4):991–1010. pmid:23550751

18. Gehara M, Garda AA, Werneck FP, Oliveira EF, da Fonseca EM, Camurugi F, et al. Estimating synchronous demographic changes across populations using hABC and its application for a herpetological community from northeastern Brazil. Mol Ecol. 2017;26(18):4756–71. pmid:28734050

19. Gratton P, Marta S, Bocksberger G, Winter M, Trucchi E, Kühl H. A world of sequences: can we use georeferenced nucleotide databases for a robust automated phylogeography? J Biogeogr. 2017;44(2):475–86.

20. Carstens BC, Morales AE, Field K, Pelletier TA. A global analysis of bats using automated comparative phylogeography uncovers a surprising impact of Pleistocene glaciation. J Biogeogr [Internet]. 2018 Aug 25;45(8):1795–805. Available from: https://onlinelibrary.wiley.com/doi/10.1111/jbi.13382

21. Espíndola A, Ruffley M, Smith ML, Carstens BC, Tank DC, Sullivan J. Identifying cryptic diversity with predictive phylogeography. Proc R Soc B Biol Sci. 2016;283(1841). pmid:27798300

22. Pelletier TA, Carstens BC. Geographical range size and latitude predict population genetic structure in a global survey. Biol Lett. 2018;14(1).

23. Fonseca EM, Duckett DJ, Carstens BC. P2C2M.GMYC: An R package for assessing the utility of the Generalized Mixed Yule Coalescent model. Methods Ecol Evol. 2021;12(3):487–93.

24. Drummond AJ, Rambaut A, Shapiro B, Pybus OG. Bayesian coalescent inference of past population dynamics from molecular sequences. Mol Biol Evol. 2005;22(5):1185–92. pmid:15703244

25. Ho SYW, Shapiro B. Skyline-plot methods for estimating demographic history from nucleotide sequences. Mol Ecol Resour. 2011;11(3):423–34. pmid:21481200

26. Pybus OG, Rambaut A, Harvey PH. An integrated framework for the inference of viral population history from reconstructed genealogies. Genetics. 2000;155(3):1429–37. pmid:10880500

27. Strimmer K, Pybus OG. Exploring the demographic history of DNA sequences using the generalized skyline plot. Mol Biol Evol. 2001;18(12):2298–305. pmid:11719579

28. Thomé MTC, Carstens BC, Rodrigues MT, Alexandrino J, Haddad CFB. Genomic data from the Brazilian sibilator frog reveal contrasting pleistocene dynamics and regionalism in two South American dry biomes. J Biogeogr. 2021; 19;48(5):1112–23. Available from: https://onlinelibrary.wiley.com/doi/10.1111/jbi.14064

29. Thomé MTC, Carstens BC, Rodrigues MT, Galetti PM, Alexandrino J, Haddad CFB. A role of asynchrony of seasons in explaining genetic differentiation in a Neotropical toad. Heredity. 2021;127(4):363–72. Available from: https://doi.org/10.1038/s41437-021-00460-7 pmid:34304245

30. Rambaut A, Drummond AJ, Xie D, Baele G, Suchard MA. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst Biol. 2018;67(5):901–4. pmid:29718447

31. Paradis E. Pegas: An R package for population genetics with an integrated-modular approach. Bioinformatics. 2010;26(3):419–20. pmid:20080509

32. Hudson RR. Generating samples under a Wright-Fisher neutral model of genetic variation. Bioinformatics. 2002;18(2):337–8. pmid:11847089

33. Rambaut A, Grassly NC. Seq-gen: An application for the monte carlo simulation of dna sequence evolution along phylogenetic trees. Bioinformatics. 1997;13(3):235–8. pmid:9183526

34. Bouckaert R, Heled J, Kühnert D, Vaughan T, Wu CH, Xie D, et al. BEAST 2: A Software Platform for Bayesian Evolutionary Analysis. PLoS Comput Biol. 2014;10(4):1–6. pmid:24722319

35. Drummond AJ, Rambaut A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol. 2007;7(1):1–8. pmid:17996036

36. Matthews BW. Comparison of the predicted and observed secondary structure of T4 phage lysozyme. BBA—Protein Struct. 1975;405(2):442–51. pmid:1180967

37. Gorman B. mltools: machine learning tools. 2018.

38. Kuhner MK, Yamato J. Practical performance of tree comparison metrics. Syst Biol. 2015;64(2):205–14. pmid:25378436

39. Lyra ML, Haddad CFB, Azeredo-Espin AML. Meeting the challenge of DNA barcoding Neotropical amphibians: polymerase chain reaction optimization and new COI primers. Mol Ecol Resour. 2017;17(5):966–80. pmid:28029226

40. Posada D. jModelTest: Phylogenetic model averaging. Mol Biol Evol. 2008;25(7):1253–6. pmid:18397919

41. Thomé MTC, Sequeira F, Brusquetti F, Carstens B, Haddad CFB, Rodrigues MT, et al. Recurrent connections between Amazon and Atlantic forests shaped diversity in Caatinga four-eyed frogs. J Biogeogr. 2016;43(5):1045–56.

42. Fonseca EM, Gehara M, Werneck FP, Lanna FM, Colli GR, Sites JW, et al. Diversification with gene flow and niche divergence in a lizard species along the South American “diagonal of open formations.” J Biogeogr. 2018;45(7):1688–700.

43. Lanna FM, Gehara M, Werneck FP, Fonseca EM, Colli GR, Sites JW, et al. Dwarf geckos and giant rivers: The role of the São Francisco River in the evolution of Lygodactylus klugei (Squamata: Gekkonidae) in the semi-arid Caatinga of north-eastern Brazil. Biol J Linn Soc. 2020;129(1):88–98.

44. Amaral FR, Albers PK, Edwards S V., Miyaki CY. Multilocus tests of Pleistocene refugia and ancient divergence in a pair of Atlantic Forest antbirds (Myrmeciza). Mol Ecol. 2013;22(15):3996–4013. pmid:23786305

45. Magalhaes ILF, Oliveira U, Santos FR, Vidigal THDA, Brescovit AD, Santos AJ. Strong spatial structure, Pliocene diversification and cryptic diversity in the Neotropical dry forest spider Sicarius cariri. Mol Ecol. 2014;23(21):5323–36. pmid:25251608

46. Dalai SC, de Oliveira T, Harkins GW, Kassaye SG, Lint J, Manasa J, et al. Evolution and molecular epidemiology of subtype C HIV-1 in Zimbabwe. AIDS. 2009;23(18):2523–32. pmid:19770693

47. Heller R, Chikhi L, Siegismund HR. The Confounding Effect of Population Structure on Bayesian Skyline Plot Inferences of Demographic History. PLoS One. 2013;8(5):e62992. pmid:23667558

48. Knowles LL. The burgeoning field of statistical phylogeography. J Evol Biol. 2004;17(1):1–10. pmid:15000642

49. Myers EA, Hickerson MJ, Burbrink FT. Asynchronous diversification of snakes in the North American warm deserts. J Biogeogr. 2017;44(2):461–74.

Word count: 5570

Show less

© 2022 Fonseca et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Translate

Bayesian skyline plots (BSPs) are a useful tool for making inferences about demographic history. For example, researchers typically apply BSPs to test hypotheses regarding how climate changes have influenced intraspecific genetic diversity over time. Like any method, BSP has assumptions that may be violated in some empirical systems (e.g., the absence of population genetic structure), and the naïve analysis of data collected from these systems may lead to spurious results. To address these issues, we introduce P2C2M.Skyline, an R package designed to assess model adequacy for BSPs using posterior predictive simulation. P2C2M.Skyline uses a phylogenetic tree and the log file output from Bayesian Skyline analyses to simulate posterior predictive datasets and then compares this null distribution to statistics calculated from the empirical data to check for model violations. P2C2M.Skyline was able to correctly identify model violations when simulated datasets were generated assuming genetic structure, which is a clear violation of BSP model assumptions. Conversely, P2C2M.Skyline showed low rates of false positives when models were simulated under the BSP model. We also evaluate the P2C2M.Skyline performance in empirical systems, where we detected model violations when DNA sequences from multiple populations were lumped together. P2C2M.Skyline represents a user-friendly and computationally efficient resource for researchers aiming to make inferences from BSP.

Details

Title

Assessing model adequacy for Bayesian Skyline plots using posterior predictive simulation

Author

Fonseca, Emanuel M

; Duckett, Drew J; Almeida, Filipe G; Smith, Megan L

; Thomé, Maria Tereza C; Carstens, Bryan C

First page

e0269438

Section

Research Article

Publication year

2022

Publication date

Jul 2022

Publisher

Public Library of Science

e-ISSN

19326203

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1371/journal.pone.0269438

ProQuest document ID

2694381318

Assessing model adequacy for Bayesian Skyline plots using posterior predictive simulation

Jump to:

Full text

Abstract

Details

Suggested sources