Introduction
Limestone karst landforms are considered “islands” of terrestrial and aquatic habitats that often exhibit extraordinary biodiversity (Barr Jr and Holsinger 1985; Clements et al. 2006; Hutchins et al. 2021; Longley 1981). Within karst islands, cave and aquifer systems form a complex, and often human-inaccessible, subterranean landscape for both terrestrial and aquatic organisms. The subterranean complexity contributes to species richness through dispersal limitation, multiple colonizations, opportunities for vicariance, and diverse, sometimes extreme habitats that foster the evolution of ecological, behavioral, and morphological specialization (Bendik et al. 2013; Culver, Pipan, and Schneider 2009).
Karst biotas include many endemic species with relatively narrow geographic ranges that are also sensitive to perturbations. Conservation of these ecosystems and their constituent species is sometimes hampered by a lack of knowledge about their biogeographic, evolutionary, and demographic histories. In particular: (1) The number of distinct lineages or species might be underestimated due to under sampling, insufficient taxonomic knowledge (e.g., Hutchins et al. 2021), or convergent or parallel evolution (Wiens, Chippindale, and Hillis 2003); alternatively, diversity can be overestimated due to conflicts between morphological and genetic data. (2) The extent of species ranges (or the boundaries between them) are often quite difficult to Edwards Plataue delineate given subterranean complexity and inaccessibility in karst regions. As a result of the complexity of subterranean habitat geography, (3) the conduits for population connectivity (via gene flow) can be difficult to predict and might differ among even closely related lineages with divergent adaptive histories (Katz, Taylor, and Davis 2018). Furthermore, the conduits for gene flow can be dynamic depending on aquifer recharge rates and water levels (Longley 1981). We confront these knowledge gaps with population genomics data from extensive sampling of the
The Edwards-Trinity Aquifer system underlies the Edwards Plateau region of central Texas (Figure 1), an area of uplifted limestone that is home to a large number of cave- and spring-associated species. The rich stygobiont diversity of the aquifers is increasingly impacted by encroaching human population density in the area. Human impacts include altered recharge patterns (Lamichhane and Shakya 2019), higher levels of nutrients (Boyer et al. 2002) and contaminants (Diaz et al. 2020), as well as lowered water tables in parts of the aquifer (Green et al. 2011). These anthropogenic stressors are exacerbated by climate change and have the potential to profoundly affect regional and local biodiversity.
[IMAGE OMITTED. SEE PDF]
Among the Edwards Plateau region stygobionts impacted by these stressors are up to 14 species of endemic, fully aquatic, neotenic salamanders of the genus Eurycea (Plethodontidae: Hemidactyliinae) (Devitt et al. 2019) that inhabit springs and caves throughout the Edwards Plateau (Figure 1). Ten of these species are listed as threatened or endangered at the federal or state level (, ). These salamanders have adapted to the aquifer habitats since their colonization of the aquifers and are quite specialized. Some of the species are adapted to deep aquifer habitats and exhibit classic convergent troglobitic (subsurface) morphologies that include loss of eyes, depigmentation, and elongation of limbs and skull, while other species reside in springs and caves and exhibit surface-dwelling morphologies, retaining eyes and pigmentation (Devitt et al. 2019; Hillis et al. 2001). The Eurycea salamanders of the Edwards Plateau are specialized stygobionts that are relatively intolerant of fluctuations in water temperature (Barr and Babbitt 2002; Crow et al. 2016) and quality (Bowles, Sanders, and Hansen 2006). These salamanders are completely reliant upon groundwater, which makes the encroachment of urban areas and climate change increasing conservation concerns (Bendik et al. 2014; Diaz et al. 2020).
Despite considerable research into the phylogeny and population genetic structure of spring-associated salamanders in central Texas (e.g., Bendik et al. 2013; Chippindale et al. 2000; Devitt et al. 2019; Hillis et al. 2001; Lucas et al. 2009), uncertainty remains regarding the boundaries between, and the extent of gene exchange among, the lineages. We focused on a segment of Eurycea diversity in the southeastern Edwards Plateau that is home to several closely related nominal species in the
Devitt et al. (2019) surveyed genomic variation across the diversity of central Texas Eurycea salamanders, inclusive of the
Here, we sampled the
TABLE 1 Sampling details.
Locality number | Locality name | n | Nominal taxonomy | Major aquifer |
1 | Nueces 335 | 2 |
|
Edwards-Trinity |
2 | Fessenden Springs | 23 | E. sp. 2 | Edwards-Trinity |
3 | Hill Country SNA | 11 |
|
Trinity |
4 | Western Kerr | 14 | E. sp. 2 | Edwards-Trinity |
5 | Osborn Spring | 13 |
|
Trinity |
6 | Possum Creek Spring | 14 |
|
Trinity |
7 | Brown Ranch | 42 |
|
Trinity |
8 | Albert and Bessie Kronkosky SNA | 44 |
|
Trinity |
9 | Salamander Spring | 16 |
|
Trinity |
10 | Pecan Springs Cave | 1 |
|
Trinity |
11 | Government Canyon | 22 |
|
Edwards |
12 | Mueller's Ranch | 2 |
|
Trinity |
13 | Helotes Creek Spring | 3 |
|
Trinity |
14 | Maverick Ranch | 28 |
|
Trinity |
15 | Cascade Caverns | 28 |
|
Trinity |
16 | Pfeiffer Ranch | 4 |
|
Trinity |
17 | Leahs Spring | 9 |
|
Trinity |
18 | Peavey's Spring | 1 |
|
Edwards-Trinity |
19 | Leon Springs | 29 |
|
Trinity |
20 | Badweather Pit | 3 |
|
Trinity |
21 | Camp Bullis Stealth | 8 |
|
Trinity |
22 | Camp Bullis Sharon | 12 |
|
Trinity |
23 | Guadalupe River State Park | 22 |
|
Trinity |
24 | Honey Creek SNA Springs | 32 |
|
Trinity |
25 | Preserve Cave | 10 |
|
Trinity |
26 | Sattler's Deep Pit | 2 |
|
Trinity |
27 | Rebecca Spring | 2 |
|
Trinity |
28 | Devil's Backbone | 18 |
|
Edwards |
29 | Ott's Spring | 17 |
|
Trinity |
30 | Hueco Springs | 15 |
|
Edwards |
31 | Comal Spr Run One | 14 |
|
Edwards |
32 | Comal Spr Run Three | 19 |
|
Edwards |
33 | Comal Spr Spring Island | 22 |
|
Edwards |
34 | Jacobs Well | 21 |
|
Trinity |
35 | Fern Bank | 35 |
|
Trinity |
36 | San Marcos below dam | 18 |
|
Edwards |
37 | San Marcos Diversion | 16 |
|
Edwards |
38 | San Marcos Hotel | 15 |
|
Edwards |
Total | 607 |
Materials and Methods
Sampling for this investigation included extensive collections of the focal
Following the methods of Parchman et al. (2012) and Gompert et al. (2014), we constructed a reduced representation genomic library for each of 643 salamanders. For these libraries, genomic DNA was digested with the EcoR1 and Mse1 restriction enzymes. To the resulting fragments, Illumina adaptors with unique 8–10 bp individual multiplex identifier (MID) sequences were ligated. Fragments were then amplified with two rounds of PCR using iProof high-fidelity polymerase (BioRad Inc.). Amplicons from these PCR reactions were then pooled. Fragments between 300 and 450 bp were selected using a BluePippin (Sage Science Inc., Beverly, MA, USA) and the resulting fragments were sequenced on two lanes of Illumina Novaseq 6000 (single read, 100 bp) at the University of Texas Austin Genomic Sequencing and Analysis Facility (Austin, Texas, USA).
Assembly, Alignment, and Variant Calling
From the resulting sequence reads, PhiX reads were removed by assembly to the PhiX genome using bowtie version 1.1.2 (Langmead et al. 2009). Custom scripts were used to remove MIDs from each read and to filter short reads and reads that contained Mse1 adapter sequence. The resulting sequence reads were written to individual files in fastq format (median = 1,042,576 sequence reads per individual). 36 individuals each produced fewer than 400,000 reads and were excluded from further analyses. Excluded individuals originated from 21 distinct localities and included a maximum of four individuals from any one locality. One specimen from the Texas Natural History Collections (from Pfeiffer Ranch (16); we refer to specific localities by name and corresponding number following Table 1 in parentheses) was among the excluded individuals. The reads from the remaining 607 individuals (Table 1, Figure 1, Table S1) were filtered and clustered using the strategy described in the dDocent variant calling pipeline (Puritz, Hollenbeck, and Gold 2014) and CD-hit (Fu et al. 2012). We ignored sequence reads with less than four copies per individual and shared among less than four individuals. The filtered reads were assembled with a homology threshold of 80% (other thresholds from 80% to 95% were investigated but produced very similar assemblies, data not presented). The consensus reads from this de novo assembly were used as the basis for a reference-based alignment of all reads using the aln and samse algorithms of the Burrows-Wheeler Aligner (bwa version: 0.7.12) (Li and Durbin 2009) with a seed length of 20 and up to four mismatches allowed.
Following assembly and alignment, we identified variable sites, or polymorphic single nucleotide sites (SNPs), with bcftools version 1.9 (Li et al. 2009) using the mpileup and call commands, ignoring indels and retaining variable sites if the posterior probability that the nucleotide was invariant was < 0.05. We performed additional filtering on the Variant Call Format file using custom scripts to exclude variable sites with sequence depth less than 607 reads (an average of one read per site per individual, 1×) and greater than 47,246 reads (equal to the mean sequence depth across sites plus two standard deviations; this filter is designed to remove potentially paralogous reads), less than at least 20 reads of the alternative allele, mapping quality less than 30, an absolute value of the mapping quality rank sum test greater than 1.96, an absolute value of the read position rank sum test greater than 1.96, an absolute value of the base quality rank sum test greater than 1.96, minor allele frequency less than 0.05, or missing data for more than 303 individuals (50% of individuals). One variable site per contig was chosen randomly and retained to maximize independence among loci. The genotype likelihoods from the resulting variable sites (SNP loci) were updated using the Bayesian admixture algorithm entropy (Gompert et al. 2014; Shastry et al. 2021) to provide posterior genotype probabilities for analyses.
Analyses
To answer our first two questions pertaining to the organization of genetic variation, we estimated genotypes, allele frequencies, and admixture proportions and used these to illustrate patterns of genomic variation. We estimated admixture proportions for each individual, allele frequencies for each cluster (population), and posterior genotype probabilities for all loci for all individuals using the Bayesian admixture algorithm entropy (Gompert et al. 2014; Shastry et al. 2021). entropy also facilitates model evaluation using standard Bayesian model diagnostics. A series of models were fit to the data varying the number of clusters or populations (k) from 2 to 15. Two MCMC simulations (chains) of 105,000 steps, with a burnin of 5000 steps, retaining every 10th step (total of 10,000 steps for each chain), were run for each k. We calculated Gelman and Rubin's convergence diagnostic and effective sample sizes (Brooks and Gelman 1998; Gelman and Rubin 1992) for each chain with the package coda version 0.19–1 in R (Plummer et al. 2006; R Core Team 2022) to assess model performance and check that the models reached a stable sampling distribution. We rejected model results when the mean Gelman and Rubin's convergence diagnostic across individual values of the admixture proportion, q, was greater than 1.1 or when increasing k did not result in a recognizable cluster. Models for k = 2–5 satisfied these criteria. Attempts to employ more MCMC steps, longer burnin, and different thinning strategies failed to rescue models with higher numbers of clusters (data not presented). Posterior estimates of genotypes were then averaged across all MCMC steps for models k = 2–5. Patterns of variation among individuals were illustrated by ordination of multilocus genotypes using principal component analysis (PCA), and patterns of admixture were illustrated with barplots, all performed in R (R Core Team 2022). To quantify differentiation among sample sites, we calculated genome-average Nei's GST (Nei 1973). GST is an analog of FST that is generally applicable and appropriate for pairwise comparisons of population samples, and calculated as for all pairwise combinations of sites, which we refer to as FST hereafter. FST was calculated as the average across all loci and 1000 bootstrap resamples were used to calculate 95% confidence intervals.
We then used samtools version 0.1.9 to calculate two measures of genetic diversity for all aligned nucleotide sites for each of the 29 localities with samples of ≥ 8 individuals. We estimated Watterson's θ (based on the number of segregating sites) and Tajima's π (nucleotide diversity or heterozygosity) using the expectation–maximization algorithm with 20 iterations (Li 2011), which was sufficient for values of both metrics to converge for all localities.
To specifically address our first question regarding whether genomic variation is organized according to the nominal taxonomy (i.e., three nominal species in the
In a second method for addressing whether genetic variation was organized along nominal taxonomic divisions or major aquifers, we constructed a Bayesian mixed model regression to compare geographic distance, “taxonomic” distance and “aquifer” distance as predictors of population differentiation (Gompert et al. 2014). Specifically, we used this model to ask whether population differentiation was best explained by geography, nominal taxonomy, major aquifer, or combinations of these predictors. The regression model accounts for the correlated error structure in pairwise distance measures (genetic, geographic, taxonomic, and aquifer) by incorporating random effects for each population that represent mean population-specific deviations in the distance matrices following Clarke, Rothery, and Raybould (2002). Thus,
Our third question regarding the possibility of gene exchange during the history of these populations presents challenges because observed admixture could result from incomplete lineage sorting (ancestral polymorphism), historical gene flow, contemporary gene flow, or combinations of these (e.g., Holder, Anderson, and Holloway 2001; Joly, McLenachan, and Lockhart 2009; Machado et al. 2002; Sang and Zhong 2000). It should be noted that many methods for inference of gene flow, including those used here, cannot readily discriminate between secondary contact with gene flow versus gene flow during divergence (Endler 1977; Pinho and Hey 2010). Given these difficulties, we employed three analyses with different approaches in an attempt to understand the history of gene exchange: We first used the methods of Pickrell and Pritchard (2012) as implemented in treemix version 1.12. Allele frequencies from all focal sampling localities with sample sizes of at least eight individuals were used to estimate the hypothesized population graph representing the evolutionary history of the sampled populations assuming no migration or admixture (m = 0). The sample from Fessenden Springs (2) was designated as the outgroup. We then sequentially evaluated the contribution of additional migration nodes (admixture events, m = 1–10). Each model of m migration nodes was replicated with 1000 runs and the proportion of the total explained variance in allele frequencies was calculated for each replicate using a custom Rscript written by D. Card (). The asymptote in the median proportion of variance explained across replicates for each m was used as a guide to determine whether adding migration events improved model fit. Population graphs for m = 0 and for m with the highest median variance explained were plotted using the R package ggtree (Yu et al. 2017).
In a second analysis of gene exchange, we calculated f3 statistics (Patterson et al. 2012; Reich et al. 2009) with the threepop program from treemix version 1.12. f3 statistics specifically test the hypothesis that the evolution of sampled populations conforms to the expectations of a bifurcating model (i.e., the null hypothesis of no history of gene exchange). The test is conducted for sets of three populations, two potential source populations and a target population. Significantly negative f3 statistics indicate a departure from a strictly bifurcating evolutionary history for the three populations due to population admixture in the target population. We calculated f3 statistics for all sampling localities included in the treemix analyses described above.
As a complement to the above analyses designed to detect gene flow, we used EEMS (version 0.0.0.9000) (Petkova, Novembre, and Stephens 2016) to estimate an effective migration surface which provides a picture of the geography of departures from patterns of isolation-by-distance (IBD). This effective migration surface identifies areas with low or high gene flow relative to a stepping-stone model. Our expectation is that gene flow among localities and lineages of Eurycea salamanders should appear as departures from the IBD expectation such that the effective migration surface will illustrate the presence of barriers to and/or corridors of gene flow. If, on the other hand, the history of divergence in these salamanders is one of primary divergence with little or no gene flow, we expect to observe few or no departures from IBD expectations. We estimated the effective migration surface assuming 700 demes and used three MCMC chains with 6 million iterations each, a burnin of 3 million iterations and a thinning interval or 10,000 iterations.
Results
Sequencing produced 671,939,934 sequence reads with MIDs for 607 individuals (median = 1,072,524 reads per individual, standard error = 13,555.63, minimum reads = 420,551, maximum reads = 2,495,535). 42,274,123 reads were retained in the final assembly after filtering from which a SNP set of 16,094 loci was called. Mean sequence depth was 4.3 reads per individual per locus (standard error = 0.052) with an average of 26.4% missing data (i.e., no reads for an individual) per locus (standard error = 0.0012). Ordination of posterior genotype estimates from entropy showed differentiation of the western localities,
[IMAGE OMITTED. SEE PDF]
Clustering analyses for k > 5 failed to meet our criteria for convergence thus we focus on results of k = 2–5. DIC scores were lowest for MCMC chains at k = 5 (Figure 3). Barplots of admixture proportions (Figure 4) mirror the apparent divisions in the PCA (Figure 2). The western localities (1–4) are differentiated first, followed by
[IMAGE OMITTED. SEE PDF]
[IMAGE OMITTED. SEE PDF]
[IMAGE OMITTED. SEE PDF]
Overall, levels of differentiation are low, especially within the
[IMAGE OMITTED. SEE PDF]
Genetic diversity was slightly lower in the
[IMAGE OMITTED. SEE PDF]
Partitioning genotypic variation by nominal taxonomy and major aquifer (Table 1) and spatial autocorrelation using RDA revealed both low levels of variance explained and predictors that are confounded. Taxonomy alone (i.e., not controlling for spatial variation) explained 17.0% of total genotypic variation, while major aquifer alone explained 9.7%. Space, as captured by eight MEMs, all of which were significantly associated with genotypic variation, explained 13.3%. However, the combined model with nominal taxonomy, major aquifer and eight MEMs explained only 21% of the variance. In this combined model, taxonomy explained 4.7%, aquifer explained less than 1% and space accounted for 3.5% of the total genotypic variation (Figure 8), but the overlap of all three predictors (i.e., the variance explained by the combination of all three) was 6.3%, and two-way entanglements between the predictors accounted for the remaining variance explained (Figure 8). In summary, spatial variation in the distribution of genetic variation in these populations is confounded with taxonomy and their distribution across major aquifers (Devitt et al. 2019).
[IMAGE OMITTED. SEE PDF]
The geographical signal detected in the examination of patterns of differentiation using RDA is further revealed by examining the relationship between pairwise differentiation and geographic distance (Figure 9). Within the three nominal focal species, differentiation increases mostly linearly with distance between localities (Figures 9 and 10). The Bayesian regression showed that more distant localities, localities classified as different taxa, and localities in different major aquifers were more genetically distinct (Table 2). Based on LOOIC, a model with terms for taxonomy and geography had the highest predictive accuracy for all localities as well as for the focal
[IMAGE OMITTED. SEE PDF]
[IMAGE OMITTED. SEE PDF]
TABLE 2 Comparison of competing models predicting pairwise genetic distances between localities with taxonomic distance (Taxon), aquifer distance (Aquifer), and geographic distance (Geog)—see text for details.
Model | Coefficient | Mean (95% HDI) | LOOIC | LOOIC SE | dLOOIC | dLOOIC SE | R 2 |
All sites—Taxon and Geog | Intercept | −0.011 (−0.341, 0.296) | 716.6 | 121.9 | 0 | 0 | 0.79 |
β TAX | 0.386 (0.248, 0.510) | ||||||
β GEO | 0.253 (0.200, 0.306) | ||||||
All sites—Taxon and Aquifer | Intercept | −0.004 (−0.313, 0.332) | 727.8 | 87.2 | 11.2 | 47.4 | 0.74 |
β TAX | 0.705 (0.591, 0.827) | ||||||
β AQU | 0.188 (0.075, 0.308) | ||||||
All sites—Full | Intercept | 0.006 (−0.348, 0.323) | 728.3 | 126.7 | 11.7 | 6.4 | 0.79 |
β TAX | 0.384 (0.249, 0.508) | ||||||
β GEO | 0.251 (0.198, 0.308) | ||||||
β AQU | 0.011 (−0.109, 0.119) | ||||||
All sites—Taxon | Intercept | 0.004 (−0.324, 0.362) | 736.7 | 92.8 | 20.1 | 43.0 | 0.74 |
β TAX | 0.749 (0.627, 0.867) | ||||||
All sites—Geog | Intercept | 0.013 (−0.321, 0.373) | 774.4 | 137.6 | 57.8 | 21.2 | 0.77 |
β GEO | 0.342 (0.298, 0.386) | ||||||
All sites—Geog and Aquifer | Intercept | 0.004 (−0.330, 0.344) | 775.9 | 137.5 | 59.3 | 21.4 | 0.77 |
β GEO | 0.340 (0.291, 0.389) | ||||||
β AQU | 0.009 (−0.100, 0.125) | ||||||
All sites—Aquifer | Intercept | 0.013 (−0.336, 0.380) | 861.1 | 93.1 | 144.5 | 51.6 | 0.71 |
β AQU | 0.343 (0.217, 0.476) | ||||||
Focal sites—Taxon and Geog | Intercept | −0.001 (−0.186, 0.178) | 285.8 | 32.3 | 0 | 0 | 0.62 |
β TAX | 0.219 (0.094, 0.335) | ||||||
β GEO | 0.204 (0.149, 0.263) | ||||||
Focal sites—Full | Intercept | −0.003 (−0.205, 0.197) | 300.1 | 32.9 | 14.3 | 20.4 | 0.62 |
β TAX | 0.210 (0.089, 0.327) | ||||||
β GEO | 0.237 (0.1787, 0.295) | ||||||
β AQU | −0.186 (−0.3237, 0.048) | ||||||
Focal sites—Geog and Aquifer | Intercept | 0.002 (−0.193, 0.198) | 317.3 | 35.0 | 31.5 | 22.7 | 0.62 |
β GEO | 0.294 (0.242, 0.346) | ||||||
β AQU | −0.199 (−0.344, −0.056) | ||||||
Focal sites—Geog | Intercept | −0.004 (−0.184, 0.180) | 322.7 | 35.7 | 36.9 | 23.1 | 0.59 |
β GEO | 0.262 (0.214, 0.308) | ||||||
Focal sites—Taxon | Intercept | 0.000 (−0.194, 0.183) | 343.1 | 29.4 | 57.3 | 22.4 | 0.63 |
β TAX | 0.463 (0.352, 0.571) | ||||||
Focal sites—Taxon and Aquifer | Intercept | −0.003 (−0.105, 0.176) | 344.6 | 29.8 | 58.8 | 22.3 | 0.63 |
β TAX | 0.456 (0.348, 0.569) | ||||||
β AQU | 0.035 (−0.105, 0.176) | ||||||
Focal sites—Aquifer | Intercept | 0.000 (−0.172, 0.183) | 417.5 | 31.7 | 131.6 | 26.8 | 0.60 |
β AQU | 0.140 (−0.011, 0.289) |
[IMAGE OMITTED. SEE PDF]
treemix analyses provided further evidence of gene exchange among populations. Adding migration events continued to improve the amount of variance explained until reaching an asymptote around eight migration events (Figure 12). The initial drift tree (population graph with no migration events, m = 0) has patterns similar to the PCA and barplots (Figure 13A) and explained considerable variance in allele frequencies. Adding migration events improved model fit though the increase in variance explained was small (Figure 12). The population graph incorporating eight migration events (m = 8) (Figure 13B) shows numerous migration events among
[IMAGE OMITTED. SEE PDF]
[IMAGE OMITTED. SEE PDF]
Significantly negative f3 statistics were detected in more than 48% of the 10,962 three-locality comparisons and included every locality as a target (i.e., receiving gene flow from the source populations) (Table S6). These results indicate that the history of the central Texas Eurycea is inconsistent with a bifurcating pattern of divergence and are an indication of a history of some gene exchange among localities. These results, in combination with the treemix results, suggest that admixture among localities has been a component of the history of these salamanders.
The estimated effective migration surface revealed distinct areas where gene flow was higher or lower than expected under a two-dimensional stepping-stone model (Figure 14). A corridor (an area of higher gene flow (m), or lower differentiation than expected given the geographic distances) was observed connecting the eastern localities at Devil's Backbone (28) and Ott's Spring (29) with the Comal Springs complex localities (sites 30–33) to the south, Guadalupe River State Park (23), Honey Creek (24) and Preserve Cave (25) in the northern portion of sampling, and to south westerly localities including Camp Bullis (21, 22). Another corridor connects the western localities of Fessenden Springs (2), Hill Country SNA (3) and Western Kerr (4) with southwestern sites Osborn Springs (5), Albert and Bessie Kronkowsky SNA (8) and Government Canyon (11). Another smaller corridor connects Osborn Springs (5) with Maverick Ranch (14) and Cascade Caverns (15). These areas of higher inferred gene flow roughly parallel the inferred migration paths from Treemix (Figure 13) and correspond with the patterns of admixture (Figure 4). There are also several areas that appear as barriers (or areas with lower-than-expected gene flow or higher differentiation than expected given the geographic distances). One major barrier appears between Possum Creek Spring (6), Brown Ranch (7) and Salamander Spring (9) on the east side of the barrier and western localities at Fessenden Springs (2), Hill Country SNA (3) and Western Kerr (4). Another barrier separates three proximal localities in the center of our sampling (Guadalupe River State Park (23), Honey Creek (24) and Preserve Cave (25)) from more westerly localities (Possum Creek Spring (6), Brown Ranch (7) and Salamander Spring (9)). In general, the estimated effective migration surface suggests that there are clear deviations from a simple IDB pattern across the sample range of this study which supports the hypothesis that there has been some gene flow during divergence or since populations and lineages have diverged.
[IMAGE OMITTED. SEE PDF]
Discussion
The history of salamanders of the
This conclusion is supported by several lines of evidence: First, differentiation among localities and taxa (measured as FST) is low (Figures 2 and 9, Tables S2–S5, Figure 6). Second, clustering analyses reveal prevalent admixture among localities and between nominal species (Figure 4). Third, nominal taxonomy explained only slightly more genetic variation than did geography in an RDA, but overall variance explained was low (Figure 8). Moreover, both taxonomy and geography were important predictors in Bayesian mixed models, but geography alone had greater support than taxonomy alone (Table 2). Fourth, analyses of admixture events revealed evidence of extensive gene exchange both within and among lineages (Figures 5 and 13). These patterns of gene exchange suggest the
Both the PCA and clustering analysis provide important insights into the organization of genetic variation within and among localities and lineages. While three loosely organized clusters are apparent within the
Despite the evidence of admixture among
The inferred patterns of gene exchange provide clues to how Eurycea salamanders are, or have been, capable of moving through the aquifers such that the aquifers serve as a conduit for gene exchange. The apparent admixture is unexpected from the perspective that these salamanders are specialized stygobionts and limited to narrow ranges of water temperature and quality (Barr and Babbitt 2002; Bowles, Sanders, and Hansen 2006; Crow et al. 2016). We might have predicted greater isolation and more differentiation based on our understanding of salamander life histories. The evidence of extensive admixture suggests that the aquatic environment in this aquifer system includes, or has included in the recent past, large areas of relatively homogeneous environmental conditions that permit salamander movements and thus gene flow. Additionally, this suggests that there are passages within the aquifer system that allow such movements.
It is important to note the caveat that the analyses presented here cannot discriminate between gene flow that might have occurred during the divergence of the three admixed lineages that we observe in the
Similarly, inferred migration events from treemix also provide an incomplete picture of the history of gene exchange in these salamanders. Eight migration events inferred from treemix analysis explained a small proportion of the variance in allele frequencies among populations (Figure 12) but likely does not represent the full picture of gene flow. There were likely many more events, perhaps of smaller magnitude in terms of numbers of migrants or distance of migrations, that were not detected. The migration events inferred from treemix do not fully comport with the more extensive patterns of admixture observed in clustering analysis (Figure 4), which suggests that even more migrations have occurred in the history of these populations or that some of the admixture occurred during divergence. In addition, replicate runs for each model produced a range of different solutions (Figure 12) perhaps suggesting that many small migration events are also compatible with the data. Indeed, f3 statistics indicate that gene flow was detectable in all localities, emphasizing again that strictly bifurcating relationships are insufficient to capture the history of divergence at all localities.
The results reported here greatly expand what was known from previous work. Chippindale et al. (2000) used 22 allozyme loci and one mitochondrial DNA (mtDNA) locus to survey genetic variation across central Texas Eurycea and documented especially low levels of differentiation among the
Our results comprise the most detailed and geographically intensive investigation of potential gene flow in the stygobiont salamanders of central Texas. We provide evidence that the aquifer might contain important pathways for gene flow among localities and facilitate connectivity. While it is currently difficult to distinguish historical and contemporary gene flow, the picture for the
Author Contributions
Chris C. Nice: conceptualization (equal), data curation (lead), formal analysis (lead), funding acquisition (equal), writing – original draft (lead), writing – review and editing (equal). Katherine L. Bell: conceptualization (supporting), formal analysis (supporting), writing – review and editing (equal). Zachariah Gompert: conceptualization (supporting), formal analysis (supporting), writing – review and editing (equal). Lauren K. Lucas: conceptualization (supporting), data curation (supporting), formal analysis (supporting), writing – review and editing (equal). James R. Ott: conceptualization (supporting), data curation (supporting), writing – review and editing (equal). Ruben U. Tovar: data curation (supporting), writing – review and editing (equal). Paul Crump: conceptualization (equal), data curation (supporting), funding acquisition (equal), writing – review and editing (equal). Peter H. Diaz: conceptualization (equal), data curation (equal), funding acquisition (equal), writing – review and editing (equal).
Acknowledgments
We thank Travis LaDuc at the Texas Natural History Collections, Texas Memorial Museum at The University of Texas at Austin, and Tom Devitt for providing critical samples. We especially thank numerous land owners for access to springs and cave sites. We appreciate the cooperation from the US Air Force for providing access to historical localities. We thank Tom Devitt and David Hillis for constructive discussion. Sequencing was performed by the Genomic Sequencing and Analysis Facility at UT Austin, Center for Biomedical Research Support (RRID:SCR_021713). Computing was performed at the LEAP High Performance Computing Cluster at Texas State University. We thank Texas Parks and Wildlife for funding support through a Conservation License Plate Grant and the Texas Conservation Field Office for support with funding laboratory supplies. The views expressed in this paper are the authors' and do not necessarily reflect the views of the U.S. Fish and Wildlife Service or Texas Parks and Wildlife Department.
Conflicts of Interest
The authors declare no conflicts of interest.
Data Availability Statement
The DNA sequence data analyzed in this manuscript have been archived on NCBI's SRA (PRJNA1057889). Genotype estimates and a predictors file for Redundancy Analysis and BLMM have been archived on Dryad ().
Barr, G. E., and K. J. Babbitt. 2002. “Effects of Biotic and Abiotic Factors on the Distribution and Abundance of Larval Two‐Lined Salamanders (
Barr, T. C., Jr., and J. R. Holsinger. 1985. “Speciation in Cave Faunas.” Annual Review of Ecology and Systematics 16: 313–337.
Bendik, N. F., J. M. Meik, A. G. Gluesenkamp, C. E. Roelke, and P. T. Chippindale. 2013. “Biogeography, Phylogeny, and Morphological Evolution of Central Texas Cave and Spring Salamanders.” BMC Evolutionary Biology 13: 1–18.
Bendik, N. F., B. N. Sissel, J. R. Fields, L. J. O'Donnell, and M. S. Sanders. 2014. “Effect of Urbanization on Abundance of Jollyville Plateau Salamanders (
Bishop, S. C., and M. R. Wright. 1937. “A New Neotenic Salamander From Texas.” Proceedings of the Biological Society of Washington 50: 141–144.
Borcard, D., and P. Legendre. 2002. “All‐Scale Spatial Analysis of Ecological Data by Means of Principal Coordinates of Neighbour Matrices.” Ecological Modelling 153: 51–68.
Borcard, D., P. Legendre, C. Avois‐Jacquet, and H. Tuomisto. 2004. “Dissecting the Spatial Structure of Ecological Data at Multiple Scales.” Ecology 85: 1826–1832.
Bowles, B. D., M. S. Sanders, and R. S. Hansen. 2006. “Ecology of the Jollyville Plateau Salamander (
Boyer, E. W., C. L. Goodale, N. A. Jaworski, and R. W. Howarth. 2002. “Anthropogenic Nitrogen Sources and Relationships to Riverine Nitrogen Export in the Northeastern USA.” Biogeochemistry 57: 137–169.
Brooks, S. B., and A. Gelman. 1998. “General Methods for Monitoring Convergence of Iterative Simulations.” Journal of Computational and Graphical Statistics 7: 434–455.
Burger, W. L., H. M. Smith, and F. E. Potter Jr. 1950. “Another Neotenic Eurycea From the Edwards Plateau.” Proceedings of the Biological Society of Washington 63: 51–58.
Capblancq, T., and B. R. Forester. 2021. “Redundancy Analysis: A Swiss Army Knife for Landscape Genomics.” Methods in Ecology and Evolution 12: 2298–2309.
Chippindale, P. T., A. H. Price, J. J. Wiens, and D. M. Hillis. 2000. “Phylogenetic Relationships and Systematic Revision of Central Texas Hemidactyliine Plethodontid Salamanders.” Herpetological Monographs 14: 1–80.
Clarke, R. T., P. Rothery, and A. F. Raybould. 2002. “Confidence Limits for Regression Relationships Between Distance Matrices: Estimating Gene Flow With Distance.” Journal of Agricultural, Biological, and Environmental Statistics 7: 361–372.
Clements, R., N. S. Sodhi, M. Schilthuizen, and P. K. Ng. 2006. “Limestone Karsts of Southeast Asia: Imperiled Arks of Biodiversity.” Bioscience 56: 733–742.
Crow, J., M. Forstner, K. Ostrand, and J. Tomasso. 2016. “The Role of Temperature on Survival and Growth of the Barton Springs Salamander (
Culver, D. C., T. Pipan, and K. Schneider. 2009. “Vicariance, Dispersal and Scale in the Aquatic Subterranean Fauna of Karst Regions.” Freshwater Biology 54: 918–929.
Devitt, T. J., A. M. Wright, D. C. Cannatella, and D. M. Hillis. 2019. “Species Delimitation in Endangered Groundwater Salamanders: Implications for Aquifer Management and Biodiversity Conservation.” Proceedings of the National Academy of Sciences of the United States of America 116: 2624–2633.
Diaz, P. H., E. L. Orsak, F. W. Weckerly, M. A. Montagne, and D. A. Alvarez. 2020. “Urban Stream Syndrome and Contaminant Uptake in Salamanders of Central Texas.” Journal of Fish and Wildlife Management 11: 287–299.
Dray, S., P. Legendre, and P. R. Peres‐Neto. 2006. “Spatial Modelling: A Comprehensive Framework for Principal Coordinate Analysis of Neighbour Matrices (Pcnm).” Ecological Modelling 196: 483–493.
Endler, J. A. 1977. Geographic Variation, Speciation, and Clines. Princeton, NJ: Princeton University Press.
Excoffier, L., P. E. Smouse, and J. Quattro. 1992. “Analysis of Molecular Variance Inferred From Metric Distances Among DNA Haplotypes: Application to Human Mitochondrial DNA Restriction Data.” Genetics 131: 479–491.
Forester, B. R., J. R. Lasky, H. H. Wagner, and D. L. Urban. 2018. “Comparing Methods for Detecting Multilocus Adaptation With Multivariate Genotype–Environment Associations.” Molecular Ecology 27: 2215–2233.
Fu, L., B. Niu, Z. Zhu, S. Wu, and W. Li. 2012. “Cd‐Hit: Accelerated for Clustering the Next‐Generation Sequencing Data.” Bioinformatics 28: 3150–3152.
Gelman, A., B. Goodrich, J. Gabry, and A. Vehtari. 2019. “R‐Squared for Bayesian Regression Models.” American Statistician 73: 307–309.
Gelman, A., and D. B. Rubin. 1992. “Inference From Iterative Simulation Using Multiple Sequences.” Statistical Science 7: 457–511.
Gompert, Z., L. K. Lucas, C. A. Buerkle, M. L. Forister, J. A. Fordyce, and C. C. Nice. 2014. “Admixture and the Organization of Genetic Diversity in a Butterfly Species Complex Revealed Through Common and Rare Genetic Variants.” Molecular Ecology 23: 4555–4573.
Green, T. R., M. Taniguchi, H. Kooi, et al. 2011. “Beneath the Surface of Global Change: Impacts of Climate Change on Groundwater.” Journal of Hydrology 405: 532–560.
Hillis, D. M., D. A. Chamberlain, T. P. Wilcox, and P. T. Chippindale. 2001. “A New Species of Subterranean Blind Salamander (Plethodontidae: Hemidactyliini: Eurycea: Typhlomolge) From Austin, Texas, and a Systematic Revision of Central Texas Paedomorphic Salamanders.” Herpetologica 57: 266–280.
Holder, M. T., J. A. Anderson, and A. K. Holloway. 2001. “Difficulties in Detecting Hybridization.” Systematic Biology 50: 978–982.
Hutchins, B. T., J. R. Gibson, P. H. Diaz, and B. F. Schwartz. 2021. “Stygobiont Diversity in the San Marcos Artesian Well and Edwards Aquifer Groundwater Ecosystem, Texas, USA.” Diversity 13: [eLocator: 234].
Joly, S., P. A. McLenachan, and P. J. Lockhart. 2009. “A Statistical Approach for Distinguishing Hybridization and Incomplete Lineage Sorting.” American Naturalist 174: E54–E70.
Katz, A. D., S. J. Taylor, and M. A. Davis. 2018. “At the Confluence of Vicariance and Dispersal: Phylogeography of Cavernicolous Springtails (Collembola: Arrhopalitidae, Tomoceridae) Codistributed Across a Geologically Complex Karst Landscape in Illinois and Missouri.” Ecology and Evolution 8: 10306–10325.
Lamichhane, S., and N. M. Shakya. 2019. “Alteration of Groundwater Recharge Areas Due to Land Use/Cover Change in Kathmandu Valley, Nepal.” Journal of Hydrology: Regional Studies 26: [eLocator: 100635].
Langmead, B., C. Trapnell, M. Pop, and S. L. Salzberg. 2009. “Ultrafast and Memory‐Efficient Alignment of Short DNA Sequences to the Human Genome.” Genome Biology 10: [eLocator: R25].
Li, H. 2011. “A Statistical Framework for SNP Calling, Mutation Discovery, Association Mapping and Population Genetical Parameter Estimation From Sequencing Data.” Bioinformatics 27: 2987–2993.
Li, H., and R. Durbin. 2009. “Fast and Accurate Short Read Alignment With Burrows‐Wheeler Transform.” Bioinformatics 25: 1754–1760.
Li, H., B. Handsaker, A. Wysoker, et al. 2009. “The Sequence Alignment/Map Format and Samtools.” Bioinformatics 25: 2078–2079.
Longley, G. 1981. “The Edwards Aquifer: Earth's Most Diverse Groundwater Ecosystem?” International Journal of Speleology 11: 123–128.
Lucas, L., Z. Gompert, J. Ott, and C. Nice. 2009. “Geographic and Genetic Isolation in Spring‐Associated Eurycea Salamanders Endemic to the Edward's Plateau Region of Texas.” Conservation Genetics 10: 1309–1319. [DOI: https://dx.doi.org/10.1007/s10592-008-9710-2].
Machado, C. A., R. M. Kliman, J. A. Markert, and J. Hey. 2002. “Inferring the History of Speciation From Multilocus DNA Sequence Data: The Case of Drosophila pseudoobscura and Close Relatives.” Molecular Biology and Evolution 19: 472–488.
Maclay, R. W. 1995. Geology and Hydrology of the Edwards Aquifer in the San Antonio Area, Texas. Vol. 95. Austin, Texas: US Geological Survey.
McElreath, R. 2020. Statistical Rethinking: A Bayesian Course With Examples in R and Stan.
Mitchell, R. W., and J. R. Reddell. 1965. “
Nei, M. 1973. “Analysis of Gene Diversity in Subdivided Populations.” Proceedings of the National Academy of Sciences of the United States of America 70: 3321–3323.
Nice, C. C., J. A. Fordyce, V. A. Sotola, J. Crow, and P. H. Diaz. 2021. “Geographic Patterns of Genomic Variation in the Threatened Salado Salamander,
Parchman, T. L., Z. Gompert, J. Mudge, F. D. Schilkey, C. W. Benkman, and C. A. Buerkle. 2012. “Genome‐Wide Association Genetics of an Adaptive Trait in Lodgepole Pine.” Molecular Ecology 21: 2991–3005.
Patterson, N., P. Moorjani, Y. Luo, et al. 2012. “Ancient Admixture in Human History.” Genetics 192: 1065–1093.
Petkova, D., J. Novembre, and M. Stephens. 2016. “Visualizing Spatial Population Structure With Estimated Effective Migration Surfaces.” Nature Genetics 48: 94–100.
Pickrell, J., and J. Pritchard. 2012. “Inference of Population Splits and Mixtures From Genome‐Wide Allele Frequency Data.” PLoS Genetics 8, no. 11: [eLocator: e1002967].
Pinho, C., and J. Hey. 2010. “Divergence With Gene Flow: Models and Data.” Annual Review of Ecology, Evolution, and Systematics 41: 215–230.
Plummer, M. 2003. “Jags: A Program for Analysis of Bayesian Graphical Models Using Gibbs Sampling.” In: Proceedings of the 3rd International Workshop on Distributed Statistical Computing, Vol. 124. Vienna, Austria.
Plummer, M., N. Best, K. Cowles, and K. Vines. 2006. “CODA: Convergence Diagnosis and Output Analysis for MCMC.” R News 5: 7–11.
Puritz, J. B., C. M. Hollenbeck, and J. R. Gold. 2014. “dDocent: A RADseq, Variant‐Calling Pipeline Designed for Population Genomics of Non‐Model Organisms.” PeerJ 2: [eLocator: e431].
R Core Team. 2022. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing.
Reich, D., K. Thangaraj, N. Patterson, A. L. Price, and L. Singh. 2009. “Reconstructing Indian Population History.” Nature 461: 489–494.
Rousset, F. 1997. “Genetic Differentiation and Estimation of Gene Flow From F‐Statistics Under Isolation by Distance.” Genetics 145: 1219–1228.
Sang, T., and Y. Zhong. 2000. “Testing Hybridization Hypotheses Based on Incongruent Gene Trees.” Systematic Biology 49: 422–434.
Shastry, V., P. E. Adams, D. Lindtke, et al. 2021. “Model‐Based Genotype and Ancestry Estimation for Potential Hybrids With Mixed‐Ploidy.” Molecular Ecology Resources 21: 1434–1451.
Smith, H. M., and F. E. Potter. 1946. “A Third Neotenic Salamander of the Genus Eurycea From Texas.” Herpetologica 3: 105–109.
Sweet, S. S. 1984. “Secondary Contact and Hybridization in the Texas Cave Salamanders Eurycea neotenes and
Vehtari, A., A. Gelman, and J. Gabry. 2017. “Practical Bayesian Model Evaluation Using Leave‐One‐Out Cross‐Validation and Waic.” Statistics and Computing 27: 1413–1432.
Vehtari, A., D. Simpson, A. Gelman, Y. Yao, and J. Gabry. 2024. “Pareto Smoothed Importance Sampling.” Journal of Machine Learning Research 25: 1–58.
Wiens, J. J., P. T. Chippindale, and D. M. Hillis. 2003. “When Are Phylogenetic Analyses Misled by Convergence? A Case Study in Texas Cave Salamanders.” Systematic Biology 52: 501–514.
Yu, G., D. K. Smith, H. Zhu, Y. Guan, and T. T.‐Y. Lam. 2017. “Ggtree: An R Package for Visualization and Annotation of Phylogenetic Trees With Their Covariates and Other Associated Data.” Methods in Ecology and Evolution 8: 28–36.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2025. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
ABSTRACT
Karst ecosystems often contain extraordinary biodiversity, but the complex underground aquifers of karst regions present challenges for assessing and conserving stygobiont diversity and investigating their evolutionary history. We examined the karst‐obligate salamanders of the
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details



1 Department of Biology, Population and Conservation Biology Program, Texas State University, San Marcos, Texas, USA
2 Department of Biology, University of Nevada, Reno, Nevada, USA
3 Department of Biology, Utah State University, Logan, Utah, USA
4 Department of Integrative Biology, The University of Texas at Austin, Austin, Texas, USA
5 Nongame and Rare Species Program, Wildlife Division, Texas Parks and Wildlife Department, Austin, Texas, USA
6 United States Fish and Wildlife Service, Texas Fish and Wildlife Conservation Office, San Marcos, Texas, USA