Fungi constitute a hyperdiverse kingdom representing an array of ecological lifestyles, including human pathogens, ectomycorrhizae, lichens, and many more (Burgess et al., 2006; Blackwell, 2011; Li et al., 2016; Medeiros et al., 2017; Crossay et al., 2018; Chang et al., 2019; Mujic et al., 2019). Due to their fundamentally microscopic nature and their usually ephemeral reproductive structures (e.g., mushrooms, apothecia, etc.), the identification of fungi has historically been exceptionally difficult, relying on often artificial groupings based on limited morphological features. The advent and maturation of molecular approaches has revolutionized mycology; in the past two decades, a significant number of new orders, classes, and even phyla have been described (Schüβler et al., 2001; Zalar et al., 2005; Hosaka et al., 2006; Schoch et al., 2009; Rosling et al., 2011; Hodkinson et al., 2014). Correspondingly, a massive number of fungal families, genera, and species have been described during this period (Smith et al., 2006; Halling et al., 2012; Wu et al., 2016; Torres‐Cruz et al., 2017; Willis, 2018). Although sequence‐based approaches are not without pitfalls and controversy (Hofstetter et al., 2019), they remain major catalysts for the description and identification of fungal diversity.
Although the study of fungal diversity has been biased toward Northern Hemisphere temperate ecosystems, there have been major efforts to reconcile the gap in knowledge regarding tropical and Southern Hemisphere fungi, with major foci of activity in the eastern paleotropics (Luo et al., 2016; Vadthanarat et al., 2018, 2019; Sukarno et al., 2019), Central and South America (Kuhar et al., 2017; Accioly et al., 2018; Kaishian and Weir, 2018; Ovrebo et al., 2019), Africa (Castellano et al., 2016; Buyck et al., 2018, 2019; Jami et al., 2018), and Australia (Midgley et al., 2018; Davoodian et al., 2019; Ji et al., 2019; Khmelnitsky et al., 2019).
In line with many of the works cited above, systematic mycology studies generally rely on herbaria as sources of samples and repositories for new specimens, allowing the tethering of names and concepts to physical vouchers. Herbaria serve a critical role in the preservation and curation of biological resources and heritage, and ongoing efforts to digitize these resources will continue to positively impact biodiversity sciences (Willis et al., 2017; Thiers and Halling, 2018). Indeed, the acceleration of mycology via molecular approaches has reciprocally enhanced herbarium‐based approaches, with herbaria providing curated collections and molecular techniques providing new insights into these resources.
The Australian Microbiome Initiative (AusMic;
To explore the mycota of Australia, we downloaded an AusMic data set of internal transcribed spacer (ITS1) DNA sequences from fungi found in Australian soils (Bissett et al., 2016). We examined the taxonomic results reported by AusMic, which are derived from the UNITE database taxonomy (Nilsson et al., 2019), then applied filtration steps to the AusMic sequences to allow for the sensible biological interpretation of these data. Next, we compared the filtered sequences against available sequences on GenBank using BLAST (Johnson et al., 2008; Benson et al., 2018). The AusMic data were then compared with a data set of 591 partial ITS sequences from specimens housed in the National Herbarium of Victoria (MEL) at the Royal Botanic Gardens Victoria, Melbourne, Australia; given the research interests of the authors, nearly all of the MEL specimens utilized were epigeous and hypogeous macrobasidiomycetes of the class Agaricomycetes. Furthermore, we generated two different visualizations of diversity for the Australian Agaricomycetes: one provides an overview using filtered operational taxonomic units (OTUs) and the AusMic taxonomy (Fig. 1), while the other is a plot of species diversity based on the AusMic taxonomy against latitude (Fig. 2). Each of our different approaches provided complementary insights, the highlights of which are discussed below.
1 Figure. Histogram of Australian Microbiome Initiative (AusMic) Agaricomycete operational taxonomic units (OTUs) compiled to family and organized by order (background color‐coded based on legend) (total sequences: 27,730).
2 Figure. Histogram of unique taxonomic classifications (species) derived from the AusMic taxonomy for Agaricomycetes per bins of five decimal degrees latitude, estimating the relative diversity of Agaricomycetes across Australia. A total of 1263 unique taxonomic determinations were available within Agaricomycetes; each geographic bin was populated from this pool (counts for each bin are shown above each bar on the graph).
ITS amplicon data and associated metadata generated by AusMic were downloaded from the BioPlatforms Australia data portal (
The sequences were searched against both the National Center for Biotechnology Information (NCBI) Nucleotide (nt) database and an in‐house (MEL) database of fungal ITS sequences using BLASTN version 2.9.0+ with the settings max_target_seqs = 1 and evalue = 1e‐5. In each case, the top BLAST hit was retained if the BLAST alignment covered more than 95% of the query length and the BLAST high‐scoring segment pair identity was greater than 97%. These results were output to a .csv file (Appendix S1). The in‐house database is derived from gDNA extracts of macromycete collections housed at MEL, with the ITS sequences acquired by PCR amplification using the ITS1 (or ITS1‐F) and ITS4 primers (White et al., 1990; Gardes and Bruns, 1993) under the following thermocycling protocol: 95°C for 5 min, 38 cycles of 94°C for 35 s, 50°C for 60 s, and 72°C for 60 s, with a final extension at 72°C for 60 s. After manually inspecting all of the MEL chromatograms to ensure quality and retaining only sequences assembled from multiple reads (i.e., forward and reverse primers and/or multiple sequencing attempts) with unambiguous base calls (a small number of ambiguous base calls were marked as N), a total of 591 ITS sequences were found to be of sufficiently high quality to include in this study. Because AusMic retained only data corresponding to ITS1 (derived from forward reads) for public release (Bissett et al., 2016), we manually trimmed our sequences to correspond to ITS1 as well (some portions of adjacent regions, such as 5.8S, were also retained during the trimming process).
Agaricomycete OTUs present in the filtered AusMic data (192,325 sequences) were visualized with ggplot2 (Wickham, 2016), using R version 3.6.1 (R Core Team, 2019) within RStudio version 1.2.1335 (RStudio Team, 2019). Briefly, the .csv file (Appendix S1) was used as input, and OTUs for which the AusMic classification included ‘phylum = k__fungi_unclassified’ (that is, the sequence could be assigned to the kingdom Fungi, but not to a specific phylum) were removed. This data set was further filtered to include only sequences where the AusMic classification included ‘class = c__Agaricomycetes’, leaving 35,905 sequences. For visualization purposes, OTUs with the classification ‘c__Agaricomycetes_unclassified’, ‘f__Agaricomycetes_family_Incertae_sedis’, or ‘f__unclassified_Agaricomycetes’ were removed. Moreover, in several cases where classification to family level was uncertain, counts of OTUs were merged under the corresponding order name (see Appendix S2). The resultant 27,730 Agaricomycete OTUs were visualized (Fig. 1, Appendix S3).
The following steps were carried out to create a plot of species determinations (based on the AusMic taxonomy) against latitude. Sequences from Antarctica were removed from the AusMic data set as described above. AusMic OTUs with an AusMic classification of ‘phylum = k__fungi_unclassified’ were removed, and the data set was filtered to include only sequences where the AusMic classification included ‘class = c__Agaricomycetes', leaving 216,295 sequences. Latitude information for each sequence was extracted from the metadata file downloaded from BioPlatforms Australia, and for each latitude ‘bin’ of five decimal degrees a list of unique AusMic classifications was created based on the Species column of the table presented in Appendix S4. Counts of unique classifications per bin were visualized using ggplot2 and R version 3.6.1, as described above (Fig. 2).
The results of the BLAST analysis are included in Appendix S1. Of the 192,325 OTUs BLASTed against the NCBI nt database, 46,099 (~24%) retrieved hits. Against the MEL data (591 ITS sequences), 935 (<1% of the 192,325 OTUs) hits were retrieved. A graph of the Agaricomycete OTUs grouped to family and order is shown in Fig. 1. Of the Agaricomycetes present in Australia’s soil mycota (27,730 OTUs), Agaricales was inferred to be the most diverse order (16,337 OTUs), followed by Thelephorales (1991 OTUs), Russulales (1957 OTUs), Cantharellales (1691 OTUs), Sebacinales (1629 OTUs), Boletales (1002 OTUs), and Polyporales (907 OTUs). Overall, 17 orders of Agaricomycetes were inferred. Counts of the Agaricomycete OTUs at familial and ordinal levels are included in Appendix S3. A graph of Australian Agaricomycete species (derived from a pool of 1263 unique taxonomic determinations from AusMic) plotted against binned latitude values is presented in Fig. 2.
Agaricales, which includes many familiar gilled mushrooms (e.g., species of Agaricus L., Amanita Pers., and numerous “weedy” groups that occur on wood matter, leaf litter, and soil such as Mycena (Pers.) Roussel, Marasmius Fr., and Psathyrella (Fr.) Quél.), is inferred to be the most diverse order of Agaricomycetes in Australia’s soil mycota, with 16,337 OTUs (Fig. 1). This is significantly higher than the other 16 orders of Agaricomycetes inferred in our study. It should be noted that the AusMic classifications, which are reflected in Fig. 1, should be interpreted with some caution, as the taxonomic assignment was performed using a bootstrap‐based confidence cut‐off value of at least 60% (see Bissett et al., 2016 for details on OTU taxonomic assignment for AusMic).
Using the AusMic taxonomic determinations, we observed a general decrease in Australian Agaricomycete species diversity toward the −20.0 to −25.0 latitudes (decimal degrees), which harbor the lowest diversity of all the latitude bins (Fig. 2). This is likely due to these areas comprising substantial portions of Australia’s arid interior. Using the AusMic taxonomy alone underestimates diversity, as each taxonomic determination can correspond to many OTUs.
The BLAST comparisons of the AusMic data to GenBank and the MEL data set yielded ecological, morphological, and taxonomic insights. One example of an ecological insight arose from the 43 AusMic OTUs that matched a Mycena sequence from GenBank (AY627835.1) that was generated from fungal material associated with roots of Epacris pulchella Cav. (Ericaceae). Although the genus Mycena is generally known to be non‐mycorrhizal, this result aligns with other instances of Mycena species reported to be in mycorrhizal or mycorrhizal‐like relationships with Ericaceae and other groups (Zhang et al., 2012; Grelet et al., 2017). We are confident in the generic determination of Mycena in this case because (a) the AusMic classification placed the 43 OTUs in Mycena (except for two OTUs that fell into “unclassified Agaricales”), (b) a post‐hoc BLAST search of GenBank using AY627835.1 retrieved additional Mycena sequences, and (c) four of the 43 AusMic OTUs hit a Mycena specimen in the MEL data.
For morphological insights, the MEL data set was especially useful. MEL houses many specimens of sequestrate fungi (enclosed or truffle‐like, and often buried in soil), which are diverse and abundant throughout Australia. Our BLAST analysis of the AusMic OTUs against the MEL data retrieved numerous hits for the sequestrate genus Zelleromyces Singer & A. H. Sm. (Russulaceae). While corresponding GenBank hits and AusMic determinations occasionally reported sequestrate genera, the majority of GenBank and AusMic determinations corresponding to MEL specimens of sequestrate Russulaceae were for the epigeous mushroom genera Russula Pers. and Lactarius Pers. As such, the MEL data set elucidates the morphological form of some of our study organisms. Although the majority of sequestrate Russulaceae genera are polyphyletic within the family (Vidal et al., 2019), indicating the need for systematic revision, the retention of “field identification” names on herbarium specimens adds another layer of information to this study.
Our analysis provides insights into the differences in taxonomic classification between the AusMic, GenBank, and MEL taxon determinations. Many MEL specimens have been identified by taxonomic specialists, therefore, in some cases MEL hits provided informative names where AusMic and GenBank did not. For example, the AusMic sequence 459338 (Appendix S1) is determined as “Agaricales unclassified” and retrieved no GenBank hits; however, a MEL sequence for Richoniella Costantin & L. M. Dufour (a sequestrate genus of the Entolomataceae) was retrieved. A post‐hoc BLAST search of the AusMic sequence 459338 against GenBank to include identities as low as 82% retrieved taxonomic determinations at various levels, such as Entoloma (Fr.) P. Kumm. and “Uncultured Agaricales.” In other cases, the determinations of MEL specimens at high taxonomic ranks were better achieved using AusMic or GenBank. For example, the AusMic sequence 407378 retrieved a specimen submitted to MEL with only the ordinal identification of Agaricales; however, AusMic determined it as a Crepidotus (Fr.) Staude, and while no GenBank hits above a 97% identity were found, a post‐hoc search below 97% retrieved numerous Crepidotus hits.
The use of ITS sequences for fungal metagenomics is powerful, but not without problems. The existence of extensive intragenomic variation in fungal ITS sequences due to multiple polymorphic copies within a species has been reported for various orders of Ascomycota and Basidiomycota (e.g., Vydryakova et al., 2012; Stadler et al., 2020). The causes of this are likely heterogeneous, and may include the release of concerted evolution and/or the multinucleate (dikaryotic) condition of the Ascomycota and Basidiomycota (Roper et al., 2011; Roberts and Gladfelter, 2015). This variability in fungal ITS sequences can lead to the overestimation of species diversity (Lindner and Banik, 2011), which may be a caveat for the interpretation of the AusMic data. The utilization of alternative loci for fungal metagenomics, such as rpb2 (Větrovský et al., 2016) or 28S (Kivlin et al., 2011), may provide a fruitful basis of comparison and calibration for ITS‐based studies in the future.
In summary, our work outlines protocols to (a) establish OTUs from large metagenomic data sets while avoiding a potential overestimation of diversity, (b) cross‐reference these OTUs against existing taxonomies and sequence data, (c) use geographic metadata and taxonomic determinations from metagenomic studies to analyze diversity against geographic variables, and (d) provide useful outputs and visualizations of these findings. We draw insights into the Australian fungi, especially Agaricomycetes. Our methods can be applied and expanded with the data used in this study or similar data.
The authors acknowledge support from Bioplatforms Australia through funding from the Australian Government National Collaborative Research Infrastructure Strategy, through use of the Biomes of Australian Soil Environments (BASE) data, and through contributions toward sequencing efforts for the National Herbarium of Victoria (MEL) database. This study was also supported in part by an Australian Biological Resources Study grant (RF217‐63).
N.D. contributed to the conception, planning, and writing of this article and generated a portion of the MEL data. C.J.J. contributed to the planning of this article and conducted the bioinformatic analyses, including the data visualization. G.D.H. generated most of the MEL data and contributed to conversations reflected in the paper. T.L. contributed to the planning of this study, generated a portion of the MEL data, and contributed to the information in the paper.
The AusMic data utilized in this study were downloaded from the BioPlatforms Australia data portal (
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2020. This work is published under http://creativecommons.org/licenses/by-nc/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Premise
Motivated to make sensible interpretations of the massive volume of data from the Australian Microbiome Initiative (AusMic), we characterize the soil mycota of Australia. We establish operational taxonomic units (OTUs) from the data and compare these to GenBank and a data set from the National Herbarium of Victoria (MEL), Melbourne, Australia. We also provide visualizations of Agaricomycete diversity, drawn from our analyses of the AusMic sequences and taxonomy.
Methods
The AusMic internal transcribed spacer (ITS) data were filtered to create OTUs, which were searched against the National Center for Biotechnology Information Nucleotide database and the MEL database. We further characterized a portion of our OTUs by graphing the counts of the families and orders of Agaricomycetes. We also graphed AusMic species determinations for Australian Agaricomycetes against latitude.
Results
Our filtering process generated 192,325 OTUs; for Agaricomycetes, there were 27,730 OTUs. Based on the existing AusMic taxonomy at species level, we inferred the diversity of Australian Agaricomycetes against latitude to be lowest between −20 and −25 decimal degrees.
Discussion
BLAST comparisons provided reciprocal insights between the three data sets, including the detection of unusual root‐associated species in the AusMic data, insights into mushroom morphology from the MEL data, and points of comparison for the taxonomic determinations between AusMic, GenBank, and MEL. This study provides a tabulation of Australian fungi, different visual snapshots of a subset of those taxa, and a springboard for future studies.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer