Metatranscriptomics as a tool to identify fungal species and subspecies in mixed communities

Vanesa R. Marcelino; Laszlo Irinyi; John-Sebastian Eden; Wieland Meyer; Edward C. Holmes; Tania C. Sorrell

doi:10.1101/584649

Abstract

High-throughput sequencing (HTS) enables the generation of large amounts of genome sequence data at a reasonable cost. Organisms in mixed microbial communities can now be sequenced and identified in a culture-independent way, usually using amplicon sequencing of a DNA barcode. Bulk RNA-seq (metatranscriptomics) has several advantages over DNA-based amplicon sequencing: it is less susceptible to amplification biases, it captures only living organisms, and it enables a larger set of genes to be used for taxonomic identification. Using a defined mock community comprised of 17 fungal isolates, we evaluated whether metatranscriptomics can accurately identify fungal species and subspecies in mixed communities. Overall, 72.9% of the RNA transcripts were classified, from which the vast majority (99.5%) were correctly identified at the species-level. Of the 15 species sequenced, 13 were retrieved and identified correctly. We also detected strain-level variation within the Cryptococcus species complexes: 99.3% of transcripts assigned to Cryptococcus were classified as one of the four strains used in the mock community. Laboratory contaminants and/or misclassifications were diverse but represented only 0.44% of the transcripts. Hence, these results show that it is possible to obtain accurate species- and strain-level fungal identification from metatranscriptome data as long as taxa identified at low abundance are discarded to avoid false-positives derived from contamination or misclassifications. This study therefore establishes a base-line for the application of metatranscriptomics in clinical mycology and ecological studies.

Introduction

Microscopic fungal species, such as yeasts and some filamentous fungi, do not live in isolation, they are most commonly found within mixed microbial communities inhabiting soil, water systems, plants and animal hosts. Assessing the diversity of fungi in mixed communities is important because different fungal taxa may exhibit distinctive phenotypes, and consequently may have different pathogenicity or functional roles. For example, in the rhizosphere, changes in fungal community composition have been associated with shifts in nutrient cycling (Hannula et al. 2017). Humans also harbor, or are exposed to, a diverse fungal community that provides a source of opportunistic pathogens (Bandara et al. 2019; Huffnagle & Noverr 2013; Seed 2014). Although it is typically assumed that invasive fungal infections are caused by a single strain, multiple Candida strains have been observed during the course of a single episode of infection (Soll et al. 1988). Furthermore, nearly 20% of patients with cryptococcosis are infected with multiple strains, with different phenotypes and virulence traits (Desnos-Ollivier et al. 2015; Desnos-Ollivier et al. 2010). Strain-level fungal diversity may influence therapeutic responsiveness and needs further investigation.

Despite its importance, fungal taxonomic diversity is poorly characterized. From over two million fungal species estimated to exist, less than 8% have been described (Hawksworth & Lucking 2017). Even well-known fungal species are often overlooked during routine diagnostic procedures, surveillance and biodiversity surveys (Brown et al. 2012; Enaud et al. 2018; Yahr et al. 2016). This is in part due to challenges in the detection and classification of these organisms, especially microscopic and cryptic species, for example, the etiologic agents of cryptococcosis. Currently, two species complexes are recognized: Cryptococcus neoformans and Cryptococcus gattii (Kwon-Chung et al. 2002). Seven major haploid lineages are found within these two species complexes (C. neoformans species complex: VNI, VNII, VNIV, and C. gattii species complex: VGI, VGII, VGIII and VGIV) and their recognition as distinct biological species has been debated (Hagen et al. 2015; Kwon-Chung et al. 2017; Ngamskulrungroj et al. 2009). Being able to distinguish closely-related lineages is important because their phenotype, virulence and ecophysiology can vary substantially. For example, the JEC21 and B-3501 strains of C. neoformans var. neoformans (VNIV) are 99.5% identical at the genomic sequence level but differ substantially in thermotolerance and virulence (Loftus et al. 2005). Likewise, different virulence and antifungal tolerance traits were observed within lineages of C. gattii VGIII (Firacative et al. 2016).

The introduction of high-throughput sequencing (HTS) marked a new era in mycological research, where the vast diversity of fungi can be studied without the need for culture (Nilsson et al. 2019). To date, amplicon sequencing of marker genes (metabarcoding) has been the most popular HTS method used to identify fungal species in mixed communities. Despite its indisputable utility, metabarcoding surveys are affected by PCR amplification biases, and even abundant species can go undetected due to primer mismatch (Marcelino & Verbruggen 2016; Nilsson et al. 2019; Tedersoo et al. 2015). In addition, DNA fragments from dead organisms inflate biodiversity estimates in metabarcoding surveys (Carini et al. 2016). Stool samples, for instance, naturally contain food-derived DNA, which cannot be distinguished from the genetic material of the resident gut microbiota when using DNA-based methods. These challenges can be circumvented by directly sequencing actively transcribed genes, via RNA-Seq, hence avoiding the amplification step, and obtaining an unbiased characterization of the living microbial community. Metatranscriptomics has been used to identify RNA viruses in a range of animal samples (Shi et al. 2016; Shi et al. 2017; Wille et al. 2018; Zhang et al. 2018) and to characterize the functional profile of microbial communities (Bashiardes et al. 2016; Kuske et al. 2015). Studies applying metatranscriptomics to mycorrhizal communities have provided valuable insights into the functional roles of fungi in these symbiotic systems (Gonzalez et al. 2018; Liao et al. 2014). However, links between functional and species-level taxonomy have been sought infrequently, likely because fungal identification from metatranscriptome data is considered unreliable below phylum level (Nilsson et al. 2019). Critically, it is currently unknown whether metatranscriptomics can accurately identify fungi at the species and subspecies level within a mixed community. This information is fundamental to the investigation of the potential and utility of metatranscriptomics in diagnostics and ecological studies.

Herein, we evaluated the utility of metatranscriptomics as a tool for the simultaneous identification of fungal species, using a defined mock community containing 15 fungal species. In addition, we investigated whether strains belonging to sister species, such as the C. neoformans and C. gattii species complexes could be identified correctly using metatranscriptomics. Rather than focusing on marker genes, we sought to classify fungal species using the information from all expressed genes, using the totality of NCBI’s nucleotide collection as a reference database. This study paves the way to apply state-of-the art techniques in fungal biodiversity surveys and clinical diagnostics.

Methods

A defined fungal community was constructed from 17 isolates, including 15 fungal species and three strains of the C. neoformans species complex in addition to one strain of C. gattii (Table 1). Fungal strains were obtained from the Westmead Mycology Culture Collection and cultured on Sabouraud agar at 27°C for 72 hours. A sweep of colonies was made with a disposable inoculating loop and dispersed in PBS. Fungal cells were quantified in a Neubauer chamber and their concentration adjusted such that the fungal mixture contained equal concentrations of each species (10⁸ cells/mL). RNA was isolated with the RNeasy Plus kit (Qiagen), following the manufacture’s protocol, with an initial freeze-thaw step in liquid nitrogen to disrupt fungal cells. The quantity and quality of the RNA extract was determined with the Nanodrop Spectrophotometer (Thermo Scientific) and the Agilent 2200 TapeStation. As some residual DNA was detected, the RNA extract was further treated with DNase I (Qiagen). Ribosomal depletion (Ribo-Zero Gold technology), library preparation and sequencing (Illumina HiSeq HT, 125bp Paired End) were performed by the Australian Genomics Research Facility. The raw sequence data were deposited in the NCBI Short Read Archive (accession PRJNA521097).

View this table:

Table 1.

Species and strains used to construct a mock fungal community for metatranscriptome sequencing.

Sequence reads containing more than five ambiguous positions or with average quality scores ≤ 25 were filtered from the dataset using prinseq-lite v.0.20.4 (Schmieder & Edwards 2011) with the options -ns_max_n 5 -min_qual_mean 25 -out_format 3. Assembly of sequence reads into contigs was performed with Trinity v.2.5.1 (Grabherr et al. 2011). Contigs were mapped to the NCBI nucleotide collection using KMA (Clausen et al. 2018), a novel approach that has proven to be more accurate than other mapping software. Prior to mapping, NCBI’s taxonomic identifier codes (taxids) were appended to each sequence record in the nucleotide collection, and the reference database was indexed using KMA’s options -NI -Sparse TG. Contigs were then mapped to the indexed database with the options -mem_mode -and -apm f. Matches to the reference database with low support (i.e. coverage < 20 and depth < 0.05) were excluded from the analyses. The species-level taxonomic classifications were based on NCBI’s taxonomy identifiers (taxids) to minimize the issue of changing species nomenclature (Federhen 2012). Subspecies-level classifications within the Cryptococcus neoformans and C. gattii species complexes were examined manually.

Abundance was estimated at the level of sequence reads and transcripts. For read-level abundances, sequence reads were mapped to transcripts using Bowtie2 (Langmead & Salzberg 2012) and quantified in Transcripts Per Million (TPM) with RSEM (Li & Dewey 2011), using the Trinity pipeline. For transcript-level abundances, the depth values estimated within KMA were used, which is the total number of nucleotides (in each contig) covering the reference sequence divided by the length of the reference sequence. The number and length of assembled contigs for each taxon is likely a better proxy for species abundance than read-level abundances (which are subject to gene expression), and therefore were used for graphic representation and analyses. For simplicity, we refer as ‘abundance’ the transcript-level abundance, unless otherwise stated.

It would be logical to expect that species with larger and gene-rich genomes would express a greater number of transcripts. To test for this potential correlation, genome sizes and the estimated number of proteins were obtained from the Fungal Genome Size Database (Kullman et al. 2005), Loftus et al. (2005), Muñoz et al. (2018) and NCBI’s Genome database (Supplementary table S1). The correlation coefficients between genome size, number of proteins and abundance of transcripts were estimated using Person’s correlation and visualized using the R package ggpubr (Kassambara 2017).

Results

RNA sequencing yielded a total of 26,558,491 paired end reads, of which 98.3% passed quality control. Overall, 277,404 contigs (transcripts) were obtained, from which 202,219 (72.9%) were classified. The majority of the sequence reads (80.2%) mapped to a classified contig. Of the 15 fungal species sequenced, 13 were retrieved and correctly classified at the species level (Figure 1, Table 2, Supplementary table S2). The two false-negatives were Debaryomyces hansenii and Schizosaccharomyces pombe; these may have been misclassified as another fungus or were lost due to cell pooling inaccuracy and/or RNA extraction biases. A small proportion of bacterial transcripts (0.03%) and other eukaryotic microbes (0.4%, including 31 fungi that were not present in the mock community) was also observed (Table 2, Supplementary table S2), which likely represent laboratory contaminants and misclassifications (see discussion). However, these were present at a consistently lower frequency than true members of the mock community, with the most common – Candida glycerinogenes – only present in 0.08% of the transcripts. Some of the transcripts were assigned to entries in GenBank that do not have a species-level classification (e.g. Candida sp. and Pichia sp.). These assignments were considered misclassifications here, although it is possible that the species in our mock community are the correct species-level identity of these GenBank sequences.

View this table:

Table 2.

Abundance of reads (TPM) and abundance of transcripts (Depth) per fungal species detected with metatranscriptomics. True members of the mock community – at species level – are shown in bold.

Figure 1.

Relative abundance of transcripts assigned to microbial species recovered in the metatranscriptome of a mock community. See Table 2 for a full list of species and more details about their abundance.

Overall, the commonest species detected was C. neoformans, which was to be expected as it comprised three strains in the mock community and therefore was three times more abundant than other fungal species. Transcripts belonging to Candida tropicalis and Pichia kudriavzevii (former Candida krusei) – were also common (19.2% and 18.8%, respectively), while C. albicans, C. orthopsilosis and C. glabrata (other causes of candidaemia in humans) were detected at lower abundance (2.0 – 2.9%). There was no relationship evident between abundance of transcripts and phylogenetic relatedness. Genomes with low GC content can be overrepresented in metagenomic sequencing (Shakya et al. 2013). Conversely, some of the species detected here in high abundance (Cryptococcus neoformans and Clavispora lusitaniae) have a higher GC content than most other fungal species (Dujon 2010), suggesting that GC bias is unlikely to affect our results. No correlation between abundance of transcripts and genome size or estimated number of proteins was observed (p > 0.05, Supplementary figure 1).

Molecular type and strain-level variation within the Cryptococcus neoformans and C. gattii species complexes was also detected, with contigs matching to C. gattii VGI WM 276, C. neoformans var. grubii VNI H99 and C. neoformans var. neoformans VNIV strains B-3501A and JEC21 (Figure 2, Supplementary table S3). A proportion of the transcripts (1.6%) matched with equal probability scores to both strains of C. neoformans var. neoformans (B-3501A and JEC21, Supplementary tables S2 and S3). From the transcripts classified as Cryptococcus spp., 99.3% were classified as one of the four Cryptococcus strains (or both B-3501A and JEC21) used in the mock community. It is possible that misclassifications occurred within the strains analyzed. For example, transcripts originally from JEC21 might have been classified as B-3501A and vice versa. As it is not possible to know from which strain the transcripts originated, these possible misclassifications would be undetected.

Figure 2.

Strain-level classifications of taxa within the Cryptococcus neoformans and C. gattii species complexes.

Discussion

Our metatranscriptomics approach yielded taxonomic identification of fungi from a defined mock community with high success, while false-positives were detected at far lower abundance. These results indicate that it is possible to obtain accurate species- and strain-level identifications for fungi from metatranscriptome data, as long as taxa identified at low abundance are removed from the analyses to avoid false-positives derived from contamination or misclassifications.

Taxonomic classification at species and strain levels using metabarcoding and metagenomic data has been considered inaccurate (Nilsson et al. 2019; Sczyrba et al. 2017; Yamamoto et al. 2014), raising the question of how our metatranscriptomics approach succeeded in identifying closely-related fungal strains. Metabarcoding relies on a single marker gene (Banchi et al. 2018; e.g. McGuire et al. 2013; Schmidt et al. 2013), which does not contain sufficient phylogenetic information to differentiate some closely related fungal lineages (Balasundaram et al. 2015; Nilsson et al. 2008). Metatranscriptomics, on the other hand, yields data on all expressed coding sequences. Classifications derived from metagenomes are likely to be equally accurate as the ones obtained from metatranscriptomes, except that dead organisms might also be sequenced. Additionally, we used a new alignment method that is both highly accurate and fast (Clausen et al. 2018), allowing us to map sequences against the complete NCBI nucleotide collection. Complete genomes of all fungal isolates used here are available in NCBI, and it is likely that the accuracy of identifications is reduced for poorly-documented microorganisms. However, it is possible to extract informative genes from metatranscriptome data and subsequently perform phylogenomic analyses to identify rare and novel taxa (e.g. Shi et al. 2017; Wille et al. 2018; Zhang et al. 2018). Besides being highly accurate, metatranscriptomics is less susceptible to amplification bias, no information about the community members is required a priori, and it only detects functionally active members of a microbial community. These advantages make metatranscriptomics a promising tool in biodiversity surveys, functional assessments of microbial communities, pathogen detection and biosecurity surveillance (e.g. Kuske et al. 2015; Shi et al. 2016; Wille et al. 2018).

Even though false-positives were present at low abundance, they pose a challenge in the interpretation of metatranscriptomic and metagenomic data. False-positives generally result from spurious classifications and laboratory contaminants, which may be common in laboratory reagents (Salter et al. 2014). However, metatranscriptomics is less sensitive to laboratory contamination than DNA-based metagenomics or metabarcoding, as only living microorganisms are sequenced. Nevertheless, contamination can occur at all stages of the library preparation and is routinely observed in RNA-Seq studies (Quince et al. 2017; Strong et al. 2014). Misclassifications occur because some genome regions are very similar (or identical) across closely-related species and cannot be differentiated. Errors in reference databases can also result in misclassifications. Sequences attributed to incorrectly-classified species are not uncommon in GenBank and result in downstream classification errors (Li et al. 2018). It is also not unusual to find bacterial regions misassembled into eukaryotic genomes (e.g. Koutsovoulos et al. 2016), which can result in sequences from common laboratory contaminants being classified as a eukaryote. Filtering out organisms found in low abundance is an option to reduce the incidence of false-positives in downstream analyses. In this study, filtering organisms for which the abundance of transcripts is lower than 0.1% would eliminate false-positives, at the cost of excluding one true-positive from the analyses (Table 2). The application of this abundance-filtering step might not be feasible when sequencing depth (per microbial species) is limited. Species present in low abundances will be represented by a small number of transcripts and so are more likely to be misclassified or undetected.

The abundance of transcripts and sequence reads can vary according to genome size, number of coding sequences and gene expression. Therefore, the abundance disparity across species observed here is unsurprising. Interestingly, we found no correlation between the abundance of transcripts and genome size or number of genes (Supplementary figure 1). Imprecise estimates of cell abundance and RNA extraction biases could also have influenced abundance estimates, and might be the reason why two species in the mock community (D. hansenii and S. pombe) were not detected in the analyses. Metabarcoding studies have suggested that performing DNA extraction in triplicate minimizes biases for bacteria, but it had no effect in fungal communities (Feinstein et al. 2009). To our knowledge, the effect of RNA extraction bias in metatranscriptomics has yet to be studied. As metagenomics surveys are not affected by gene expression, they might be more appropriate for studies where it is important to quantify species abundance.

Although fungal species and their genes can be confidently identified, it remains challenging to link some genes with particular species using metatranscriptomics. A large portion of fungal genomes are highly similar among species, making it difficult, if not impossible, to infer which species in the community are expressing which genes. Recently, a method was developed to perform species-level functional profiling of metagenome data (Franzosa et al. 2018). This method, however, relies on a reference database of complete genomes that currently contains few fungal representatives, limiting its application in fungal metagenomics. Contrary to metatranscriptomics, metagenomics yields coding and non-coding sequences, which can facilitate linking genes to species if sequencing depth is large enough to assemble large parts of fungal genomes (e.g. Olm et al. 2019).

In sum, we show that metatranscriptomics is a useful approach to identify fungal species and subspecies in mixed samples. The major advantages of metatranscriptomics over other HTS technologies include the selective sequencing of living organisms and the ability to detect a wide range of microorganisms in one step, which has multiple applications in biological research, surveillance and diagnosis. There is an increasing literature reporting that virulence and antimicrobial tolerance traits vary within species (Firacative et al. 2016; Rizzetto et al. 2013; Schauwvlieghe et al. 2017; Strope et al. 2015) and that multiple strains or species can co-infect a host (Desnos-Ollivier et al. 2010; Gupta et al. 2014; Seki et al. 2014; Soll et al. 1988; Tati et al. 2016). The high discriminatory power obtained for closely-related lineages of Cryptococcus provides a good example of where metatranscriptomics would be valuable in precision medicine, where therapy practices are defined according to strain-specific pathogenicity and drug susceptibility traits. However, it must be acknowledged that metatranscriptomics also has limitations that are common to high-throughput sequencing methods, as it is susceptible to DNA/RNA extraction biases, contamination and misclassifications. These limitations can be significantly minimized if appropriate controls are in place (e.g. abundance filtering before statistical analyses). Besides its application to identify well-known fungal species, metatranscriptomics can help to identify novel functional roles of fungi (e.g. Gonzalez et al. 2018; Liao et al. 2014) and novel species when used within a phylogenomic framework.

Acknowledgments

We thank Fabio Santos and Krystyna Maszewska for helping with the fungal cultures and cell abundance estimates, and the High Performance Computing support at Sydney University. TCS and VRM are Sydney Medical Foundation Fellows whose work is supported by the Sydney Medical School Foundation. This work was funded in part by an NHMRC Centre of Research Excellence Grant (APP1102962) to TCS and an NHMRC grant (APP1121936) to WM and TC. ECH is funded by an ARC Australian Laureate Fellowship (FL170100022).

References

↵
Balasundaram SV, Engh IB, Skrede I, Kauserud H (2015) How many DNA markers are needed to reveal cryptic fungal species? Fungal Biol 119: 940–945.
OpenUrl
↵
Banchi E, Ametrano CG, Stankovic D, et al. (2018) DNA metabarcoding uncovers fungal diversity of mixed airborne samples in Italy. PLoS One 13: e0194489.
OpenUrl
↵
Bandara H, Panduwawala CP, Samaranayake LP (2019) Biodiversity of the human oral mycobiome in health and disease. Oral Dis 25: 363–371.
OpenUrl
↵
Bashiardes S, Zilberman-Schapira G, Elinav E (2016) Use of Metatranscriptomics in Microbiome Research. Bioinform Biol Insights 10: 19–25.
OpenUrl CrossRef
↵
Brown GD, Denning DW, Gow NA, et al. (2012) Hidden killers: human fungal infections. Sci Transl Med 4: 165rv113.
OpenUrl CrossRef
↵
Carini P, Marsden PJ, Leff JW, et al. (2016) Relic DNA is abundant in soil and obscures estimates of soil microbial diversity. Nat Microbiol 2: 16242.
OpenUrl
↵
Clausen P, Aarestrup FM, Lund O (2018) Rapid and precise alignment of raw reads against redundant databases with KMA. BMC Bioinformatics 19: 307.
OpenUrl
↵
Desnos-Ollivier M, Patel S, Raoux-Barbot D, et al. (2015) Cryptococcosis Serotypes Impact Outcome and Provide Evidence of Cryptococcus neoformans Speciation. MBio 6: e00311.
OpenUrl CrossRef PubMed
↵
Desnos-Ollivier M, Patel S, Spaulding AR, et al. (2010) Mixed infections and In Vivo evolution in the human fungal pathogen Cryptococcus neoformans. MBio 1.
↵
Dujon B (2010) Yeast evolutionary genomics. Nature Reviews: Genetics 11: 512–524.
OpenUrl CrossRef PubMed
↵
Enaud R, Vandenborght LE, Coron N, et al. (2018) The Mycobiome: A Neglected Component in the Microbiota-Gut-Brain Axis. Microorganisms 6.
↵
Federhen S (2012) The NCBI Taxonomy database. Nucleic Acids Research 40: D136–143.
OpenUrl CrossRef PubMed Web of Science
↵
Feinstein LM, Sul WJ, Blackwood CB (2009) Assessment of bias associated with incomplete extraction of microbial DNA from soil. Applied and Environmental Microbiology 75: 5428–5433.
OpenUrl Abstract/FREE Full Text
↵
Firacative C, Roe CC, Malik R, et al. (2016) MLST and Whole-Genome-Based Population Analysis of Cryptococcus gattii VGIII Links Clinical, Veterinary and Environmental Strains, and Reveals Divergent Serotype Specific Sub-populations and Distant Ancestors. PLoS Negl Trop Dis 10: e0004861.
OpenUrl CrossRef
↵
Franzosa EA, McIver LJ, Rahnavard G, et al. (2018) Species-level functional profiling of metagenomes and metatranscriptomes. Nature Methods 15: 962–968.
OpenUrl
↵
Gonzalez E, Pitre FE, Page AP, et al. (2018) Trees, fungi and bacteria: tripartite metatranscriptomics of a root microbiome responding to soil contamination. Microbiome 6: 53.
OpenUrl
↵
Grabherr MG, Haas BJ, Yassour M, et al. (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nature Biotechnology 29: 644–652.
OpenUrl CrossRef PubMed
↵
Gupta V, Rajagopalan N, Patil M, Shivaprasad C (2014) Aspergillus and mucormycosis presenting with normal chest X-ray in an immunocompromised host. BMJ Case Rep 2014.
↵
Hagen F, Khayhan K, Theelen B, et al. (2015) Recognition of seven species in the Cryptococcus gattii/Cryptococcus neoformans species complex. Fungal Genetics and Biology 78: 16–48.
OpenUrl CrossRef PubMed
↵
Hannula SE, Morrien E, de Hollander M, et al. (2017) Shifts in rhizosphere fungal community during secondary succession following abandonment from agriculture. ISME Journal 11: 2294–2304.
OpenUrl
↵
Hawksworth DL, Lucking R (2017) Fungal Diversity Revisited: 2.2 to 3.8 Million Species. Microbiol Spectr 5.
↵
Huffnagle GB, Noverr MC (2013) The emerging world of the fungal microbiome. Trends in Microbiology 21: 334–341.
OpenUrl CrossRef PubMed
↵
Kassambara A (2017) ggpubr:“ggplot2” based publication ready plots. R package version 0.1. 6.
↵
Koutsovoulos G, Kumar S, Laetsch DR, et al. (2016) No evidence for extensive horizontal gene transfer in the genome of the tardigrade Hypsibius dujardini. Proc Natl Acad Sci U S A 113: 5053–5058.
OpenUrl Abstract/FREE Full Text
↵
Kullman B, Tamm H, Kullman K (2005) Fungal Genome Size Database. http://www.zbi.ee/fungal-genomesize/
↵
Kuske CR, Hesse CN, Challacombe JF, et al. (2015) Prospects and challenges for fungal metatranscriptomics of complex communities. Fungal Ecology 14: 133–137.
OpenUrl
↵
Kwon-Chung KJ, Bennett JE, Wickes BL, et al. (2017) The Case for Adopting the “Species Complex” Nomenclature for the Etiologic Agents of Cryptococcosis. mSphere 2.
↵
Kwon-Chung KJ, Boekhout T, Fell JW, Diaz M (2002) Proposal to conserve the name Cryptococcus gattii against C. hondurianus and C. bacillisporus (Basidiomycota, Hymenomycetes, Tremellomycetidae). Taxon 51: 804–806.
OpenUrl CrossRef
↵
Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nature Methods 9: 357–359.
OpenUrl CrossRef
↵
Li B, Dewey CN (2011) RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics 12: 323.
OpenUrl CrossRef PubMed
↵
Li X, Shen X, Chen X, et al. (2018) Detection of Potential Problematic Cytb Gene Sequences of Fishes in GenBank. Front Genet 9: 30.
OpenUrl
↵
Liao HL, Chen Y, Bruns TD, et al. (2014) Metatranscriptomic analysis of ectomycorrhizal roots reveals genes associated with Piloderma-Pinus symbiosis: improved methodologies for assessing gene expression in situ. Environmental Microbiology 16: 3730–3742.
OpenUrl CrossRef
↵
Loftus BJ, Fung E, Roncaglia P, et al. (2005) The genome of the basidiomycetous yeast and human pathogen Cryptococcus neoformans. Science 307: 1321–1324.
OpenUrl Abstract/FREE Full Text
↵
Marcelino VR, Verbruggen H (2016) Multi-marker metabarcoding of coral skeletons reveals a rich microbiome and diverse evolutionary origins of endolithic algae. Scientific Reports 6: 31508.
OpenUrl
↵
McGuire KL, Payne SG, Palmer MI, et al. (2013) Digging the New York City Skyline: soil fungal communities in green roofs and city parks. PLoS One 8: e58020.
OpenUrl CrossRef PubMed
Munoz JF, Gade L, Chow NA, et al. (2018) Genomic insights into multidrug-resistance, mating and virulence in Candida auris and related emerging species. Nature Communications 9: 5346.
OpenUrl
↵
Ngamskulrungroj P, Gilgado F, Faganello J, et al. (2009) Genetic diversity of the Cryptococcus species complex suggests that Cryptococcus gattii deserves to have varieties. PLoS One 4: e5862.
OpenUrl CrossRef PubMed
↵
Nilsson RH, Anslan S, Bahram M, et al. (2019) Mycobiome diversity: high-throughput sequencing and identification of fungi. Nature Reviews: Microbiology 17: 95–109.
OpenUrl CrossRef
↵
Nilsson RH, Kristiansson E, Ryberg M, Hallenberg N, Larsson K-H (2008) Intraspecific ITS Variability in the Kingdom Fungi as Expressed in the International Sequence Databases and Its Implications for Molecular Species Identification. Evolutionary Bioinformatics 4.
↵
Olm MR, West PT, Brooks B, et al. (2019) Genome-resolved metagenomics of eukaryotic populations during early colonization of premature infants and in hospital rooms. Microbiome 7: 26.
OpenUrl
↵
Quince C, Walker AW, Simpson JT, Loman NJ, Segata N (2017) Shotgun metagenomics, from sampling to analysis. Nature Biotechnology 35: 833–844.
OpenUrl CrossRef
↵
Rizzetto L, Giovannini G, Bromley M, et al. (2013) Strain dependent variation of immune responses to A. fumigatus: definition of pathogenic species. PLoS One 8: e56651.
OpenUrl CrossRef PubMed
↵
Salter SJ, Cox MJ, Turek EM, et al. (2014) Reagent and laboratory contamination can critically impact sequence-based microbiome analyses. BMC Biology 12: 87.
OpenUrl
↵
Schauwvlieghe A, Vonk AG, Buddingh EP, et al. (2017) Detection of azole-susceptible and azole-resistant Aspergillus coinfection by cyp51A PCR amplicon melting curve analysis. Journal of Antimicrobial Chemotherapy 72: 3047–3050.
OpenUrl
↵
Schmidt P-A, Bálint M, Greshake B, et al. (2013) Illumina metabarcoding of a soil fungal community. Soil Biology and Biochemistry 65: 128–132.
OpenUrl
↵
Schmieder R, Edwards R (2011) Quality control and preprocessing of metagenomic datasets. Bioinformatics 27: 863–864.
OpenUrl CrossRef PubMed Web of Science
↵
Sczyrba A, Hofmann P, Belmann P, et al. (2017) Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software. Nature Methods 14: 1063–1071.
OpenUrl
↵
Seed PC (2014) The human mycobiome. Cold Spring Harb Perspect Med 5: a019810.
OpenUrl CrossRef
↵
Seki M, Ohno H, Gotoh K, et al. (2014) Allergic bronchopulmonary mycosis due to co-infection with Aspergillus fumigatus and Schizophyllum commune. IDCases 1: 5–8.
OpenUrl CrossRef PubMed
↵
Shakya M, Quince C, Campbell JH, et al. (2013) Comparative metagenomic and rRNA microbial diversity characterization using archaeal and bacterial synthetic communities. Environmental Microbiology 15: 1882–1899.
OpenUrl CrossRef Web of Science
↵
Shi M, Lin XD, Tian JH, et al. (2016) Redefining the invertebrate RNA virosphere. Nature.
↵
Shi M, Neville P, Nicholson J, et al. (2017) High-Resolution Metatranscriptomics Reveals the Ecological Dynamics of Mosquito-Associated RNA Viruses in Western Australia. Journal of Virology 91.
↵
Soll DR, Staebell M, Langtimm C, et al. (1988) Multiple Candida strains in the course of a single systemic infection. Journal of Clinical Microbiology 26: 1448–1459.
OpenUrl Abstract/FREE Full Text
↵
Strong MJ, Xu G, Morici L, et al. (2014) Microbial contamination in next generation sequencing: implications for sequence-based analysis of clinical samples. PLoS Pathog 10: e1004437.
OpenUrl CrossRef PubMed
↵
Strope PK, Skelly DA, Kozmin SG, et al. (2015) The 100-genomes strains, an S. cerevisiae resource that illuminates its natural phenotypic and genotypic variation and emergence as an opportunistic pathogen. Genome Research 25: 762–774.
OpenUrl Abstract/FREE Full Text
↵
Tati S, Davidow P, McCall A, et al. (2016) Candida glabrata Binding to Candida albicans Hyphae Enables Its Development in Oropharyngeal Candidiasis. PLoS Pathog 12: e1005522.
OpenUrl CrossRef
↵
Tedersoo L, Anslan S, Bahram M, et al. (2015) Shotgun metagenomes and multiple primer pair-barcode combinations of amplicons reveal biases in metabarcoding analyses of fungi. MycoKeys 10: 1–43.
OpenUrl CrossRef
↵
Wille M, Eden JS, Shi M, et al. (2018) Virus-virus interactions and host ecology are associated with RNA virome structure in wild birds. Molecular Ecology 27: 5263–5278.
OpenUrl
↵
Yahr R, Schoch CL, Dentinger BT (2016) Scaling up discovery of hidden diversity in fungi: impacts of barcoding approaches. Philos Trans R Soc Lond B Biol Sci 371.
↵
Yamamoto N, Dannemiller KC, Bibby K, Peccia J (2014) Identification accuracy and diversity reproducibility associated with internal transcribed spacer-based fungal taxonomic library preparation. Environmental Microbiology 16: 2764–2776.
OpenUrl
↵
Zhang YZ, Shi M, Holmes EC (2018) Using Metagenomics to Characterize an Expanding Virosphere. Cell 172: 1168–1172.
OpenUrl CrossRef

View the discussion thread.

Posted March 21, 2019.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Molecular Biology

Subject Areas

All Articles

Animal Behavior and Cognition (5215)
Biochemistry (11753)
Bioengineering (8752)
Bioinformatics (29201)
Biophysics (14974)
Cancer Biology (12100)
Cell Biology (17413)
Clinical Trials (138)
Developmental Biology (9422)
Ecology (14182)
Epidemiology (2067)
Evolutionary Biology (18309)
Genetics (12245)
Genomics (16804)
Immunology (11869)
Microbiology (28098)
Molecular Biology (11596)
Neuroscience (60975)
Paleontology (451)
Pathology (1871)
Pharmacology and Toxicology (3238)
Physiology (4959)
Plant Biology (10427)
Scientific Communication and Education (1683)
Synthetic Biology (2886)
Systems Biology (7340)
Zoology (1651)

[1] ↵
Balasundaram SV, Engh IB, Skrede I, Kauserud H (2015) How many DNA markers are needed to reveal cryptic fungal species? Fungal Biol 119: 940–945.
OpenUrl

[2] ↵
Banchi E, Ametrano CG, Stankovic D, et al. (2018) DNA metabarcoding uncovers fungal diversity of mixed airborne samples in Italy. PLoS One 13: e0194489.
OpenUrl

[3] ↵
Bandara H, Panduwawala CP, Samaranayake LP (2019) Biodiversity of the human oral mycobiome in health and disease. Oral Dis 25: 363–371.
OpenUrl

[4] ↵
Bashiardes S, Zilberman-Schapira G, Elinav E (2016) Use of Metatranscriptomics in Microbiome Research. Bioinform Biol Insights 10: 19–25.
OpenUrl CrossRef

[5] ↵
Brown GD, Denning DW, Gow NA, et al. (2012) Hidden killers: human fungal infections. Sci Transl Med 4: 165rv113.
OpenUrl CrossRef

[6] ↵
Carini P, Marsden PJ, Leff JW, et al. (2016) Relic DNA is abundant in soil and obscures estimates of soil microbial diversity. Nat Microbiol 2: 16242.
OpenUrl

[7] ↵
Clausen P, Aarestrup FM, Lund O (2018) Rapid and precise alignment of raw reads against redundant databases with KMA. BMC Bioinformatics 19: 307.
OpenUrl

[8] ↵
Desnos-Ollivier M, Patel S, Raoux-Barbot D, et al. (2015) Cryptococcosis Serotypes Impact Outcome and Provide Evidence of Cryptococcus neoformans Speciation. MBio 6: e00311.
OpenUrl CrossRef PubMed

[9] ↵
Desnos-Ollivier M, Patel S, Spaulding AR, et al. (2010) Mixed infections and In Vivo evolution in the human fungal pathogen Cryptococcus neoformans. MBio 1.

[10] ↵
Dujon B (2010) Yeast evolutionary genomics. Nature Reviews: Genetics 11: 512–524.
OpenUrl CrossRef PubMed

[11] ↵
Enaud R, Vandenborght LE, Coron N, et al. (2018) The Mycobiome: A Neglected Component in the Microbiota-Gut-Brain Axis. Microorganisms 6.

[12] ↵
Federhen S (2012) The NCBI Taxonomy database. Nucleic Acids Research 40: D136–143.
OpenUrl CrossRef PubMed Web of Science

[13] ↵
Feinstein LM, Sul WJ, Blackwood CB (2009) Assessment of bias associated with incomplete extraction of microbial DNA from soil. Applied and Environmental Microbiology 75: 5428–5433.
OpenUrl Abstract/FREE Full Text

[14] ↵
Firacative C, Roe CC, Malik R, et al. (2016) MLST and Whole-Genome-Based Population Analysis of Cryptococcus gattii VGIII Links Clinical, Veterinary and Environmental Strains, and Reveals Divergent Serotype Specific Sub-populations and Distant Ancestors. PLoS Negl Trop Dis 10: e0004861.
OpenUrl CrossRef

[15] ↵
Franzosa EA, McIver LJ, Rahnavard G, et al. (2018) Species-level functional profiling of metagenomes and metatranscriptomes. Nature Methods 15: 962–968.
OpenUrl

[16] ↵
Gonzalez E, Pitre FE, Page AP, et al. (2018) Trees, fungi and bacteria: tripartite metatranscriptomics of a root microbiome responding to soil contamination. Microbiome 6: 53.
OpenUrl

[17] ↵
Grabherr MG, Haas BJ, Yassour M, et al. (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nature Biotechnology 29: 644–652.
OpenUrl CrossRef PubMed

[18] ↵
Gupta V, Rajagopalan N, Patil M, Shivaprasad C (2014) Aspergillus and mucormycosis presenting with normal chest X-ray in an immunocompromised host. BMJ Case Rep 2014.

[19] ↵
Hagen F, Khayhan K, Theelen B, et al. (2015) Recognition of seven species in the Cryptococcus gattii/Cryptococcus neoformans species complex. Fungal Genetics and Biology 78: 16–48.
OpenUrl CrossRef PubMed

[20] ↵
Hannula SE, Morrien E, de Hollander M, et al. (2017) Shifts in rhizosphere fungal community during secondary succession following abandonment from agriculture. ISME Journal 11: 2294–2304.
OpenUrl

[21] ↵
Hawksworth DL, Lucking R (2017) Fungal Diversity Revisited: 2.2 to 3.8 Million Species. Microbiol Spectr 5.

[22] ↵
Huffnagle GB, Noverr MC (2013) The emerging world of the fungal microbiome. Trends in Microbiology 21: 334–341.
OpenUrl CrossRef PubMed

[23] ↵
Kassambara A (2017) ggpubr:“ggplot2” based publication ready plots. R package version 0.1. 6.

[24] ↵
Koutsovoulos G, Kumar S, Laetsch DR, et al. (2016) No evidence for extensive horizontal gene transfer in the genome of the tardigrade Hypsibius dujardini. Proc Natl Acad Sci U S A 113: 5053–5058.
OpenUrl Abstract/FREE Full Text

[25] ↵
Kullman B, Tamm H, Kullman K (2005) Fungal Genome Size Database. http://www.zbi.ee/fungal-genomesize/

[26] ↵
Kuske CR, Hesse CN, Challacombe JF, et al. (2015) Prospects and challenges for fungal metatranscriptomics of complex communities. Fungal Ecology 14: 133–137.
OpenUrl

[27] ↵
Kwon-Chung KJ, Bennett JE, Wickes BL, et al. (2017) The Case for Adopting the “Species Complex” Nomenclature for the Etiologic Agents of Cryptococcosis. mSphere 2.

[28] ↵
Kwon-Chung KJ, Boekhout T, Fell JW, Diaz M (2002) Proposal to conserve the name Cryptococcus gattii against C. hondurianus and C. bacillisporus (Basidiomycota, Hymenomycetes, Tremellomycetidae). Taxon 51: 804–806.
OpenUrl CrossRef

[29] ↵
Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nature Methods 9: 357–359.
OpenUrl CrossRef

[30] ↵
Li B, Dewey CN (2011) RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics 12: 323.
OpenUrl CrossRef PubMed

[31] ↵
Li X, Shen X, Chen X, et al. (2018) Detection of Potential Problematic Cytb Gene Sequences of Fishes in GenBank. Front Genet 9: 30.
OpenUrl

[32] ↵
Liao HL, Chen Y, Bruns TD, et al. (2014) Metatranscriptomic analysis of ectomycorrhizal roots reveals genes associated with Piloderma-Pinus symbiosis: improved methodologies for assessing gene expression in situ. Environmental Microbiology 16: 3730–3742.
OpenUrl CrossRef

[33] ↵
Loftus BJ, Fung E, Roncaglia P, et al. (2005) The genome of the basidiomycetous yeast and human pathogen Cryptococcus neoformans. Science 307: 1321–1324.
OpenUrl Abstract/FREE Full Text

[34] ↵
Marcelino VR, Verbruggen H (2016) Multi-marker metabarcoding of coral skeletons reveals a rich microbiome and diverse evolutionary origins of endolithic algae. Scientific Reports 6: 31508.
OpenUrl

[35] ↵
McGuire KL, Payne SG, Palmer MI, et al. (2013) Digging the New York City Skyline: soil fungal communities in green roofs and city parks. PLoS One 8: e58020.
OpenUrl CrossRef PubMed

[36] Munoz JF, Gade L, Chow NA, et al. (2018) Genomic insights into multidrug-resistance, mating and virulence in Candida auris and related emerging species. Nature Communications 9: 5346.
OpenUrl

[37] ↵
Ngamskulrungroj P, Gilgado F, Faganello J, et al. (2009) Genetic diversity of the Cryptococcus species complex suggests that Cryptococcus gattii deserves to have varieties. PLoS One 4: e5862.
OpenUrl CrossRef PubMed

[38] ↵
Nilsson RH, Anslan S, Bahram M, et al. (2019) Mycobiome diversity: high-throughput sequencing and identification of fungi. Nature Reviews: Microbiology 17: 95–109.
OpenUrl CrossRef

[39] ↵
Nilsson RH, Kristiansson E, Ryberg M, Hallenberg N, Larsson K-H (2008) Intraspecific ITS Variability in the Kingdom Fungi as Expressed in the International Sequence Databases and Its Implications for Molecular Species Identification. Evolutionary Bioinformatics 4.

[40] ↵
Olm MR, West PT, Brooks B, et al. (2019) Genome-resolved metagenomics of eukaryotic populations during early colonization of premature infants and in hospital rooms. Microbiome 7: 26.
OpenUrl

[41] ↵
Quince C, Walker AW, Simpson JT, Loman NJ, Segata N (2017) Shotgun metagenomics, from sampling to analysis. Nature Biotechnology 35: 833–844.
OpenUrl CrossRef

[42] ↵
Rizzetto L, Giovannini G, Bromley M, et al. (2013) Strain dependent variation of immune responses to A. fumigatus: definition of pathogenic species. PLoS One 8: e56651.
OpenUrl CrossRef PubMed

[43] ↵
Salter SJ, Cox MJ, Turek EM, et al. (2014) Reagent and laboratory contamination can critically impact sequence-based microbiome analyses. BMC Biology 12: 87.
OpenUrl

[44] ↵
Schauwvlieghe A, Vonk AG, Buddingh EP, et al. (2017) Detection of azole-susceptible and azole-resistant Aspergillus coinfection by cyp51A PCR amplicon melting curve analysis. Journal of Antimicrobial Chemotherapy 72: 3047–3050.
OpenUrl

[45] ↵
Schmidt P-A, Bálint M, Greshake B, et al. (2013) Illumina metabarcoding of a soil fungal community. Soil Biology and Biochemistry 65: 128–132.
OpenUrl

[46] ↵
Schmieder R, Edwards R (2011) Quality control and preprocessing of metagenomic datasets. Bioinformatics 27: 863–864.
OpenUrl CrossRef PubMed Web of Science

[47] ↵
Sczyrba A, Hofmann P, Belmann P, et al. (2017) Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software. Nature Methods 14: 1063–1071.
OpenUrl

[48] ↵
Seed PC (2014) The human mycobiome. Cold Spring Harb Perspect Med 5: a019810.
OpenUrl CrossRef

[49] ↵
Seki M, Ohno H, Gotoh K, et al. (2014) Allergic bronchopulmonary mycosis due to co-infection with Aspergillus fumigatus and Schizophyllum commune. IDCases 1: 5–8.
OpenUrl CrossRef PubMed

[50] ↵
Shakya M, Quince C, Campbell JH, et al. (2013) Comparative metagenomic and rRNA microbial diversity characterization using archaeal and bacterial synthetic communities. Environmental Microbiology 15: 1882–1899.
OpenUrl CrossRef Web of Science

[51] ↵
Shi M, Lin XD, Tian JH, et al. (2016) Redefining the invertebrate RNA virosphere. Nature.

[52] ↵
Shi M, Neville P, Nicholson J, et al. (2017) High-Resolution Metatranscriptomics Reveals the Ecological Dynamics of Mosquito-Associated RNA Viruses in Western Australia. Journal of Virology 91.

[53] ↵
Soll DR, Staebell M, Langtimm C, et al. (1988) Multiple Candida strains in the course of a single systemic infection. Journal of Clinical Microbiology 26: 1448–1459.
OpenUrl Abstract/FREE Full Text

[54] ↵
Strong MJ, Xu G, Morici L, et al. (2014) Microbial contamination in next generation sequencing: implications for sequence-based analysis of clinical samples. PLoS Pathog 10: e1004437.
OpenUrl CrossRef PubMed

[55] ↵
Strope PK, Skelly DA, Kozmin SG, et al. (2015) The 100-genomes strains, an S. cerevisiae resource that illuminates its natural phenotypic and genotypic variation and emergence as an opportunistic pathogen. Genome Research 25: 762–774.
OpenUrl Abstract/FREE Full Text

[56] ↵
Tati S, Davidow P, McCall A, et al. (2016) Candida glabrata Binding to Candida albicans Hyphae Enables Its Development in Oropharyngeal Candidiasis. PLoS Pathog 12: e1005522.
OpenUrl CrossRef

[57] ↵
Tedersoo L, Anslan S, Bahram M, et al. (2015) Shotgun metagenomes and multiple primer pair-barcode combinations of amplicons reveal biases in metabarcoding analyses of fungi. MycoKeys 10: 1–43.
OpenUrl CrossRef

[58] ↵
Wille M, Eden JS, Shi M, et al. (2018) Virus-virus interactions and host ecology are associated with RNA virome structure in wild birds. Molecular Ecology 27: 5263–5278.
OpenUrl

[59] ↵
Yahr R, Schoch CL, Dentinger BT (2016) Scaling up discovery of hidden diversity in fungi: impacts of barcoding approaches. Philos Trans R Soc Lond B Biol Sci 371.

[60] ↵
Yamamoto N, Dannemiller KC, Bibby K, Peccia J (2014) Identification accuracy and diversity reproducibility associated with internal transcribed spacer-based fungal taxonomic library preparation. Environmental Microbiology 16: 2764–2776.
OpenUrl

[61] ↵
Zhang YZ, Shi M, Holmes EC (2018) Using Metagenomics to Characterize an Expanding Virosphere. Cell 172: 1168–1172.
OpenUrl CrossRef