Abstract
Cryptococcus neoformans is the causative agent of cryptococcosis, a disease with poor patient outcomes, accounting for approximately 180,000 deaths each year. Patient outcomes may be impacted by the underlying genetics of the infecting isolate, however, our current understanding of how genetic diversity contributes to clinical outcomes is limited. Here, we leverage clinical, in vitro growth and genomic data for 284 C. neoformans isolates to identify clinically relevant pathogen variants within a population of clinical isolates from patients with HIV-associated cryptococcosis in Malawi. Through a genome-wide association study (GWAS) approach, we identify variants associated with fungal burden and growth rate. We also find both small and large-scale variation, including aneuploidy, associated with alternate growth phenotypes, which may impact the course of infection. Genes impacted by these variants are involved in transcriptional regulation, signal transduction, glycolysis, sugar transport, and glycosylation. When combined with clinical data, we show that growth within the CNS is reliant upon glycolysis in an animal model, and likely impacts patient mortality, as CNS burden modulates patient outcome. Additionally, we find genes with roles in sugar transport are under selection in the majority of these clinical isolates. Further, we demonstrate that two hypothetical proteins identified by GWAS impact virulence in animal models. Our approach illustrates links between genetic variation and clinically relevant phenotypes, shedding light on survival mechanisms within the CNS and pathways involved in this persistence.
Importance Infection outcomes for cryptococcosis, most commonly caused by C. neoformans, are influenced by host immune responses, as well as host and pathogen genetics. Infecting yeast isolates are genetically diverse, however, we lack a deep understanding of how this diversity impacts patient outcomes. To better understand both clinical isolate diversity and how diversity contributes to infection outcome, we utilize a large collection of clinical C. neoformans samples, isolated from patients enrolled in a clinical trial across 3 hospitals in Malawi. By combining whole-genome sequence data, clinical data, and in vitro growth data, we utilize genome-wide association approaches to examine the genetic basis of virulence. Genes with significant associations show virulence phenotypes in both murine and rabbit models, demonstrating that our approach successfully identifies links between genetic variation and biologically significant phenotypes.