Integrating Host Response and Unbiased Microbe Detection for Lower Respiratory Tract Infection Diagnosis in Critically Ill Adults

Charles Langelier; Katrina L Kalantar; Farzad Moazed; Michael R. Wilson; Emily Crawford; Thomas Deiss; Annika Belzer; Samaneh Bolourchi; Saharai Caldera; Monica Fung; Alejandra Jauregui; Katherine Malcolm; Amy Lyden; Lillian Khan; Kathryn Vessel; Jenai Quan; Matt Zinter; Charles Y. Chiu; Eric D. Chow; Jenny Wilson; Steve Miller; Michael A. Matthay; Katherine S. Pollard; Stephanie Christenson; Carolyn S. Calfee; Joseph L. DeRisi

doi:10.1101/341149

ABSTRACT

Lower respiratory tract infections (LRTI) lead to more deaths each year than any other infectious disease category(1). Despite this, etiologic LRTI pathogens are infrequently identified due to limitations of existing microbiologic tests(2). In critically ill patients, non-infectious inflammatory syndromes resembling LRTI further complicate diagnosis. To address the need for improved LRTI diagnostics, we performed metagenomic next-generation sequencing (mNGS) on tracheal aspirates from 92 adults with acute respiratory failure and simultaneously assessed pathogens, the lung microbiome and the host transcriptome. To differentiate pathogens from respiratory commensals, we developed rules-based and logistic regression models (RBM, LRM) in a derivation cohort of 20 patients with LRTI or non-infectious acute respiratory illnesses. When tested in an independent validation cohort of 24 patients, both models achieved accuracies of 95.5%. We next developed pathogen, microbiome diversity, and host gene expression metrics to identify LRTI-positive patients and differentiate them from critically ill controls with non-infectious acute respiratory illnesses. When tested in the validation cohort, the pathogen metric performed with an AUC of 0.96 (95% CI = 0.86 - 1.00), the diversity metric with an AUC of 0.80 (95% CI = 0.63 – 0.98), and the host transcriptional classifier with an AUC of 0.91 (95% CI = 0.80 – 1.00). Combining all three achieved an AUC of 0.99 (95% CI = 0.97 – 1.00) and negative predictive value of 100%. This study suggests that a single streamlined protocol offering an integrated genomic portrait of pathogen, microbiome and host transcriptome may hold promise as a novel tool for LRTI diagnosis.

SIGNIFICANCE STATEMENT Lower respiratory tract infections (LRTI) are the leading cause of infectious disease-related death worldwide yet remain challenging to diagnose because of limitations in existing microbiologic tests. In critically ill patients, non-infectious respiratory syndromes that resemble LRTI further complicate diagnosis and confound targeted treatment. To address this, we developed a novel metagenomic sequencing-based approach that simultaneously interrogates three core elements of acute airway infections: the pathogen, lung microbiome and host response. We studied this approach in a prospective cohort of critically ill patients with acute respiratory failure and found that combining pathogen, microbiome and host gene expression metrics achieved accurate LRTI diagnosis and identified etiologic pathogens in patients with clinically identified infections but otherwise negative testing.

Funding NHLBI K12HL119997 (Langelier C), NHLBI K23HL123778 (Christensen S), NIAID P01AI091575 and the Chan Zuckerberg Biohub (DeRisi JL), NHLBI K23 HL136844 (Moazed F), NHLBI R01HL110969, K24HL133390, R35HL140026 (Calfee C), Gladstone Institutes (Pollard KS).

INTRODUCTION

Lower respiratory tract infections (LRTI) are a leading cause of mortality worldwide(1, 3, 4). Early and accurate determination of acute respiratory disease etiology is crucial for implementing effective pathogen-targeted therapies but is often not possible due to the limitations of current microbiologic tests in terms of sensitivity, speed, and spectrum of available assay targets(2). For instance, even with the best available clinical diagnostics, a contributory pathogen can be detected in only 38% of adults with community acquired pneumonia, due to the low sensitivity and time requirements of culture, and the limited number of microbes detectable by serologic and polymerase chain reaction (PCR) assays(2, 5).

In the absence of a definitive microbiologic diagnosis, clinicians may presume symptoms are due to a non-infectious inflammatory condition and initiate empiric corticosteroids, which can exacerbate an occult infection(6). Furthermore, even with negative microbiologic testing, providers often continue empiric antibiotics due to concerns of falsely negative results, a practice that drives emergence of antibiotic resistance and increases risk of Clostridium difficile infection(7). In the intensive care unit (ICU), LRTI diagnosis is particularly complex due to a high prevalence of non-infectious inflammatory conditions with overlapping clinical features(8) and a patient demographic that includes severely immunocompromised individuals who may exhibit atypical presentations of pulmonary infections.

Advancements in genome sequencing hold promise for overcoming these diagnostic challenges by affording culture-independent assessment of microbial genomes from microliter volumes of clinical samples(9, 10). Recent work has highlighted the utility of metagenomic next-generation sequencing (mNGS) for rapid and actionable diagnosis of complicated infections(6, 11–13). While these results are encouraging, most mNGS computational pipelines have been developed for analysis of sterile fluids or cultured bacterial isolates and have limited capacity to identify pathogens amidst the complex background of commensal microbiota present in respiratory specimens^13–15.

Host transcriptional profiling from peripheral blood has emerged as a promising alternative to pathogen-based diagnostics that can distinguish viral from bacterial LRTIs as well as differentiate between patients with acute respiratory infections versus those with non-infectious illnesses(5, 16, 17). This approach, while highly promising, has not been well studied in ICU patients with respiratory failure or in severely immunocompromised subjects. Furthermore, host transcriptional profiling has not yet been coupled with simultaneous detection of pulmonary pathogens(5, 18), which could improve diagnostic accuracy and more precisely inform optimal antimicrobial treatment.

mNGS can extend both host gene expression assays and current microbe-based diagnostics by simultaneously detecting pathogens, the lung microbiome, and transcriptional biomarkers of the host’s immune response. Here we address the need for better LRTI diagnostics by developing an mNGS-based method that integrates host response and unbiased microbe detection. We then evaluate the performance of this approach in a prospective cohort of critically ill patients with acute respiratory failure.

RESULTS

We prospectively enrolled 92 adults admitted to the ICU with acute respiratory failure and collected tracheal aspirate (TA) samples within 72 hours of intubation (Table 1). Patients underwent testing with clinician-ordered standard of care microbiologic diagnostics at the University of California San Francisco Moffitt-Long Hospital, a tertiary care referral center. Subjects with LRTI were identified by two-physician adjudication using United States Centers for Disease Control/National Healthcare Safety Network (CDC/NHSN) surveillance case definitions and retrospective electronic medical record review, with blinding to mNGS results (Table S2A)(19). Using this approach, patients were assigned to one of four groups: 1) LRTI defined by both clinical and microbiologic criteria (LRTI^+C+M, n=26); 2) no evidence of LRTI and a clear alternative explanation for acute respiratory failure (no-LRTI, n=18), 3) LRTI defined by clinical criteria alone with negative conventional microbiologic testing (LRTI^+C, n=34) and 4) respiratory failure due to unclear cause, infectious or non-infectious (unk-LRTI, n=14).

View this table:

Table 1. Demographics and clinical characteristics of study cohort.

Legend: LRTI^+C+M = subjects who met both clinical and microbiologic criteria for LRTI. no-LRTI = subjects with a non-infectious etiology of acute respiratory failure. SIRS = systemic inflammatory response syndrome, defined as two or more abnormalities in white blood cell count (>12,000 cells/μL or <4,000 cells/μL), temperature (>38°C or < 36°C), heart rate (>90 beats per minute) or respiratory rate (> 20 breaths per minute). APACHEIII score predicts mortality and disease severity for critically ill patients⁶⁴. Pneumonia severity index score estimates mortality for adult patients with community-acquired pneumonia⁶⁵. COPD = chronic obstructive pulmonary disease. ^♦Chi-squared test. *Wilcoxon rank sum test.

From extracted nucleic acid samples, we performed both metagenomic shotgun DNA sequencing (DNA-Seq) as well as RNA sequencing (RNA-Seq). We first developed computational algorithms to sift respiratory pathogens from background commensal flora in an effort to enhance detection of LRTI etiology. To differentiate patients with LRTI from those with non-infectious critical respiratory illnesses, we next developed metrics of LRTI probability based on pathogen, lung microbiome diversity, and host gene expression (Fig 1). To assess assay performance, we focused on the most unambiguously LRTI positive and negative subjects (LRTI^+C+M and no-LRTI) by randomly dividing them into independent derivation (n=20, used for model training) and validation cohorts (n=24, used for model testing). Each metric (pathogen, microbiome, host) was evaluated independently and then in combination.

Figure 1. Study overview and novel analysis workflow.

Patients with acute respiratory failure were enrolled within 72 hours of ICU admission. TA samples were collected and underwent both RNA sequencing (RNA-Seq) and shotgun DNA sequencing (DNA-Seq). Retrospective clinical adjudication blinded to mNGS results identified patients with LRTI defined by clinical and microbiologic criteria (LRTI^+C+M); LRTI defined by clinical criteria only (LRTI^+C); patients with non-infectious reasons for acute respiratory failure (no-LRTI); and respiratory failure due to unknown cause (unk-LRTI). The LRTI^+C+M and no-LRTI groups were divided into derivation and validation cohorts. To differentiate pathogens from respiratory commensals, we developed rules-based and logistic regression models in the derivation cohort which were tested in the validation cohort. Subsequently, microbe and host gene expression metrics were combined in a logistic regression model to determine the probability of LRTI versus non-infectious acute respiratory illness.

Pathogen Detection

While many NGS platforms utilize only one nucleic acid type, we combined both RNA and DNA sequencing. This approach allowed for simultaneous host transcriptional profiling, permitted detection of RNA viruses, and enriched for actively transcribing microbes (versus latent or nonviable taxa). In addition, requiring concordant detection of microbes across both nucleic acid types reduced spurious alignments derived from reagent contaminants intrinsic to the library preparations of each nucleic-acid type(20). From each TA sample, we generated a mean of 26.3 million paired-end sequencing reads of which the median fraction of microbial reads was 0.04% (IQR 0.01% - 0.16%). Raw reads were analyzed using a rapid computational pipeline that aligns and classifies microbial taxa by nucleotide and peptide translation using the National Center for Biotechnology Information (NCBI) NT and NR databases, respectively(20, 21). RNA-Seq yielded a greater abundance of sequences as compared to DNA-Seq for 78% of identified microbes, with a median of 2.2 times more reads per microbe.

We and others have previously developed NGS methodologies for “sterile site” clinical fluids such as cerebrospinal fluid (CSF)(14, 21, 22). The lung however is not a sterile environment and in fact harbors microbial communities during states of both health and disease(23–26). Asymptomatic carriage of potentially pathogenic organisms is common(27, 28), and only in a subset of cases do these microbes overtake airway microbial communities and precipitate LRTI(29). As such, distinguishing legitimate pathogens from commensal or colonizing microbiota is a central challenge for LRTI diagnostics and adds complexity to the interpretation of metagenomic sequencing data. To this point, while we detected all 38 pathogens identified from clinician-ordered microbiologic tests in the 26 LRTI^+C+M patients using mNGS (Table S3), a tenfold greater number of airway commensals were also identified.

Thus, to distinguish probable pathogens from airway commensals, we developed two complementary algorithms: 1) a rules-based model (RBM) optimized for detecting well-established respiratory pathogens, and 2) a more flexible logistic regression model (LRM) that also permitted novel pathogen detection (Figure 1). The goal of both models was to correctly identify pathogens amidst abundant and heterogeneous populations of commensals. Microbes identified by clinician-ordered diagnostics plus all viruses with established respiratory pathogenicity in the LRTI^+C+M group were categorized as pathogens (n=12 in derivation cohort and n=26 in validation cohort, Table S1). Any additional microbes identified by mNGS were considered commensals (n = 155 in derivation cohort, n = 174 in validation cohort). We accepted that this “practical” gold standard would provide an attenuated estimate of performance due to the sensitivity limitations of microbial culture in the setting of antibiotic pre-administration(2).

In the RBM, respiratory microbes from each patient were assigned an abundance score based on the sum of log(RNA-Seq) and log(DNA-Seq) genus rpm (Table S3). After ranking microbes by this abundance score, the greatest score difference between sequentially ranked microbes was identified and used to distinguish the group of highest-scoring microbes within each patient (Fig 2A, Fig S1). These high scoring microbes plus all RNA viruses detected at a conservative threshold of > 0.1 rpm were indexed against an a priori developed table of established lower respiratory pathogens derived from landmark surveillance studies and clinical guidelines (Table S2B), and if present were identified as putative pathogens by the RBM(2, 30–32).

Figure 2. Workflow for distinguishing LRTI pathogens from commensal respiratory microbiota using an algorithmic approach.

A) Projection of microbial relative abundance in log rpm (reads per million reads sequenced) by RNA sequencing (RNA-Seq, X-axis) versus DNA sequencing (DNA-Seq, Y-axis) for representative cases. In the LRTI^+C+M group, pathogens identified by standard clinical microbiology (filled shapes) had higher overall relative abundance as compared to other taxa detected by sequencing (open shapes). The largest score differential between ranked microbes (max Δrpm) was used as a threshold to identify high-scoring taxa, distinct from the other microbes based on abundance. Red indicates taxa represented in the reference list of established LRTI pathogens. B) ROC curve demonstrating logistic regression model (LRM) performance for detecting pathogens versus commensal microbiota in both the derivation and validation cohorts. The grey ROC curve and shaded region indicate results from 1000 rounds of training and testing on randomized sets the derivation cohort (Supplemental Methods). The blue and green lines indicate predictions using leave-one-patient-out cross-validation (LOPO-CV) on the derivation and validation on the validation cohort, respectively. C) Microbes predicted by the LRM to represent putative pathogens. The X-axis represents combined RNA-Seq and DNA-Seq rpm and the Y-axis indicates pathogen probability. The dashed line reflects the optimized probability threshold for pathogen assignment. Legend: Red filled circles: microbes predicted by LRM to represent putative LRTI pathogens that were also identified by conventional microbiologic tests. Blue filled circles: microbes predicted to represent putative LRTI pathogens by LRM only. Blue open circles: microbes identified by mNGS but not predicted by the LRM to represent putative pathogens. Red open circles: microbes identified using NGS and by standard microbiologic testing but not predicted to be putative pathogens. Dark red outlined circles: microbes detected as part of a polymicrobial culture.

The RBM achieved an accuracy for pathogen detection of 98.8% and 95.5% in the derivation and validation cohorts, respectively (Table S3). In subjects whose respiratory cultures grew three or more different bacteria, mNGS was able to detect each of the species. In most cases however, their abundance differed by several hundred-fold, which confounded detection of the lower abundance taxa (Table S3). Given the unclear significance of single species in such polymicrobial cases with respect to pathogenicity(33), we performed a secondary analysis in which only the most abundant microbe was considered a pathogen, and this approach yielded an accuracy of 98.4%.

While the RBM performed well for identifying microbes with established pulmonary pathogenicity, we recognized the need to also detect novel or atypical species. We thus employed machine learning to distinguish respiratory pathogens from commensals using a logistic regression model (LRM) trained on microbes detected in the derivation cohort patients (n=20) using the predictor variables of: RNA-Seq rpm, DNA-Seq rpm, rank by RNA-Seq rpm, established LRTI pathogen (yes/no), and virus (yes/no). These features were selected to preferentially favor highly abundant organisms with established pathogenicity in the lung, but still permit detection of uncommon taxa that could represent putative pathogens.

To evaluate LRM performance in the derivation cohort, we performed leave-one-patient-out cross validation, in which all microbes from a single patient were held out in each round of cross-validation. This yielded an AUC = 0.90 (95% CI = 0.76 – 0.99). A final model was trained on all microbes from derivation cohort patients, and this achieved an AUC = 0.91 (95% CI = 0.83– 0.97) for pathogen identification in the validation cohort (Fig 2B, Table S3). At an optimized probability threshold of 0.36 (Methods), this translated to an accuracy of 96.4% and 95.5% in the derivation and validation cohorts, respectively. As with the RBM, LRM performance suffered in polymicrobial culture cases with species that differed by several magnitudes in abundance when assessed by mNGS. As such, when only the most abundant microbe identified by clinical microbiologic diagnostics per LRTI^+C+M patient was considered as the etiologic pathogen, the AUC increased to 0.997 (95% CI = 0.99 – 1.00) in the validation cohort.

Combining the RBM and LRM identified more putative pathogens than either model alone, and revealed a potential LRTI etiology in 62% (n=21) of the LRTI^+C patients with clinically adjudicated LRTI but negative microbiologic testing (Figs 3, S2, Table S3). Compared to clinician-ordered diagnostics, this permitted a microbiologic diagnosis in a greater number of LRTI-positive subjects (78% vs 43%, p < 1.00 ×10⁻⁴ by McNemar’s test, Fig 3). Putative new pathogens in a representative subset of the LRTI^+C group patients (n=11, 32%) were orthogonally confirmed by clinical multiplex respiratory virus PCR, influenza C PCR(34) or by 16S bacterial rRNA gene sequencing (Table S3).

Figure 3. Distribution of respiratory pathogens identified in patients using clinician-ordered diagnostics versus mNGS.

For each microbe, the number of subjects from whom that microbe was identified is plotted on the X-axis. Red bars indicate microbes detected by clinician-ordered diagnostics and also predicted as pathogens by either the rules based or logistic regression models. All microbes detected by clinician-ordered diagnostics were detected by mNGS, however pink bars indicate microbes misclassified as negative by either the rules-based or logistic regression models. All microbes identified by clinician-ordered diagnostics and misclassified by either the rules-based or logistic regression models (pink bars) were found in polymicrobial cultures. Additional detail on which model identified each microbe can be found in Figure S2.

Putative pathogens identified in the un-kLRTI group (n=6, 42%) may have represented atypically presenting respiratory infections or incidental carriage in the respiratory tract (Fig S2, Table S3). Microbes identified in the no-LRTI group (n=3, 17%) were present at lower abundance compared to microbes in LRTI+C+M (p < 0.01 by Wilcoxon rank sum), LRTI^+C (p < 0.01) and un-kLRTI subjects (p = 0.02), and included contextual pathogens such as S. pneumoniae and H. influenzae that colonize the airways of 20-50% of healthy individuals(33, 35, 36). Together, these findings highlighted the reality of asymptotic carriage of potentially pathogenic species, emphasizing the need to contextualize microbial detection with respect to other key elements of an airway infection, in particular the lung microbiome and the host’s immune response(27, 37). We thus undertook further analytical development to predict LRTI status by calculating combined metrics based on pathogen, microbiome and host transcriptional response.

LRTI Prediction Based on Pathogen

We recognized that the highest per-patient LRM pathogen versus commensal probability value differed significantly between LRTI^+C+M and no-LRTI subjects (p = 3.8 × 10⁻⁴ by Wilcoxon rank sum). As such, we hypothesized that this value might have utility not only for pathogen versus commensal prediction, but also for LRTI prediction in general. Testing this idea, we found that the maximum per patient LRM probability value predicted LRTI status with an AUC of 0.97 (95% CI = 0.90 - 1.00) in the derivation cohort and 0.96 (95% CI = 0.86 - 1.00) in the validation cohort (Fig S3).

LRTI Prediction Based on Lung Microbiome Diversity

Several studies have demonstrated reduced diversity of the lung microbiome in the setting of LRTI(20, 38–40). We measured intra-patient (alpha) diversity of airway genera using the Shannon Diversity Index (SDI) and found that LRTI^+C+M subjects had significantly lower SDI compared to no-LRTI subjects when assessed by both RNA-Seq (Fig 4A, p = 1.3 × 10⁻⁴) and DNA-Seq (Fig S4A, p = 8.9 × 10⁻³) (Table S4). We next examined inter-patient (beta) diversity(41) using the Bray-Curtis Index(42) and found that this also differed between LRTI^+C+M and no-LRTI subjects, with assessment by RNA-Seq again yielding a more significant difference versus DNA-Seq (p = 5 × 10⁻³ versus p = 9 × 10⁻³ by PERMANOVA, respectively, Figs 4B and S4B). We then tested whether diversity alone might predict LRTI and found that RNA-Seq SDI differentiated LRTI^+C+M from no-LRTI subjects with an AUC of 0.96 (95% CI = 0.89 - 1.00) in the derivation cohort and an AUC of 0.80 (95% CI = 0.63 – 0.96) in the validation cohort (Fig 4C). These findings suggested that genus diversity assessed by RNA-Seq was a useful, albeit imperfect, biomarker of LRTI.

Figure 4. Diversity of the transcriptionally active lung microbiome in patients with LRTI (LRTI^+C+M) versus patients with non-infectious respiratory illnesses (no-LRTI).

A) Shannon Diversity Index (SDI) of the lung microbiome assessed by RNA-Seq at the genus level differed between LRTI^+C+M and no-LRTI groups (derivation cohort). B) Beta diversity assessed by PERMANOVA on Bray-Curtis dissimilarity values also differed between LRTI^+C+M and no-LRTI groups. C) ROC curve demonstrating performance of SDI for distinguishing LRTI^+C+M from no-LRTI groups.

LRTI Prediction Based on Host Response

In the setting of critical illness, systemic inflammatory responses due to diverse physiologic processes can make true LRTI clinically indistinguishable from non-infectious respiratory failure or severe extra-pulmonary infection. Consistent with this, we found that the systemic inflammatory response syndrome (SIRS) criteria (temperature, white blood cell count, heart rate, respiratory rate) had limited utility for LRTI detection despite being widely used for infection assessment (Table S1). We thus hypothesized that transcriptional profiling, which has emerged as a promising and accurate host-based approach for assessing infection, might provide diagnostic insight in settings when clinical rules are uninformative(5, 16, 43).

As such, we examined differential gene expression between LRTI^+C+M and no-LRTI subjects in the derivation cohort to define a host transcriptional signature of LRTI in patients with critical illness. Using a false discovery rate (FDR) of < 0.05, we identified a total of 882 differentially expressed genes, 414 of which were upregulated in LRTI^+C+M subjects (Fig S5, Table S5A). Gene set enrichment analysis(44) identified upregulation of pathways related to innate immune responses, NF-κβ signaling, cytokine production, and the type I interferon response in LRTI^+C+M subjects. In comparison, gene expression pathways in the no-LRTI group were enriched for oxidative stress responses and MHC class II receptor signaling (Table S5B).

We next sought to construct an airway-specific host transcriptional classifier that could differentiate LRTI^+C+M patients from no-LRTI subjects by employing machine-learning (Methods). Elastic net regularized regression in the derivation cohort identified a 12-gene classifier that was then used to score patients based on a weighted sum of scaled expression values (Table S6A, Fig 5A, B). We found that predictive classifier genes upregulated in LRTI^+C+M patients compared to no-LRTI patients included NFAT-5, which plays a role in T cell function and inducible gene transcription during immune responses(45); ZC3H11A, which encodes a zinc-finger protein involved in the regulation of cytokine production and immune cell activation(46) and PRRC2C, which functions in RNA binding and may play a role in hematopoietic progenitor cell differentiation in response to infection(47). Genes upregulated in no-LRTI patients compared to LRTI^+C+M patients included: CD36, which encodes a macrophage phagocytic receptor involved in scavenging dying/dead cells and oxidized lipids(48, 49); BLVRB, which is involved in oxidative stress responses(50), EDF1, which contributes to the regulation of nitric oxide release in endothelial cells(51) and ENG, an integral membrane glycoprotein receptor that may modulate inflammation and angiogenesis(52).

Figure 5. Host transcriptional profiling distinguishes critically ill patients with LRTI (LRTI^+C+M) from those with non-infectious acute respiratory illness (no-LRTI).

A) Host classifier scores for all patients in the derivation and validation cohorts, each bar indicates a patient score and is colored as follows: LRTI^+C+M = red, no-LRTI = blue. Orange dotted line indicates the host classifier threshold (score = −4) that achieved 100% sensitivity in the training set and was used to classify the test set samples. B) Normalized expression levels, arranged by unsupervised hierarchical clustering, reflect over-expression (blue) or under-expression (turquoise) of classifier genes (rows) for each patient (columns). 12 genes were identified as predictive in the derivation cohort and subsequently applied to predict LRTI status in the validation cohort. Column colors above the heatmap indicate whether a patient belonged to the derivation cohort (dark grey) or validation cohort (light grey) and whether they were adjudicated to have LRTI^+C+M (red) or no-LRTI (blue). C) ROC curves demonstrating host classifier performance for derivation (blue) and validation (green) cohorts.

Classifier performance assessed by leave-one-out cross-validation demonstrated an AUC of 0.90 (95% CI 0.75 – 1.00) in the derivation cohort and an AUC of 0.88 (95% CI 0.75 – 1.00) in the validation cohort (Fig 5C). Covariates for immune suppression, concurrent non-pulmonary infection, antibiotic use, age, and gender were iteratively incorporated into the regression model but none were significant enough to be maintained when sparsity was added by elastic net (Table S6B). We tested whether differences in host gene expression could be attributed to enrichment of specific cell types using CIBERSORT(53) (Table S6C) and found that only M2 Macrophages were enriched in the no-LRTI group (p = 0.03 by Wilcox Rank Sum).

Finally, given our modest sample size, we tested the statistical power of our host classifier by computing learning curves (Methods). We observed that even with subsampling, the 12 classifier genes were continually represented. While the derivation cohort sample size approached the limit required for robust performance assessment, the analysis suggested that additional patients might lead to further improvement (Figure S5A). A similar analysis for the pathogen versus commensal LRM indicated that microbial sample size was saturated, and that performance assessment was optimized (Figure S5B).

Evaluation of a Combined LRTI Metric

Given the relative success of each independent metric (pathogen, microbiome and host) for discerning presence of infection, we asked whether combining them could enhance LRTI detection. We first incorporated the values of each individual metric into a combined logistic regression model (Methods). Although SDI did not improve performance, combining host and microbe scores enhanced performance for LRTI detection in both the derivation (AUC 1.00, 95% CI = 1.00 – 1.00) and validation cohorts (AUC 0.99, 95% CI = 0.97 – 1.00) compared to the individual performance of each metric (Fig 6A).

Figure 6. Combined LRTI prediction metric integrating pathogen detection and host gene expression.

A) ROC curves demonstrating combined (pathogen and host) logistic regression model performance for derivation (blue) and validation (green) cohorts. B) Scores per patient for each of the two components of this LRTI rule-out model projected in a scatterplot where the X-axis represents the host metric and the Y-axis represents the microbe score. The thresholds optimized for sensitivity in the derivation cohort are indicated by grey dashed lines. Each point represents one patient – those that were in the derivation cohort have no fill and those that were in the validation cohort are filled. Red indicates LRTI^+C+M and blue indicates no-LRTI subjects. C) LRTI rule-out model results for each patient are shown for both the derivation and validation cohorts, with study subjects shown in rows and metrics in columns. Dark grey indicates a metric exceeded the optimized LRTI threshold, light grey indicates it did not. Dark red indicates the subject was positive for both pathogen and host metrics, and thus was classified as LRTI-positive.

We recognized the potential for mNGS to empower a data-driven assessment of a patient’s LRTI status during the critical timeframe following ICU admission, especially since mNGS assays can now be performed in under 24 hours(6, 54, 55). As such, we sought to deliver a readily interpretable compilation of host and pathogen mNGS metrics by developing a rule-out model that maximized LRTI diagnostic sensitivity (Fig 6B). This process, which involved optimizing intra-metric LRTI positivity thresholds in the derivation cohort and calling positivity based on either the host or pathogen scores, achieved a sensitivity and specificity of 100% and 87.5%, respectively in the validation cohort (Fig 6C). We considered the future potential value of the rule-out model for curbing broad-spectrum antibiotic overuse in the ICU(56), and thus performed a theoretical calculation in the no-LRTI group to estimate the potential impact of mNGS result availability at 48 hours post-enrollment. This estimate suggested that a significant reduction in unnecessary empiric antibiotic use could have been possible (78 versus 50 days of therapy, p = 0.03, supplemental methods).

DISCUSSION

Of all infectious disease categories, LRTIs impart the greatest mortality both worldwide and in the United States(1). Contributing to this is the rising rate of treatment failure due to antibiotic resistance(57) and the limited performance of existing diagnostics for identifying respiratory pathogens(2, 58). In this prospective cohort study, we describe the use of unbiased mNGS for respiratory infectious disease diagnosis in the ICU. We develop methods that advance pathogen-based genomic diagnostics as well as existing host transcriptional classifier platforms by simultaneously assessing respiratory pathogens, the lung microbiome, and the host transcriptome in a single test to predict LRTI and identify disease etiology. We find that host/pathogen mNGS accurately detects LRTI in patients with acute respiratory failure and can provide a microbiologic diagnosis in cases due to unknown etiology.

Host transcriptional profiling has gained attention as a promising approach to LRTI diagnosis(59, 60) but is understudied in critically ill and immunocompromised patients, who may be the most likely to benefit from this technology. We addressed this gap by interrogating airway gene expression in a critically ill cohort with 45% immunocompromised patients to develop an accurate host transcriptional classifier. Unlike existing classifiers, host-microbe mNGS offers the advantage of simultaneous species-level microbial identification.

The role of commensal lung microbiota in health and disease is an area of active investigation. We corroborated prior findings demonstrating microbiome differences between subjects with respiratory infections and those with non-infectious airway disease(20, 38). More specifically, we found that LRTI was associated with reduced intra-patient alpha diversity of the lung microbiome and that collectively, patients with LRTI differed significantly from those without in terms of beta diversity and microbial burden. This diversity difference was more pronounced when assessed by RNA-Seq, potentially due to inclusion of RNA viruses and transcripts from actively replicating pathogens in infected patients. As a biomarker, RNA-Seq SDI had moderate utility for predicting LRTI; however, it did not enhance performance in combination with the other metrics, perhaps due to negative correlation with microbe score (r = −0.84 in the derivation cohort).

Discriminating respiratory pathogens from background commensal microbiota is a key challenge for LRTI diagnostics and is particularly relevant for sensitive molecular assays(61). We directly addressed this by developing two complementary algorithms (RBM, LRM) that parsed putative pathogens from airway commensals. When combined, these models enabled a microbiologic diagnosis in significantly more patients with LRTI compared to clinician-ordered diagnostics. Notably, both models also proved useful despite widespread antibiotic use prior to airway sampling (90% of subjects), a practice that occurs commonly and that can sterilize microbial cultures(62).

The capacity for mNGS to detect pathogens unidentifiable by standard clinical diagnostics was highlighted in several cases, including that of subject 254, who developed rapidly worsening respiratory failure and fever during a prolonged post-surgical admission. He was treated empirically for hospital acquired pneumonia with linezolid, aztreonam and metronidazole. Lower respiratory cultures returned negative, but mNGS identified influenza C, which is not available on most clinical multiplex viral PCR assays. Notably, 12% of subjects were found to have undetected and potentially transmissible respiratory viruses despite strict precautionary respiratory contact policies at the study site, a finding that suggests the potential value of mNGS for hospital infection control. Several cases also highlighted the potential for mNGS to enhance antibiotic stewardship, and we estimated that theoretical implementation of the rule-out model within 48 hours could have reduced antibiotic days of therapy by 36% in the no-LRTI validation cohort patients.

Since at the time of ICU admission it is often difficult to distinguish infectious from noninfectious acute respiratory disease, a theoretical workflow for host/microbe mNGS could involve first employing the rule-out model to asses LRTI probability and complement clinical decision making regarding discontinuation of empiric antimicrobials. In cases where LRTI was ultimately suspected, a microbiologic diagnosis could then be obtained using a combination of the RBM and LRM to accurately screen for both well-established and uncommon respiratory pathogens. A principal advantage of mNGS is that all potential infectious agents can be simultaneously assessed, which avoids the need for ordering multiple individual tests for each different pathogen of concern. Future studies in a larger validation cohort can help optimize host and microbe LRTI rule-out thresholds and further assess test performance prior to deployment in a clinical setting.

Some limitations of host/microbe mNGS were apparent and included false-positive detection of pathobionts such as H. influenzae and S. pneumoniae in the no-LRTI group, and false positivity of the host-response metric in subjects including patient 349, who was diagnosed with alpha-1 antitrypsin deficiency-associated pulmonary disease. The relatively small sample size of our derivation and validation cohorts increased the potential for data overfitting and is a limitation of our study. Learning curve estimates, however, indicated an appropriate sample size for optimal pathogen versus commensal prediction, and an adequate sample size for host classifier performance assessment, although a larger cohort size will be necessary to improve the reliability of estimated model performance.

Strengths of this study include an innovative bioinformatics approach, detailed patient phenotyping, and a study population reflective of the true heterogeneity of ICU patients, including severely immunocompromised subjects and patients receiving broad spectrum antibiotics. Future studies in a larger cohort can further validate these findings, strengthen the utility of these models, and assess the impact of mNGS on clinical outcomes. In summary, we report a multifaceted approach to LRTI diagnosis that is the first to integrate three central elements of airway infections: the pathogen, lung microbiome and host’s response.

METHODS

Study Design and Subjects

This prospective observational study evaluated adults with acute respiratory failure requiring mechanical ventilation who were admitted to the University of California San Francisco (UCSF) Moffitt-Long Hospital ICUs. Subjects were enrolled sequentially between 7/25/13 and 10/17/17 within the first 72 hours of intubation for respiratory failure. The UCSF Institutional Review Board approved an initial waiver consent for obtaining excess respiratory fluid, blood and urine samples, and informed consent was subsequently obtained from patients or their surrogates for continued study participation. For patients whose surrogates provided informed consent, follow-up consent was then obtained if patients survived their acute illness and regained the ability to consent. For subjects who died prior to consent being obtained, a full waiver of consent was approved. For all surviving subjects, if consent was not eventually obtained from either patient or surrogate, all specimens were discarded.

Clinical Microbiologic Testing

During the period of study enrollment, subjects received standard of care microbiologic testing ordered by the treating clinicians. Respiratory testing from TA, bronchial alveolar lavage (BAL) or mini-BAL included: bacterial and fungal stains and semi-quantitative cultures (n=90); AFB stains and cultures (n=8); 12-target clinical multiplex PCR (Luminex) for influenza A/B, respiratory syncytial virus (RSV), human metapneumovirus (HMPV), human rhinovirus (HRV), adenovirus (ADV) and parainfluenza viruses (PIV) 1-4 (n=23); Legionella culture (n=1); Legionella pneumophila PCR (n=4), cytomegalovirus (CMV) culture (n=4) and cytology for Pneumocystis jiroveccii (n=4). Other microbiologic testing included blood culture (n=89); urine culture (n=87); serum cryptococcal antigen (n=4); serum galactomannan (n=1); serum β-D-glucan (n=1).

Definitions and Clinical Adjudication of LRTI

Because admission diagnoses made by treating clinicians at the time of study enrollment were by necessity based on incomplete clinical, microbiologic and treatment outcome information, a post-hoc adjudication approach was carried out to enhance accuracy of LRTI diagnosis. For this, two attending physicians (one from infectious disease [CL], one from pulmonary medicine [FM]) blinded to mNGS results, retrospectively reviewed each patient’s medical record following hospital discharge or death to determine if they met the United States Centers for Disease Control/National Healthcare Safety Network (CDC/NHSN) surveillance definition of pneumonia, with respect to clinical and/or microbiologic criteria (Table S1)(19). Chart review consisted of in-depth analysis of complete patient histories, including laboratory and radiographic results, inpatient notes, and post-discharge clinic notes. Using this approach, subjects were assigned to one of four groups, consistent with a recently described approach(59): 1) LRTI defined by both clinical and laboratory criteria; 2) no evidence of respiratory infection and with a clear alternative explanation for respiratory failure (no-LRTI); 3) LRTI defined by clinical criteria only (LRTI^+C) and 4) unknown, LRTI possible (unk-LRT). A determination of noninfectious etiology was made only if an alternative diagnosis could be established and results of standard clinical microbiological testing for LRTI were negative.

Host/Microbe Metagenomic Next-Generation Sequencing

Excess TA was collected on ice, mixed 1:1 with DNA/RNA Shield (Zymo) and frozen at - 80°C. RNA and DNA were extracted from 300μl of patient TA using bead-based lysis and the Allprep DNA/RNA kit (Qiagen). RNA was reverse transcribed to generate complementary DNA (cDNA) and used to construct sequencing libraries using the NEBNext Ultra II Library Prep Kit (New England Biolabs). DNA underwent adapter addition and barcoding using the Nextera library preparation kit (Illumina) as previously described(20). Depletion of abundant sequences by hybridization (DASH) was employed to selectively deplete human mitochondrial cDNA, thus enriching for both microbial and human protein coding transcripts(63). The final RNA-Seq and DNA sequencing (DNA-Seq) libraries underwent 125 nucleotide paired-end Illumina sequencing on a HiSeq 4000.

Pathogen Detection Bioinformatics

Detection of host transcripts and airway microbes leveraged a custom bioinformatics pipeline(20) that incorporated quality filtering using PRICESeqfilter²⁴ and alignment against the human genome (NCBI GRC h38) using the STAR(65) aligner to extract genecounts. To capture respiratory pathogens, additional filtering to remove Pan troglodytes (UCSC PanTro4) using STAR and removal of non-fungal eukaryotes, cloning vectors and phiX phage was performed using Bowtie2(66). The identities of the remaining microbial reads were determined by querying the NCBI nucleotide (NT) and non-redundant protein (NR) databases using GSNAP-L and RAPSEARCH2, respectively.

Microbial alignments detected by RNA-Seq and DNA-Seq were aggregated to the genus-level and independently evaluated to determine genus alpha diversity as described below. The sequencing reads comprising each genus were then evaluated for taxonomic assignment at the species level based on species relative abundance as previously described(20). For each patient, the top 15 most abundant taxa by RNA rpm were identified and evaluated under the requirement that all bacteria, fungi, and DNA viruses had concordant detection of their genomes by DNA-Seq and concordant alignments in NR and NT. RNA viruses did not require concordant DNA-Seq reads. (Figure 2 and Table S3). To differentiate putative pathogens from commensal microbiota, we developed rules-based (RBM) and logistic regression (LRM) methods and benchmarked each on sequencing data from LRTI^+C+M and no-LRTI subjects.

Statistical Analysis

Statistical significance was defined as P less than 0.05, using two-tailed tests of hypotheses. Categorical data were analyzed by chi-squared test and nonparametric continuous variables were analyzed by Wilcoxon rank-sum. For statistical validation in the pathogen versus commensal and LRTI prediction metrics, 10 LRTI^+C+M and 10 no-LRTI cases were randomly assigned to create a derivation cohort. Model performance was assessed in an independent validation cohort consisting of 16 LRTI^+C+M and 8 no-LRTI cases.

Pathogen versus Commensal Models

We found that all clinically-confirmed LRTI pathogens were present within the top 15 most abundant microbes by RNA-Seq rpm, which on average represented 99% of reads across all samples. We thus limited analysis to the 15 most abundant NGS-detected genera in each sample. For both models, microbes identified using clinician-ordered diagnostics and all viruses with established respiratory pathogenicity in the derivation cohort subjects were considered “pathogens.” Any additional microbes identified by mNGS in these subjects were considered “commensals”. This equated to 12 “pathogens” and 155 “commensals” in the 20 derivation cohort patients and 26 “pathogens” and 174 “commensals” in the 24 validation cohort patients.

Rules-Based Model (RBM)

This model leveraged previous findings demonstrating that microbial communities in patients with LRTI are characterized by one or more dominant pathogens present in high abundance(20, 40). Using either RNA-Seq rpm alone (RNA-viruses) or the combination of RNA-Seq and DNA-Seq rpm (all others), this model identified the subset of microbes with the greatest relative abundance in each sample, which consisted of single microbes in cases of a dominant pathogen and also identified co-infections where several microbes were present within a similar range. All viruses detected by RNA-Seq at > 0.1 rpm and present within the a priori – developed reference index of established respiratory pathogens were considered putative pathogens in the model. The remaining taxa (bacteria, fungi, and DNA viruses) were then aggregated at the genus level, assigned an abundance score based on (log(RNA-Seq rpm) + log(DNA-Seq rpm)), and sorted in descending order by this score. The greatest change in abundance score between sequentially ranked microbes was identified, and all genera with an abundance score greater than this threshold were then evaluated at the species level, by identifying the most abundant species within each genus. If the species was present within the a. priori – developed reference index of established respiratory pathogens, it was selected as a putative pathogen by the model (Fig 2).

Logistic Regression Model (LRM)

This model employed the Python (v 3.6.1) sklearn (v 0.18.1) package to train on distinguishing between “pathogen” and “commensals” using the following five input features: log(RNA-Seq rpm), log(DNA-Seq rpm), per-patient RNA-Seq abundance rank, and two binary variables indicating whether the microbe could be identified in the established index of respiratory pathogens or was a virus. These features were selected in alignment with the observation that the pathogens identified in the LRTI^+C+M group were more abundant and within the top-ranked microbes. Moreover, the individual features were significantly different between the pathogens and commensals: (RNA-Seq rpm p = 2.44×10⁻⁴, DNA-Seq rpm p = 3.55×10⁻³, scoring rank P = 3.51×10⁻⁶). Model performance was estimated in the derivation and validation cohorts and learning curves were computed (Supplemental Methods). For identification of etiologic pathogens reported (Fig 3, Table S3, Table S2) the threshold of 0.36 was used for consistency between the LRM for pathogen identification and LRTI detection.

LRTI Prediction Based on Pathogen

Outside of identifying putative LRTI pathogens, we evaluated whether LRM microbial score alone could be used to classify subjects as LRTI positive or LRTI negative. To do so, we used the top LRM-derived pathogen probability score per patient and evaluated the performance of this value alone to predict likelihood of infection in the LRTI^+C+M versus no-LRTI subjects.

Lung Microbiome Diversity Analysis

Alpha diversity of the respiratory microbiome for each subject was assessed by Shannon Diversity Index (SDI) and Simpson Diversity Index at the genus level using NT rpm and the Vegan (version 2.4.4)(67) package in R (version 3.4.0)(68). Richness (total number of genera) and Burden (combined rpm of all genera) were also evaluated. Viral, bacterial, and fungal microbes were included in all diversity analyses, computed independently for RNA- and DNA-Seq samples without requiring that taxa be concordant on both nucleic acids. Diversity values were then compared between patients with clinically adjudicated LRTI (LRTI^+C+M) and those with respiratory failure due to non-infectious causes (no-LRTI) using the nonparametric Wilcoxon Rank Sum test. Evaluation of alpha diversity for prediction LRTI status was performed using the SDI value. Beta diversity was evaluated using the Bray-Curtis dissimilarity metric calculated at the genus level using NT rpm and the Vegan package in R. Statistical significance of the beta diversity between LRTI^+C+M and no-LRTI patients was assessed using permutation analysis of variance (PERMANOVA, 999 permutations) and the results were visualized using Non-metric Multidimensional Scaling (NMDS).

Host Gene Expression Analysis

Following quality filtration with PRICESeqfilter(64), RNA transcripts were aligned to the ENSEMBL CRCh38 human genome build using STAR. Patients with fewer than 10,000 total protein-coding human gene counts were removed from the analysis. Subsequently, genes were filtered to include only protein-coding genes that were expressed in at least 50% of patients.

Differential Expression Analysis

Gene count data were analyzed using the Bioconductor package DESeq2 (v 1.16.1)(69) in R statistical programming environment. To avoid batch-related confounding and class imbalance, we limited our differential expression analysis to the derivation cohort of 10 LRTI^+C+M and 10 no-LRTI samples, sequenced in the same batch. Differentially expressed genes with FDR < .05 were used as input to ToppGene(44) to evaluate for functional pathway enrichment.

Host Gene Expression Classifier for LRTI Prediction

The derivation cohort was independently normalized using DESeq2 and log-transformed. The values for each gene in the derivation cohort were then scaled and centered by z-score. A classifier was built using the elastic net regularized regression model implementation from the glmnet package (version 2.0.13) in the R Statistical Programming Language (version 3.4.0). Regularization parameter alpha = .5 was selected using leave-one-out cross-validation and optimizing for AUC. To account for heterogeneity in the cohort, the model included covariates of concurrent bloodstream infection, immunosuppression, and gender. No significant difference was seen in these parameters between LRTI^+C+M and no-LRTI (Table S6). These covariates were reduced to zero in the model fitting stage. Genes with non-zero weights were used for classification. To obtain a single-value score for each patient, genes selected by the elastic net were evaluated for their correlation with each of the two groups. Genes for which the mean expression was greater in the LRTI^+C+M were assigned a weight of 1, and those with mean expression greater in no-LRTI were assigned a weight of −1. The normalized, scaled, expression values for each patient were multiplied by the weight vector and summed across all genes. The total sum was used as a representative score and the AUC was calculated. Given the importance of sensitivity in the context of diagnostics, the threshold selected for analysis of the test cohort and combined metrics (scores = −4) was chosen as the threshold which provided 100% sensitivity in the derivation cohort. The host gene expression classifier was then validated on the validation set and learning curves were used to estimate the reliability of the performance metrics (Supplemental Methods).

Classifier Combination

Two methods for combining the pathogen, microbial diversity, and host models for predicting LRTI versus non-infectious lung disease were evaluated. In the first model, score outputs from each of the three individual metrics (pathogen, diversity, host) were used as features in a combined logistic regression model. Performance was evaluated through leave-one-out cross-validation in the derivation cohort, followed by testing in the validation cohort. After determining that microbial diversity did not improve the overall performance of the combined model, we evaluated the model’s performance by combining only the pathogen and host metrics. In the second rule-out model, we identified score thresholds from the pathogen and host metrics required to achieve 100% sensitivity in the derivation cohort (pathogen > 0.36, and host > −4) and applied these to the validation cohort to predict LRTI using the following combinatorial rule: LRTI = (Host)postive OR (Microbe)positive.

Identification and Mitigation of Environmental Contaminants

To minimize inaccurate taxonomic assignments due to environmental contaminants, we processed negative water controls with each group of samples that underwent nucleic acid extraction, and included these, as well as positive control clinical samples, with each sequencing run. We directly subtracted alignments to those taxa in water control samples detected by both RNA-Seq and DNA-Seq analyses from the raw rpm values in all samples. To account for selective amplification bias of contaminants in water controls resulting from PCR amplification of metagenomic libraries to a fixed standard concentration across all samples, prior to direct subtraction we scaled taxa rpms in the water controls to the median percent microbial reads present across all samples (0.04%). In addition, we confirmed reproducibility of results by sequencing 10% of samples in triplicate, and evaluated discrepancies between mNGS and standard diagnostics in a random subset of LRTI^+C patients using clinically validated 16S bacterial rRNA gene sequencing and/or viral PCR testing, as described above.

Data Availability

Raw microbial sequences are available via SRA BioProject Accession ID SUB3898227. Host transcript counts are tabulated in (Table S7).

REFERENCES

1.↵
World Health Organization (2017) The top 10 causes of death. Available at: http://www.who.int/mediacentre/factsheets/fs310/en/.
2.↵
Jain S, et al. (2015) Community-Acquired Pneumonia Requiring Hospitalization among U.S. Adults. N Engl J Med 373(5):415–427.
OpenUrl CrossRef PubMed
3.↵
U.S. Centers for Disease Control and Prevention Lead Causes Death. Available at: http://www.cdc.gov/nchs/fastats/leading-causes-of-death.htm [Accessed November 2, 2015].
4.↵
el Bcheraoui C, et al. (2018) Trends and Patterns of Differences in Infectious Disease Mortality Among US Counties, 1980-2014. JAMA 319(12):1248.
OpenUrl
5.↵
Zaas AK, et al. (2014) The current epidemiology and clinical decisions surrounding acute respiratory infections. Trends Mol Med 20(10):579–588.
OpenUrl CrossRef PubMed
6.↵
Wilson MR, et al. (2014) Actionable diagnosis of neuroleptospirosis by next-generation sequencing. N Engl J Med 370(25):2408–2417.
OpenUrl CrossRef PubMed Web of Science
7.↵
Leffler DA, Lamont JT (2015) Clostridium difficile infection. N Engl J Med 372(16):1539–1548.
OpenUrl CrossRef PubMed
8.↵
Ranzani OT, et al. (2017) New Sepsis Definition (Sepsis-3) and Community-acquired Pneumonia Mortality. A Validation and Clinical Decision-Making Study. Am J Respir Crit Care Med 196(10):1287–1297.
OpenUrl
9.↵
Bibby K (2013) Metagenomic identification of viral pathogens. Trends Biotechnol 31(5):275–279.
OpenUrl CrossRef PubMed
10.↵
Yozwiak NL, et al. (2012) Virus identification in unknown tropical febrile illness cases using deep sequencing. PLoS Negl Trop Dis 6(2):e1485.
OpenUrl CrossRef PubMed
11.↵
Fischer N, et al. (2015) Evaluation of Unbiased Next-Generation Sequencing of RNA (RNAseq) as a Diagnostic Method in Influenza Virus-Positive Respiratory Samples. J Clin Microbiol 53(7):2238–2250.
OpenUrl Abstract/FREE Full Text
12.
Graf EH, et al. (2016) Unbiased Detection of Respiratory Viruses by Use of RNA Sequencing-Based Metagenomics: a Systematic Comparison to a Commercial PCR Panel. J Clin Microbiol 54(4):1000–1007.
OpenUrl Abstract/FREE Full Text
13.↵
Wilson MR, et al. (2015) Diagnosing Balamuthia mandrillaris Encephalitis With Metagenomic Deep Sequencing. Ann Neurol 78(5):722–730.
OpenUrl CrossRef
14.↵
Wilson et al. (In press.) Metagenomics for chronic meningitis: clarifying interpretation and diagnosis. JAMA Intern Med. doi:https://www.biorxiv.org/content/early/2017/11/07/213561.
15.
Naccache SN, et al. (2014) A cloud-compatible bioinformatics pipeline for ultrarapid pathogen identification from next-generation sequencing of clinical samples. Genome Res 24. doi:10.1101/gr.171934.113.
OpenUrl Abstract/FREE Full Text
16.↵
Tsalik EL, et al. (2016) Host gene expression classifiers diagnose acute respiratory illness etiology. Sci Transl Med 8(322):322ra11.
OpenUrl Abstract/FREE Full Text
17.↵
Suarez NM, et al. (2015) Superiority of transcriptional profiling over procalcitonin for distinguishing bacterial from viral lower respiratory tract infections in hospitalized adults. J Infect Dis 212(2):213–222.
OpenUrl CrossRef PubMed
18.↵
Tsalik EL, McClain M, Zaas AK (2015) Moving toward prime time: host signatures for diagnosis of respiratory infections. J Infect Dis 212(2):173–175.
OpenUrl CrossRef PubMed
19.↵
United States Centers for Disease Control and Prevention (2017) CDC/NHSN Surveillance Definitions for Specific Types of Infections. Available at: https://www.cdc.gov/nhsn/pdfs/pscmanual/17pscnosinfdef_current.pdf.
20.↵
Langelier C, et al. (2017) Metagenomic Sequencing Detects Respiratory Pathogens in Hematopoietic Cellular Transplant Patients. Am J Respir Crit Care Med. doi:10.1164/rccm.201706-1097 LE.
OpenUrl CrossRef
21.↵
Wilson MR, et al. (2015) Diagnosing Balamuthia mandrillaris Encephalitis With Metagenomic Deep Sequencing. Ann Neurol 78(5):722–730.
OpenUrl CrossRef
22.↵
Doan T, et al. (2016) Illuminating uveitis: metagenomic deep sequencing identifies common and rare pathogens. Genome Med 8(1):90.
OpenUrl CrossRef
23.↵
Dickson RP, et al. (2017) Bacterial Topography of the Healthy Human Lower Respiratory Tract. mBio 8(1). doi: 10.1128/mBio.02287-16.
OpenUrl Abstract/FREE Full Text
24.
Panzer AR, et al. (2017) Lung Microbiota is Related to Smoking Status and to Development of ARDS in Critically Ill Trauma Patients. Am J Respir Crit Care Med. doi:10.1164/rccm.201702-0441OC.
OpenUrl CrossRef
25.
Morris A, et al. (2013) Comparison of the respiratory microbiome in healthy nonsmokers and smokers. Am J Respir Crit Care Med 187(10):1067–1075.
OpenUrl CrossRef PubMed Web of Science
26.↵
Segal LN, et al. (2016) Enrichment of the lung microbiome with oral taxa is associated with lung inflammation of a Th17 phenotype. Nat Microbiol 1:16031.
OpenUrl
27.↵
Heinonen S, et al. (2016) Rhinovirus Detection in Symptomatic and Asymptomatic Children: Value of Host Transcriptome Analysis. Am J Respir Crit Care Med 193(7):772–782.
OpenUrl CrossRef
28.↵
Wertheim HFL, et al. (2005) The role of nasal carriage in Staphylococcus aureus infections. Lancet Infect Dis 5(12):751–762.
OpenUrl CrossRef PubMed Web of Science
29.↵
McCullers JA (2014) The co-pathogenesis of influenza viruses with bacteria in the lung. Nat Rev Microbiol 12(4):252–262.
OpenUrl CrossRef PubMed
30.↵
Magill SS, et al. (2014) Multistate point-prevalence survey of health care-associated infections. N Engl J Med 370(13):1198–1208.
OpenUrl CrossRef PubMed Web of Science
31.
Kalil AC, et al. (2016) Management of Adults With Hospital-acquired and Ventilator-associated Pneumonia: 2016 Clinical Practice Guidelines by the Infectious Diseases Society of America and the American Thoracic Society. Clin Infect Dis 63(5):e61–e111.
OpenUrl CrossRef PubMed
32.↵
Mandell LA, et al. (2007) Infectious Diseases Society of America/American Thoracic Society consensus guidelines on the management of community-acquired pneumonia in adults. Clin Infect Dis Off Publ Infect Dis Soc Am 44 Suppl 2:S27-72.
OpenUrl
33.↵
Cillóniz C, Civljak R, Nicolini A, Torres A (2016) Polymicrobial community-acquired pneumonia: An emerging entity. Respirol Carlton Vic 21(1):65–75.
OpenUrl
34.↵
Pabbaraju K, et al. (2013) Detection of influenza C virus by a real-time RT-PCR assay. Influenza Other Respir Viruses 7(6):954–960.
OpenUrl
35.↵
Dewhirst FE, et al. (2010) The Human Oral Microbiome. J Bacteriol 192(19):5002–5017.
OpenUrl Abstract/FREE Full Text
36.↵
Chen C, et al. (2013) New microbiota found in sputum from patients with community-acquired pneumonia. Acta Biochim Biophys Sin 45(12):1039–1048.
OpenUrl
37.↵
Ichinohe T, et al. (2011) Microbiota regulates immune defense against respiratory tract influenza A virus infection. Proc Natl Acad Sci 108(13):5354–5359.
OpenUrl Abstract/FREE Full Text
38.↵
Abreu NA, et al. (2012) Sinus microbiome diversity depletion and Corynebacterium tuberculostearicum enrichment mediates rhinosinusitis. Sci Transl Med 4(151):151ra124.
OpenUrl Abstract/FREE Full Text
39.
Dickson RP, et al. (2014) Analysis of culture-dependent versus culture-independent techniques for identification of bacteria in clinically obtained bronchoalveolar lavage fluid. J Clin Microbiol 52(10):3605–3613.
OpenUrl Abstract/FREE Full Text
40.↵
Flanagan JL, et al. (2007) Loss of bacterial diversity during antibiotic treatment of intubated patients colonized with Pseudomonas aeruginosa. J Clin Microbiol 45(6):1954–1962.
OpenUrl Abstract/FREE Full Text
41.↵
Birtel J, Walser J-C, Pichon S, Bürgmann H, Matthews B (2015) Estimating Bacterial Diversity for Ecological Studies: Methods, Metrics, and Assumptions. PLOS ONE 10(4):e0125356.
OpenUrl
42.↵
Bray J. Roger, Curtis J. T. (1957) An Ordination of the Upland Forest Communities of Southern Wisconsin. Ecol Monogr 27(4):325–349.
OpenUrl CrossRef
43.↵
Sweeney TE, Wong HR, Khatri P (2016) Robust classification of bacterial and viral infections via integrated host gene expression diagnostics. Sci Transl Med 8(346):346ra91.
OpenUrl Abstract/FREE Full Text
44.↵
Chen J, Bardes EE, Aronow BJ, Jegga AG (2009) ToppGene Suite for gene list enrichment analysis and candidate gene prioritization. Nucleic Acids Res 37(Web Server):W305–W311.
OpenUrl CrossRef PubMed Web of Science
45.↵
Macian F (2005) NFAT proteins: key regulators of T-cell development and function. Nat Rev Immunol 5(6):472–484.
OpenUrl CrossRef PubMed Web of Science
46.↵
Fu M, Blackshear PJ (2017) RNA-binding proteins in immune regulation: a focus on CCCH zinc finger proteins. Nat Rev Immunol 17(2):130–143.
OpenUrl CrossRef
47.↵
Biswas K, et al. (2017) Differentially Regulated Host Proteins Associated with Chronic Rhinosinusitis Are Correlated with the Sinonasal Microbiome. Front Cell Infect Microbiol 7:504.
OpenUrl
48.↵
Stewart CR, et al. (2010) CD36 ligands promote sterile inflammation through assembly of a Toll-like receptor 4 and 6 heterodimer. Nat Immunol 11(2):155–161.
OpenUrl CrossRef PubMed Web of Science
49.↵
Cohen TS, et al. (2016) S. aureus blocks efferocytosis of neutrophils by macrophages through the activity of its virulence factor alpha toxin. Sci Rep 6:35466.
OpenUrl CrossRef
50.↵
Baranano DE, Rao M, Ferris CD, Snyder SH (2002) Biliverdin reductase: a major physiologic cytoprotectant. Proc Natl Acad Sci U S A 99(25):16093–16098.
OpenUrl Abstract/FREE Full Text
51.↵
Leidi M, Mariotti M, Maier JAM (2010) EDF-1 contributes to the regulation of nitric oxide release in VEGF-treated human endothelial cells. Eur J Cell Biol 89(9):654–660.
OpenUrl CrossRef PubMed
52.↵
Pousada G, Baloira A, Fontán D, Núñez M, Valverde D (2016) Mutational and clinical analysis of the ENG gene in patients with pulmonary arterial hypertension. BMC Genet 17:72.
OpenUrl
53.↵
Newman AM, et al. (2015) Robust enumeration of cell subsets from tissue expression profiles. Nat Methods 12(5):453–457.
OpenUrl CrossRef PubMed
54.↵
PubMed entry Available at: http://www.ncbi.nlm.nih.gov/pubmed/28196961 [Accessed July 10, 2017].
55.↵
Greninger AL, et al. (2015) Rapid metagenomic identification of viral pathogens in clinical samples by real-time nanopore sequencing analysis. Genome Med 7(1):1–13.
OpenUrl CrossRef
56.↵
Baggs J, Fridkin SK, Pollack LA, Srinivasan A, Jernigan JA (2016) Estimating national trends in inpatient antibiotic use among us hospitals from 2006 to 2012. JAMA Intern Med 176(11):1639–1648.
OpenUrl
57.↵
Currie CJ, et al. (2014) Antibiotic treatment failure in four common infections in UK primary care 1991-2012: longitudinal analysis. BMJ 349:g5493.
OpenUrl Abstract/FREE Full Text
58.↵
Jain S, Finelli L, CDC EPIC Study Team (2015) Community-acquired pneumonia among U.S. children. N Engl J Med 372(22):2167–2168.
OpenUrl
59.↵
Tsalik EL, et al. (2016) Host gene expression classifiers diagnose acute respiratory illness etiology. Sci Transl Med 8(322):322ra11.
OpenUrl Abstract/FREE Full Text
60.↵
Zaas AK, et al. (2014) The current epidemiology and clinical decisions surrounding acute respiratory infections. Trends Mol Med 20(10):579–588.
OpenUrl CrossRef PubMed
61.↵
Walter JM, Wunderink RG (2017) Severe Respiratory Viral Infections: New Evidence and Changing Paradigms. Infect Dis Clin North Am 31(3):455–474.
OpenUrl CrossRef
62.↵
Sands KM, et al. (2017) Respiratory pathogen colonization of dental plaque, the lower airways, and endotracheal tube biofilms during mechanical ventilation. J Crit Care 37:30–37.
OpenUrl
63.↵
Gu W, et al. (2016) Depletion of Abundant Sequences by Hybridization (DASH): using Cas9 to remove unwanted high-abundance species in sequencing libraries and molecular counting applications. Genome Biol 17:41.
OpenUrl CrossRef PubMed
64.↵
Ruby JG, Bellare P, Derisi JL (2013) PRICE: software for the targeted assembly of components of (Meta) genomic sequence data. G3 Bethesda Md 3(5):865–880.
OpenUrl
65.↵
Dobin A, et al. (2013) STAR: ultrafast universal RNA-seq aligner. Bioinforma Oxf Engl 29(1):15–21.
OpenUrl
66.↵
Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods 9(4):357–359.
OpenUrl CrossRef PubMed Web of Science
67.↵
Oksanen J, Blanchet, Kindt vegan: Community Ecology Package. R package version 2.3-5. 2016. Available at: https://rdrr.io/rforge/vegan/.
68.↵
R Core Team. R Foundation for Statistical Computing, Vienna, Austria. (2013) R: A language and environment for statistical computing. Available at: http://www.R-project.org/.
69.↵
Love MI, Huber W, Anders S (2014) Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15(12). doi:10.1186/s1 3059-014-0550-8.
OpenUrl CrossRef

View the discussion thread.

Posted June 11, 2018.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Genomics

Subject Areas

All Articles

Animal Behavior and Cognition (5201)
Biochemistry (11715)
Bioengineering (8723)
Bioinformatics (29129)
Biophysics (14936)
Cancer Biology (12049)
Cell Biology (17359)
Clinical Trials (138)
Developmental Biology (9406)
Ecology (14144)
Epidemiology (2067)
Evolutionary Biology (18268)
Genetics (12221)
Genomics (16767)
Immunology (11843)
Microbiology (28014)
Molecular Biology (11560)
Neuroscience (60814)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3231)
Physiology (4940)
Plant Biology (10384)
Scientific Communication and Education (1680)
Synthetic Biology (2878)
Systems Biology (7333)
Zoology (1642)

[1] 1.↵
World Health Organization (2017) The top 10 causes of death. Available at: http://www.who.int/mediacentre/factsheets/fs310/en/.

[2] 2.↵
Jain S, et al. (2015) Community-Acquired Pneumonia Requiring Hospitalization among U.S. Adults. N Engl J Med 373(5):415–427.
OpenUrl CrossRef PubMed

[3] 3.↵
U.S. Centers for Disease Control and Prevention Lead Causes Death. Available at: http://www.cdc.gov/nchs/fastats/leading-causes-of-death.htm [Accessed November 2, 2015].

[4] 4.↵
el Bcheraoui C, et al. (2018) Trends and Patterns of Differences in Infectious Disease Mortality Among US Counties, 1980-2014. JAMA 319(12):1248.
OpenUrl

[5] 5.↵
Zaas AK, et al. (2014) The current epidemiology and clinical decisions surrounding acute respiratory infections. Trends Mol Med 20(10):579–588.
OpenUrl CrossRef PubMed

[6] 6.↵
Wilson MR, et al. (2014) Actionable diagnosis of neuroleptospirosis by next-generation sequencing. N Engl J Med 370(25):2408–2417.
OpenUrl CrossRef PubMed Web of Science

[7] 7.↵
Leffler DA, Lamont JT (2015) Clostridium difficile infection. N Engl J Med 372(16):1539–1548.
OpenUrl CrossRef PubMed

[8] 8.↵
Ranzani OT, et al. (2017) New Sepsis Definition (Sepsis-3) and Community-acquired Pneumonia Mortality. A Validation and Clinical Decision-Making Study. Am J Respir Crit Care Med 196(10):1287–1297.
OpenUrl

[9] 9.↵
Bibby K (2013) Metagenomic identification of viral pathogens. Trends Biotechnol 31(5):275–279.
OpenUrl CrossRef PubMed

[10] 10.↵
Yozwiak NL, et al. (2012) Virus identification in unknown tropical febrile illness cases using deep sequencing. PLoS Negl Trop Dis 6(2):e1485.
OpenUrl CrossRef PubMed

[11] 11.↵
Fischer N, et al. (2015) Evaluation of Unbiased Next-Generation Sequencing of RNA (RNAseq) as a Diagnostic Method in Influenza Virus-Positive Respiratory Samples. J Clin Microbiol 53(7):2238–2250.
OpenUrl Abstract/FREE Full Text

[12] 12.
Graf EH, et al. (2016) Unbiased Detection of Respiratory Viruses by Use of RNA Sequencing-Based Metagenomics: a Systematic Comparison to a Commercial PCR Panel. J Clin Microbiol 54(4):1000–1007.
OpenUrl Abstract/FREE Full Text

[13] 13.↵
Wilson MR, et al. (2015) Diagnosing Balamuthia mandrillaris Encephalitis With Metagenomic Deep Sequencing. Ann Neurol 78(5):722–730.
OpenUrl CrossRef

[14] 14.↵
Wilson et al. (In press.) Metagenomics for chronic meningitis: clarifying interpretation and diagnosis. JAMA Intern Med. doi:https://www.biorxiv.org/content/early/2017/11/07/213561.

[15] 15.
Naccache SN, et al. (2014) A cloud-compatible bioinformatics pipeline for ultrarapid pathogen identification from next-generation sequencing of clinical samples. Genome Res 24. doi:10.1101/gr.171934.113.
OpenUrl Abstract/FREE Full Text

[16] 16.↵
Tsalik EL, et al. (2016) Host gene expression classifiers diagnose acute respiratory illness etiology. Sci Transl Med 8(322):322ra11.
OpenUrl Abstract/FREE Full Text

[17] 17.↵
Suarez NM, et al. (2015) Superiority of transcriptional profiling over procalcitonin for distinguishing bacterial from viral lower respiratory tract infections in hospitalized adults. J Infect Dis 212(2):213–222.
OpenUrl CrossRef PubMed

[18] 18.↵
Tsalik EL, McClain M, Zaas AK (2015) Moving toward prime time: host signatures for diagnosis of respiratory infections. J Infect Dis 212(2):173–175.
OpenUrl CrossRef PubMed

[19] 19.↵
United States Centers for Disease Control and Prevention (2017) CDC/NHSN Surveillance Definitions for Specific Types of Infections. Available at: https://www.cdc.gov/nhsn/pdfs/pscmanual/17pscnosinfdef_current.pdf.

[20] 20.↵
Langelier C, et al. (2017) Metagenomic Sequencing Detects Respiratory Pathogens in Hematopoietic Cellular Transplant Patients. Am J Respir Crit Care Med. doi:10.1164/rccm.201706-1097 LE.
OpenUrl CrossRef

[21] 21.↵
Wilson MR, et al. (2015) Diagnosing Balamuthia mandrillaris Encephalitis With Metagenomic Deep Sequencing. Ann Neurol 78(5):722–730.
OpenUrl CrossRef

[22] 22.↵
Doan T, et al. (2016) Illuminating uveitis: metagenomic deep sequencing identifies common and rare pathogens. Genome Med 8(1):90.
OpenUrl CrossRef

[23] 23.↵
Dickson RP, et al. (2017) Bacterial Topography of the Healthy Human Lower Respiratory Tract. mBio 8(1). doi: 10.1128/mBio.02287-16.
OpenUrl Abstract/FREE Full Text

[24] 24.
Panzer AR, et al. (2017) Lung Microbiota is Related to Smoking Status and to Development of ARDS in Critically Ill Trauma Patients. Am J Respir Crit Care Med. doi:10.1164/rccm.201702-0441OC.
OpenUrl CrossRef

[25] 25.
Morris A, et al. (2013) Comparison of the respiratory microbiome in healthy nonsmokers and smokers. Am J Respir Crit Care Med 187(10):1067–1075.
OpenUrl CrossRef PubMed Web of Science

[26] 26.↵
Segal LN, et al. (2016) Enrichment of the lung microbiome with oral taxa is associated with lung inflammation of a Th17 phenotype. Nat Microbiol 1:16031.
OpenUrl

[27] 27.↵
Heinonen S, et al. (2016) Rhinovirus Detection in Symptomatic and Asymptomatic Children: Value of Host Transcriptome Analysis. Am J Respir Crit Care Med 193(7):772–782.
OpenUrl CrossRef

[28] 28.↵
Wertheim HFL, et al. (2005) The role of nasal carriage in Staphylococcus aureus infections. Lancet Infect Dis 5(12):751–762.
OpenUrl CrossRef PubMed Web of Science

[29] 29.↵
McCullers JA (2014) The co-pathogenesis of influenza viruses with bacteria in the lung. Nat Rev Microbiol 12(4):252–262.
OpenUrl CrossRef PubMed

[30] 30.↵
Magill SS, et al. (2014) Multistate point-prevalence survey of health care-associated infections. N Engl J Med 370(13):1198–1208.
OpenUrl CrossRef PubMed Web of Science

[31] 31.
Kalil AC, et al. (2016) Management of Adults With Hospital-acquired and Ventilator-associated Pneumonia: 2016 Clinical Practice Guidelines by the Infectious Diseases Society of America and the American Thoracic Society. Clin Infect Dis 63(5):e61–e111.
OpenUrl CrossRef PubMed

[32] 32.↵
Mandell LA, et al. (2007) Infectious Diseases Society of America/American Thoracic Society consensus guidelines on the management of community-acquired pneumonia in adults. Clin Infect Dis Off Publ Infect Dis Soc Am 44 Suppl 2:S27-72.
OpenUrl

[33] 33.↵
Cillóniz C, Civljak R, Nicolini A, Torres A (2016) Polymicrobial community-acquired pneumonia: An emerging entity. Respirol Carlton Vic 21(1):65–75.
OpenUrl

[34] 34.↵
Pabbaraju K, et al. (2013) Detection of influenza C virus by a real-time RT-PCR assay. Influenza Other Respir Viruses 7(6):954–960.
OpenUrl

[35] 35.↵
Dewhirst FE, et al. (2010) The Human Oral Microbiome. J Bacteriol 192(19):5002–5017.
OpenUrl Abstract/FREE Full Text

[36] 36.↵
Chen C, et al. (2013) New microbiota found in sputum from patients with community-acquired pneumonia. Acta Biochim Biophys Sin 45(12):1039–1048.
OpenUrl

[37] 37.↵
Ichinohe T, et al. (2011) Microbiota regulates immune defense against respiratory tract influenza A virus infection. Proc Natl Acad Sci 108(13):5354–5359.
OpenUrl Abstract/FREE Full Text

[38] 38.↵
Abreu NA, et al. (2012) Sinus microbiome diversity depletion and Corynebacterium tuberculostearicum enrichment mediates rhinosinusitis. Sci Transl Med 4(151):151ra124.
OpenUrl Abstract/FREE Full Text

[39] 39.
Dickson RP, et al. (2014) Analysis of culture-dependent versus culture-independent techniques for identification of bacteria in clinically obtained bronchoalveolar lavage fluid. J Clin Microbiol 52(10):3605–3613.
OpenUrl Abstract/FREE Full Text

[40] 40.↵
Flanagan JL, et al. (2007) Loss of bacterial diversity during antibiotic treatment of intubated patients colonized with Pseudomonas aeruginosa. J Clin Microbiol 45(6):1954–1962.
OpenUrl Abstract/FREE Full Text

[41] 41.↵
Birtel J, Walser J-C, Pichon S, Bürgmann H, Matthews B (2015) Estimating Bacterial Diversity for Ecological Studies: Methods, Metrics, and Assumptions. PLOS ONE 10(4):e0125356.
OpenUrl

[42] 42.↵
Bray J. Roger, Curtis J. T. (1957) An Ordination of the Upland Forest Communities of Southern Wisconsin. Ecol Monogr 27(4):325–349.
OpenUrl CrossRef

[43] 43.↵
Sweeney TE, Wong HR, Khatri P (2016) Robust classification of bacterial and viral infections via integrated host gene expression diagnostics. Sci Transl Med 8(346):346ra91.
OpenUrl Abstract/FREE Full Text

[44] 44.↵
Chen J, Bardes EE, Aronow BJ, Jegga AG (2009) ToppGene Suite for gene list enrichment analysis and candidate gene prioritization. Nucleic Acids Res 37(Web Server):W305–W311.
OpenUrl CrossRef PubMed Web of Science

[45] 45.↵
Macian F (2005) NFAT proteins: key regulators of T-cell development and function. Nat Rev Immunol 5(6):472–484.
OpenUrl CrossRef PubMed Web of Science

[46] 46.↵
Fu M, Blackshear PJ (2017) RNA-binding proteins in immune regulation: a focus on CCCH zinc finger proteins. Nat Rev Immunol 17(2):130–143.
OpenUrl CrossRef

[47] 47.↵
Biswas K, et al. (2017) Differentially Regulated Host Proteins Associated with Chronic Rhinosinusitis Are Correlated with the Sinonasal Microbiome. Front Cell Infect Microbiol 7:504.
OpenUrl

[48] 48.↵
Stewart CR, et al. (2010) CD36 ligands promote sterile inflammation through assembly of a Toll-like receptor 4 and 6 heterodimer. Nat Immunol 11(2):155–161.
OpenUrl CrossRef PubMed Web of Science

[49] 49.↵
Cohen TS, et al. (2016) S. aureus blocks efferocytosis of neutrophils by macrophages through the activity of its virulence factor alpha toxin. Sci Rep 6:35466.
OpenUrl CrossRef

[50] 50.↵
Baranano DE, Rao M, Ferris CD, Snyder SH (2002) Biliverdin reductase: a major physiologic cytoprotectant. Proc Natl Acad Sci U S A 99(25):16093–16098.
OpenUrl Abstract/FREE Full Text

[51] 51.↵
Leidi M, Mariotti M, Maier JAM (2010) EDF-1 contributes to the regulation of nitric oxide release in VEGF-treated human endothelial cells. Eur J Cell Biol 89(9):654–660.
OpenUrl CrossRef PubMed

[52] 52.↵
Pousada G, Baloira A, Fontán D, Núñez M, Valverde D (2016) Mutational and clinical analysis of the ENG gene in patients with pulmonary arterial hypertension. BMC Genet 17:72.
OpenUrl

[53] 53.↵
Newman AM, et al. (2015) Robust enumeration of cell subsets from tissue expression profiles. Nat Methods 12(5):453–457.
OpenUrl CrossRef PubMed

[54] 54.↵
PubMed entry Available at: http://www.ncbi.nlm.nih.gov/pubmed/28196961 [Accessed July 10, 2017].

[55] 55.↵
Greninger AL, et al. (2015) Rapid metagenomic identification of viral pathogens in clinical samples by real-time nanopore sequencing analysis. Genome Med 7(1):1–13.
OpenUrl CrossRef

[56] 56.↵
Baggs J, Fridkin SK, Pollack LA, Srinivasan A, Jernigan JA (2016) Estimating national trends in inpatient antibiotic use among us hospitals from 2006 to 2012. JAMA Intern Med 176(11):1639–1648.
OpenUrl

[57] 57.↵
Currie CJ, et al. (2014) Antibiotic treatment failure in four common infections in UK primary care 1991-2012: longitudinal analysis. BMJ 349:g5493.
OpenUrl Abstract/FREE Full Text

[58] 58.↵
Jain S, Finelli L, CDC EPIC Study Team (2015) Community-acquired pneumonia among U.S. children. N Engl J Med 372(22):2167–2168.
OpenUrl

[59] 59.↵
Tsalik EL, et al. (2016) Host gene expression classifiers diagnose acute respiratory illness etiology. Sci Transl Med 8(322):322ra11.
OpenUrl Abstract/FREE Full Text

[60] 60.↵
Zaas AK, et al. (2014) The current epidemiology and clinical decisions surrounding acute respiratory infections. Trends Mol Med 20(10):579–588.
OpenUrl CrossRef PubMed

[61] 61.↵
Walter JM, Wunderink RG (2017) Severe Respiratory Viral Infections: New Evidence and Changing Paradigms. Infect Dis Clin North Am 31(3):455–474.
OpenUrl CrossRef

[62] 62.↵
Sands KM, et al. (2017) Respiratory pathogen colonization of dental plaque, the lower airways, and endotracheal tube biofilms during mechanical ventilation. J Crit Care 37:30–37.
OpenUrl

[63] 63.↵
Gu W, et al. (2016) Depletion of Abundant Sequences by Hybridization (DASH): using Cas9 to remove unwanted high-abundance species in sequencing libraries and molecular counting applications. Genome Biol 17:41.
OpenUrl CrossRef PubMed

[64] 64.↵
Ruby JG, Bellare P, Derisi JL (2013) PRICE: software for the targeted assembly of components of (Meta) genomic sequence data. G3 Bethesda Md 3(5):865–880.
OpenUrl

[65] 65.↵
Dobin A, et al. (2013) STAR: ultrafast universal RNA-seq aligner. Bioinforma Oxf Engl 29(1):15–21.
OpenUrl

[66] 66.↵
Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods 9(4):357–359.
OpenUrl CrossRef PubMed Web of Science

[67] 67.↵
Oksanen J, Blanchet, Kindt vegan: Community Ecology Package. R package version 2.3-5. 2016. Available at: https://rdrr.io/rforge/vegan/.

[68] 68.↵
R Core Team. R Foundation for Statistical Computing, Vienna, Austria. (2013) R: A language and environment for statistical computing. Available at: http://www.R-project.org/.

[69] 69.↵
Love MI, Huber W, Anders S (2014) Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15(12). doi:10.1186/s1 3059-014-0550-8.
OpenUrl CrossRef

Integrating Host Response and Unbiased Microbe Detection for Lower Respiratory Tract Infection Diagnosis in Critically Ill Adults

ABSTRACT

INTRODUCTION

RESULTS

Pathogen Detection

LRTI Prediction Based on Pathogen

LRTI Prediction Based on Lung Microbiome Diversity

LRTI Prediction Based on Host Response

Evaluation of a Combined LRTI Metric

DISCUSSION

METHODS

Study Design and Subjects

Clinical Microbiologic Testing

Definitions and Clinical Adjudication of LRTI

Host/Microbe Metagenomic Next-Generation Sequencing

Pathogen Detection Bioinformatics

Statistical Analysis

Pathogen versus Commensal Models

Rules-Based Model (RBM)

Logistic Regression Model (LRM)

LRTI Prediction Based on Pathogen

Lung Microbiome Diversity Analysis

Host Gene Expression Analysis

Differential Expression Analysis

Host Gene Expression Classifier for LRTI Prediction

Classifier Combination

Identification and Mitigation of Environmental Contaminants

Data Availability

REFERENCES

Citation Manager Formats

Subject Area