Interactions between species introduce spurious associations in microbiome studies

Rajita Menon; Vivek Ramanan; Kirill S. Korolev

doi:10.1101/176677

Abstract

Microbiota contribute to many dimensions of host phenotype, including disease. To link specific microbes to specific phenotypes, microbiome-wide association studies compare microbial abundances between two groups of samples. Abundance differences, however, reflect not only direct associations with the phenotype, but also indirect effects due to microbial interactions. We found that microbial interactions could easily generate a large number of spurious associations that provide no mechanistic insight. Using techniques from statistical physics, we developed a method to remove indirect associations and applied it to the largest dataset on pediatric inflammatory bowel disease. Our method corrected the inflation of p-values in standard association tests and showed that only a small subset of associations is directly linked to the disease. Direct associations had a much higher accuracy in separating cases from controls and pointed to immunomodulation, butyrate production, and the brain-gut axis as important factors in the inflammatory bowel disease.

Introduction

Microbes are essential to any ecosystem be it the ocean or the human gut. The sheer impact of microbial processes has however been underappreciated until the advent of culture-independent methods to assess entire communities in situ. Metagenomics and 16S rRNA sequencing identified significant differences in microbiota among hosts, and experimental manipulations established that microbes could dramatically alter host phenotype [1–8]. Indeed, anxiety, obesity, colitis, and other phenotypes can be transmitted between hosts simply by transplanting their intestinal flora [9–13].

New tools and greater awareness of microbiota triggered a wave of association studies between microbiomes and host phenotypes. Microbiome wide association studies (MWAS) have been carried out for diabetes, arthritis, cancer, autism and many other disorders [14–23]. MWAS clearly established that each disease is associated with a distinct state of intestinal dysbiosis, but they of-ten produced conflicting results and identified a very large number of associations both within and across studies [14, 19, 21, 23–26]. For example, a recent study on inflammatory bowel disease (IBD) reported close to 100 taxa associated with IBD [25], a number that is fairly typical [14]. Such long lists of associations defy simple interpretation and complicate mechanistic follow-up studies because one needs to examine the role of almost every species in the microbiota. In fact, one can argue that MWAS are most useful when they can identify a small network of taxa driving the disease.

Although extensive dysbiosis might reflect the multifactorial nature of the disease, it is also possible that MWAS detect spurious associations because their statistical methods fail to account for some important aspects of microbiome dynamics. One such aspect is the pervasive nature of microbial interactions: species compete for similar resources, rely on cross-feeding for survival, and even produce their own antibiotics [27–37]. Hence, microbial abundances must be correlated with each other, and even a simple change in host phenotype could manifest as collective responses by the microbiota. Traditional MWAS, however, completely neglect this possibility because they treat each change in abundance as an independent manifestation of altered host phenotype. As a result, MWAS cannot distinguish taxa directly linked to disease from taxa that are affected only through their interactions with other species.

The main conclusion of this paper is that realistic microbial interactions produce a large number of spurious associations. Many of these indirect associations can be removed by a simple procedure based on maximum entropy models from statistical physics, which can separate host effects from the microbial interactions. We dubbed this approach Direct Association Analysis, or DAA for short.

When applied to the largest MWAS on IBD, DAA shows that many of the previously reported associations could be explained by interspecific interactions rather than the disease. At the genus and species level, the direct associations include only Roseburia, Faecalibacterium prausnitzii, Bifidobacterium adolescentis, Blautia producta, Turicibacter, Oscillospira, Eubacterium dolichum, Aggregatibacter segnis, and Sutterella. Some of these associations are well-known, while others have received little attention in IBD research. The phenotypes of the taxa directly to disease suggest that immunomodulation, butyrate production, and the brain-gut interactions play important role in the etiology of IBD.

Compared to traditional MWAS, DAA corrected the inflation of p-values responsible for the large number of spurious associations and identified taxa most informative of the diagnosis. We found that directly associated taxa are much better at discriminating between cases from controls than an equally-sized subset of indirect associations. In fact, direct associations have the same potential to discriminate between health and disease as the entire set of almost a hundred associations detected by a conventional method.

Results

Traditional MWAS detect species with significantly different abundances between case and control groups. Some changes in the abundances are directly associated with the disease while others are due to microbial interactions. The emergence of indirect changes in abundance is illustrated in Fig. 1A for a hypothetical network of five species. Only two species A and D are directly linked to the disease. However, strong interactions make the abundances of all five species differ between control and disease groups. For example, the mutualistic interaction between A and B helps B grow to a higher density following the increase in the abundance of A. The expansion of B in turn inhibits the growth of C and reduces its abundance in disease. Strong mutualistic, competitive, commensal, and parasitic interactions have been demonstrated in microbiota [27–37], and Fig. 1B shows that almost every species present in the human gut participates in a strong interaction. Thus, the propagation of abundance changes from directly-linked to other species could pose a significant challenge for MWAS. To test this hypothesis, we turned to a minimal mathematical model of microbiota composition.

Figure 1. Microbial interactions generate spurious associations.

(A) A hypothetical interaction network of five species together with their dynamics in disease. Only two species (shown in color) are directly linked to host phenotype. These directly-linked species inhibit or promote the growth of the other members of the community (shown with arrows). As a result, all five species have different abundances between case and control groups. (B) Microbial interactions are visualized via a hierarchically-clustered correlation matrix computed from the data in Ref. [21]. We used Pearson’s correlation coefficient between log-transformed abundances to quantify the strength of co-occurrence for each genus pair. Dark regions reflect strong interspecific interactions that could potentially generate spurious associations. See Tab. S1 for the list of 47 most prevalent genera included in the plot.

Maximum entropy model of microbiota composition

A quantitative description of interspecific interactions and their effect on MWAS requires a statistical model of host-associated microbial communities. Ideally, such a model would describe the probability to observe any microbial composition, but the amount of data even in large studies is only sufficient to determine the means and covariances of microbial abundances. This situation is common in the analysis of biological data and has been successfully managed with the use of maximum entropy distributions [38]. These distributions are chosen to be as random as possible under the constraints imposed by the first and second moments. Maximum entropy models introduce the least amount of bias and reflect the tendency of natural systems to maximize their entropy [39]. In other contexts, these models have successfully described the dynamics of neurons, forests, flocks, and even predicted protein structure and function [40–44]. In the context of microbiomes, a recent work derived a maximum entropy distribution for microbial abundances using the principle of maximum diversity [45].

We show in the Supplemental information (SI) that the maximum entropy distribution of microbial abundances P ({l_i}) takes the following form where l_i is the log-transformed abundance of species i, h_i represents the direct effect of the host phenotype on species i, and J_ij describes the interaction between species i and j; the factor of 1/Z is the normalization constant. The log-transformation of relative abundances alleviates two common difficulties with the analysis of the microbiome data. The first difficulty is the large subject-to-subject variation, which is much better captured by a log-normal rather than a Gaussian distribution; see Fig. S1, SI, and Ref. [25]. The second difficulty arises from the fact that the relative abundances must add up to one. This constraint is commonly known as the compositional bias because it leads to artifacts in the statistical analysis. The log-transformation is an essential step in most methods that account for the compositional bias [46–48], and, in the SI, we show that all of our conclusions are robust to the variation in the strength of the compositional bias.

Testing for spurious associations in synthetic data

We obtained realistic model parameters from one of the largest case-control studies previously reported in Ref. [21]. The samples were obtained from mucosal biopsies of 275 newly diagnosed, treatment-naive children with Crohn’s disease (a subtype of IBD) and 189 matched controls. Microbiota composition was determined by 16S rRNA sequencing with about 30,000 reads per sample. From this data, we inferred the interaction matrix J and the typical changes in microbial abundances associated with the disease for 47 most prevalent genera (Methods and SI). Even though the number of data points significantly exceeds the number of free parameters in the model, overfitting could still be a potential concern. Overfitting, however, is unlikely to affect our main conclusions because they depend only on the overall statistical properties of J rather than on the precise knowledge of every interaction. In fact, none of our results changed when we analyzed only about half of the data set (Fig. 2). To improve the quality and robustness of the inference procedure, we also used the spectral decomposition of J to remove any interaction patterns that were not strongly supported by the data; see Methods and SI for further details.

To determine the effect of microbial interactions on conventional MWAS analysis, we generated synthetic data with a known number of direct associations. The data for the control group was used without modification from Ref. [21]. The disease group was generated using Eq. (5) with the same values of h and J as in the control group, except we modified the values of h for 6 representative genera (Tab. S2). We also generated two other synthetic data sets with smaller and larger effect sizes (Tab. S2). The results for all three data sets were very similar (SI).

The synthetic data was further subsampled to several sample sizes in order to simulate variation in statistical power between different studies. For an ideal method, the number of detected associations should increase with the cohort size, but eventually saturate once all 6 directly associated genera are discovered. In contrast to this expectation, the number of associations detected by the conventional approach increased rapidly with the sample size until almost all genera were found to be statistically associated with the disease in our synthetic data. At this point, traditional MWAS completely lost the power to identify the link between the phenotype and microbiota. Unbounded growth in the number of detections was also observed for the real data (Fig. 2C) suggesting that many previously reported associations between microbiota and IBD could be indirect.

Are spurious associations simply an artifact of our ability to detect even minute differences between cases and controls? Fig. 2B and 2D show that this was not the case. The median effect size declined only moderately with the number of associations, and most associations corresponded to about a factor of two difference in the taxon abundance. Thus, spurious associations are not weak and could not be discarded based on their effect size.

Direct association analysis (DAA)

Fortunately, the maximum entropy model provides a straightforward way to separate direct from indirect associations. Since direct effects are encoded in h, MWAS should be performed on h rather than on l. This simple change in the statistical analysis correctly recovered 4 out of 6 directly associated taxa in the synthetic data and yielded no indirect associations even for large cohorts (Fig. 2A and S5). Similarly good performance was found for the two other synthetic data sets (Fig. S7). For the IBD data, DAA also identified a much smaller number of associations compared to traditional MWAS analysis and showed clear saturation at large sample sizes (Fig. 2B). Direct associations with IBD are summarized in Fig. 3 at the genus and species levels, and the entire phylogenetic tree of direct associations is shown in Fig. S2 and Tabs. S3 and S4.

To demonstrate that DAA isolates direct effects from collective changes in the microbiota, we examined the p-value distribution in this method. The distribution of p-values is commonly used as a diagnostic tool to test whether a statistical method is appropriate for the data. In the absence of any associations, p-values must follow a uniform distribution because the null hypothesis is true [54]. A few strong deviations from the uniform distribution signal true associations [55]. In contrast, large departures from the uniform distribution typically indicate that the statistical method does not account for some properties of the data, for example, population stratification in the context of genome wide association studies [56, 57]. Figure 4A compares the distribution of p-values for DAA and a conventional method in MWAS. Consistent with our hypothesis that interspecific interactions cannot be neglected, conventional analysis generates an excess of low p-values and, as a result, a large number of potentially indirect associations. In contrast, the distribution of p-values from DAA matches the expected uniform distribution and, thus, provides strong support for our method.

Finally, we show that indirect association excluded by DAA do not reduce the predictive power of microbiome data. Supervised machine learning such as random forest [58, 59], support vector machine [60], and sparse logistic regression [61–63] were used to classify samples as cases or controls based on their microbiota profile. We found good and identical performance of the classifiers trained either on all taxa detected by conventional MWAS or on a much smaller subset of direct associations detected by DAA (Fig. 4B). Moreover, the DAA-based classifier showed significantly better performance compared to a classifier trained on an equal number of randomly-selected indirect associations (Fig. 4B). Thus, DAA reduces the number of associations without losing any information on the disease status and selects taxa with the greatest potential to distinguish health from disease.

Figure 2. Signatures of indirect associations in synthetic and IBD data sets.

The synthetic data set was generated to match the statistical properties of the IBD data set from Ref. [21], but with a predefined number of 6 directly associated taxa. (A) In synthetic data, DAA identifies no spurious association and detects 4 out of 6 directly associated genera. All 6 genera and no false positives are detected when the sample size is increased further (Fig. S5). In sharp contrast, a large number of spurious associations is observed for metrics that rely on changes in abundance between cases and controls and do not correct for microbial interactions. The number of false positives grows rapidly with statistical power until all taxa are reported as significantly associated with the disease (Fig. S5). (B) All spurious associations show substantial differences between cases and controls and, therefore, cannot be discarded based on their effect sizes. To quantify the effect size, we estimated the magnitude of the fold change for each genus. Specifically, we first computed the difference in the mean log abundance between cases and controls and then exponentiated the absolute value of this difference. The plot shows how the median effect size for significantly associated genera depends on the sample size. Larger samples sizes result in much higher number of associations, but only a small drop in the typical effect size. (C) and (D) are the same as (A) and (B), but for the IBD data set. The results are consistent between the two data sets suggesting that most associations detected by traditional MWAS are spurious. The complete list of indirect associations inferred from the IBD data set is shown in Tab. S5 and the results for different synthetic data sets are shown in Fig. S7.

Figure 3. Network of direct associations with Crohn’s Disease.

Five species and four genera were found to be significantly associated with Crohn’s Disease (q < 0.05) after correcting for microbial interactions. The links correspond to significant interactions (q < 0.05) between the taxa with J_ij > 0.27 or J_ij < -0.15; the width of the arrows reflects the strength of the interactions. Note that DAA controls for the fact that two species could be correlated because both interact with a third species, but not with each other (Fig. 1A). Thus, the network shows only direct interactions between the taxa. Compared to the correlation matrix in Fig. 1B, the interaction network has both mutualistic and inhibitory links, which suggests that the microbial community might have several stable states corresponding to distinct modes of dysbiosis [30, 49–53]. For comparison, the correlation-based network for directly associated taxa is shown in Fig. S3. A complete summary of correlations and interactions for all species pairs is provided in Tab. S6.

Discussion

The primary goal of MWAS is to guide the study of disease etiology by detecting microbes that have a direct effect on the host. These direct effects could be very diverse and include secretion of toxins, production of nutrients, stimulation of the immune system, and changes in mucus and bile [64, 65]. In addition to the host-microbe interactions, the composition of microbiota is also influenced by the interspecific interactions among the microbes such as competition for resources, cross-feeding, and production of antibiotics. In the context of MWAS, microbial interactions contribute to indirect changes in microbial abundances, which are less informative of the disease mechanism and are less likely to be valuable for follow-up studies or in interventions. Here, we estimated the relative contribution of indirect associations to MWAS and showed how to isolate direct from indirect assoctiations.

Our main result is that interspecific interactions are sufficiently strong to generate detectable changes in the abundance of many microbes that are not directly linked to host phenotype. As a result, conventional approaches to MWAS detect a large number of spurious associations and produce inflated p-values that do not match their expected distribution (Fig. 4A). These challenges are resolved by Direct Association Analysis (DAA), which uses maximum entropy models to explicitly account for interspecific interactions. We applied DAA to a large data set of pediatric Crohn’s disease and found that it restores the distribution of p-values and substantially simplifies the pattern of dysbiosis while retaining full classification power of a conventional MWAS.

Figure 4. Direct associations analysis corrects p-value inflation and retains diagnostic accuracy.

A) The distribution of p-values in DAA closely follows the expected uniform distribution. Without correcting for microbial interactions, the same analysis yields an excess of low p-values, a signature of indirect associations. For both methods, p-values were computed using a permutation test. The expected uniform distribution was obtained by sampling from a generator of random numbers. The ranked plots of p-values visualize their cumulative distribution functions; this is a variant of a Q-Q plot. (B) Direct associations are a small subset of all associations with IBD, yet they retain full power in classifying samples as cases or controls. In contrast, the classification power is substantially reduced for an equally-sized subset of randomly-chosen indirect associations. In each case, we used sparse logistic regression to train a classifier on 80% of the data and tested its performance on the remaining 20% (Methods). The shaded regions show one standard deviation obtained by repeated partitioning the data into the training and validation sets. Identical results were obtained with a random forest [58, 59] and support vector machine [60] classifiers (Fig. S4).

The relatively simple dysbiosis identified by DAA in IBD has strong support in the literature and offers interesting insights into disease etiology. Four of the taxa identified by our method have a well-established role in IBD: B. adolescentis, F. prausnitzii, B. producta, and Roseburia. They have been repeatedly found to have lower abundance in both Crohn’s disease and ulcerative colitis [66–73], and several studies have demonstrated their ability to suppress inflammation and alleviate colitis [69, 74–78]. Bifidobacterium species occupy a low trophic level in the gut and ferment complex polysaccharides such as fiber [79, 80]. Fermentation products include lactic acid, which promotes barrier function, and maintains a healthy, slightly acidic environment in the colon [81]. Due to these properties Bifidobacterium species are commonly used as probiotics [79]. F. prausnitzii, Blautia producta and Roseburia occupy a higher trophic level and ferment the byproducts of polysaccharide digestion into short-chain fatty acids (SCFA), which are an important energy source for the host [68,69, 82, 83].

The ability of DAA to detect taxa strongly associated with IBD is reassuring, but not surprising. What is surprising is that many strong associations are classified as indirect by our method. For example, Roseburia and Blautia are the only genera of Lachnospiraceae that DAA finds to be directly linked to the disease. In sharp contrast, traditional MWAS report seven genera in this family that are strongly associated with IBD [25]. All seven genera are involved in SCFA metabolism, but their specializations differ. Species in Blautia genus are major producers of acetate, a SCFA that is commonly involved in microbial crossfeeding [84, 85]. In particular, many species extract energy from acetate by converting it into butyrate, another SCFA that plays a major role in gut health by nourishing colonocytes and regulating the immune function [82, 85]. Roseburia genus specializes almost exclusively in the production of butyrate and acts as a major source of butyrate for the host [82, 86]. Thus, our findings suggest that butyrate production plays an important role in IBD etiology and that the dysregulation of this process is directly linked to the depletion of Roseburia and possibly Blautia.

The important role of butyrate is further supported by our detection of E. dolichum and Oscillospira, which are known to produce butyrate [87–89]. The latter taxon has not been detected in three independent analyses of this IBD data set [21, 25, 90] presumably because its involvement was masked by indirect associations and interactions with other microbes. Indeed, several other studies found that Oscillospira is suppressed in IBD [91],[92]. Oscillospira was also found to be positively associated with leanness and negatively associated with the inflammatory liver disease [93–95]. The interactions between Oscillospira and the host appears to be quite complex and involve the consumption of host-derived glycoproteins including mucin, production of SCFA, and modulation of bile-acid metabolism [89, 96, 97]. The latter interaction was suggested to be a major factor in the protective role of Oscillospira against infections with Clostridium difficile [96, 98, 99].

The final taxon that was suppressed in IBD is Turicibacter. This genus is not very well characterized, and few MWAS studies point to its involvement in IBD [21, 25, 100]. Two studies in animal models, however, directly looked into the connection between IBD and Turicibacter [101, 102]. The first study found that iron limitation eliminates colitis in mice while at the same time restoring the abundance of Turicibacter, Bifidobacterium, and four other genera [101]. The second study study identified Turicibacter as the only genus that is fully correlated with immunological differences between mice resistant and susceptible to colitis: high abundance of Turicibacter in the colon predicted high levels of MZ B and iNK T cells, which are potent regulators of the immune response [102]. Moreover, Turicibacter was the only genus positively affected by the reduction in CD8⁺ T cells. Thus, our method identified a taxon that is potentially directly linked to IBD via the modulation of the immune system.

Perhaps the most unexpected finding was our detection of Aggregatibacter and Sutterella as the only genera increased in disease compared to 26 positive associations detected by the previous analysis [25]. All other associations were classified as indirect even though they often corresponded to much more significant changes in abundance between IBD and control groups. Thus, our results indicate that expansion of many taxa including opportunistic pathogens is driven by their interactions with the core IBD network shown in Fig. 3. One possibility is that the dysbiosis of the symbiotic microbiota makes it less competitive against other bacteria and opens up niches that can be colonized by opportunistic pathogens. The other, less explored possibility, is that commensal microbiota can not only protect from pathogens, but also facilitate their invasion, a phenomenon that has been recently demonstrated in bees [103].

Little is known about the specific roles that Aggregatibacter and Sutterella play in IBD, and more generally in gut health. Aggregatibacter is a common member of the oral microbiota that thrives in local infections such as periodontal disease and bacterial vaginosis [104–106]. The high abundance of Aggregatibacter is also associated with an increased risk of IBD recurrence [107]. Sutterella, on the other hand lacks overt pathogenicity, and MWAS produced inconsistent findings [108–114] on its involvement in IBD. Some studies reported that Sutterella is increased in patients with good outcomes [21, 111] while other studies found positive or no association between Sutterella and IBD [25, 109, 112–114]. Experimental investigations showed that Sutterella lacks many pathogenic properties; in particular, it does not induce a strong immune-response and has only moderate ability to adhere to mucus [113, 114]. Further, Sutterella strains from IBD and control patients showed no phenotypic differences in metabolomic, proteomic, and immune response assays [114]. Nevertheless, Sutterella is strongly associated with worse behavioral scores in children with autism spectrum disorder and Down syndrome [19, 20, 115]. Therefore, the direct link between Sutterella and IBD could involve the gut-brain axis.

In summary, we found a small number of taxa can explain extensive dysbiosis in IBD and accurately predict disease status. Directly associated taxa include strains with dramatically different abilities to trigger colitis and are specifically targeted by the immune system of patients and animals with IBD [12]. Previous studies of these taxa point to facilitated colonization by pathogens, butyrate production, immunomodulation, bile metabolism, and the gut-brain axis as the primary factors in the etiology of IBD.

Many disorders are accompanied by substantial changes in host microbiota, but our work shows that only a small subset of these changes could be directly related to the disease. Similarly, only a handful of taxa could drive the dynamics of ecosystem-level changes in the environment. To untangle the complexity of such dysbioses, it is important to account for microbial interactions using mechanistic or statistical methods. Direct association analysis is a simple statistical approach based on the principle of maximum entropy. It can be applied to any microbiome data set that is sufficiently large to infer interspecific interactions.

Methods

The data used in this study was obtained from Ref. [21], which reported changes in the microbiome of newly-diagnosed, treatment-naive children with IBD compared to controls. This data was recently analyzed in Ref. [25], and we followed all the statistical procedures adopted in that study to enable direct comparison of the results. Specifically, we used a permutation test on mean log-transformed abundances to determine the statistical significance of an association.

All computation was carried out in Python environment. We used scikit-learn 0.15.2 [116] for hierarchical clustering and to build the supervised classifiers used in Fig. 4B of the main text and Fig. S3. The variance in the accuracy of classification was evaluated through 5-fold stratified cross-validation with 100 random partitions of the data into the training and validation sets. For all findings, statistical significance was evaluated with Fisher’s exact test (permutation test) with 10⁶ permutations. False discovery rate was controlled to be below 5% following Benjamini-Hochberg method [54].

To fit the maximum entropy model to the data, we first computed the mean log-abundance for each genus m_i and the covariance in the log-transformed abundances C_ij. The interaction matrix was computed as J = C^-1 by performing singular value decomposition [117] and removing all singular values that were comparable to the amount of noise present in the data. The host effects were computed as h = J m. See Supplementary Methods for further details.

Acknowledgements

This work was supported by a grant from the Simons Foundation (#409704, Kirill S. Korolev) and by the startup fund from Boston University to Kirill S. Korolev. Vivek Ramanan was also supported by NSF grant DBI-1559829 through BRITE Bioinformatics REU Program at Boston University. Rajita Menon was partly supported by the Graduate Fellowship from the Rafik B. Hariri Institute for Computing and Computational Sciences & Engineering. Simulations were carried out on Shared Computing Cluster at Boston University.

References

[1].↵
Turnbaugh, P. J. et al. The human microbiome project: exploring the microbial part of ourselves in a changing world. Nature 449, 804 (2007).
OpenUrl CrossRef PubMed Web of Science
[2].
Yatsunenko, T. et al. Human gut microbiome viewed across age and geography. Nature 486, 222–227 (2012).
OpenUrl CrossRef PubMed Web of Science
[3].
Cho, I. & Blaser, M. J. The human microbiome: at the interface of health and disease. Nature Reviews Genetics 13, 260–270 (2012).
OpenUrl CrossRef PubMed
[4].
Ding, T. & Schloss, P. D. Dynamics and associations of microbial community types across the human body. Nature 509, 357–360 (2014).
OpenUrl CrossRef PubMed Web of Science
[5].
Gilbert, J. A., Jansson, J. K. & Knight, R. The earth microbiome project: successes and aspirations. BMC biology 12, 69 (2014).
OpenUrl
[6].
Bakken, J. S. et al. Treating clostridium difficile infection with fecal microbiota transplantation. Clinical Gastroenterology and Hepatology 9, 1044–1049 (2011).
OpenUrl
[7].
Suez, J. et al. Artificial sweeteners induce glucose intolerance by altering the gut microbiota. Nature 514, 181–186 (2014).
OpenUrl CrossRef PubMed Web of Science
[8].↵
Jumpertz, R. et al. Energy-balance studies reveal associations between gut microbes, caloric load, and nutrient absorption in humans. The American journal of clinical nutrition 94, 58–65 (2011).
OpenUrl Abstract/FREE Full Text
[9].↵
Turnbaugh, P. J. et al. An obesity-associated gut microbiome with increased capacity for energy harvest. nature 444, 1027–131 (2006).
OpenUrl CrossRef PubMed Web of Science
[10].
Messaoudi, M. et al. Assessment of psychotropic-like properties of a probiotic formulation (lactobacillus helveticus r0052 and bifidobacterium longum r0175) in rats and human subjects. British Journal of Nutrition 105, 755–764 (2011).
OpenUrl CrossRef PubMed
[11].
Cryan, J. F. & OMahony, S. The microbiome-gut-brain axis: from bowel to behavior. Neurogastroenterology & Motility 23, 187–192 (2011).
OpenUrl CrossRef PubMed Web of Science
[12].↵
Palm, N. W. et al. Immunoglobulin a coating identifies colitogenic bacteria in inflammatory bowel disease. Cell 158, 1000–1010 (2014).
OpenUrl CrossRef PubMed Web of Science
[13].↵
Sampson, T. R. et al. Gut microbiota regulate motor deficits and neuroinflammation in a model of parkinsons disease. Cell 167, 1469–1480 (2016).
OpenUrl CrossRef PubMed
[14].↵
Qin, J. et al. A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature 490, 55–60 (2012).
OpenUrl CrossRef PubMed Web of Science
[15].
Kostic, A. D. et al. The dynamics of the human infant gut microbiome in development and in progression toward type 1 diabetes. Cell host & microbe 17, 260–273 (2015).
OpenUrl CrossRef PubMed
[16].
Giongo, A. et al. Toward defining the autoimmune microbiome for type 1 diabetes. The ISME journal 5, 82–91 (2011).
OpenUrl
[17].
Brusca, S. B., Abramson, S. B. & Scher, J. U. Microbiome and mucosal inflammation as extra-articular triggers for rheumatoid arthritis and autoimmunity. Current opinion in rheumatology 26, 101 (2014).
OpenUrl CrossRef PubMed
[18].
Taneja, V. Arthritis susceptibility and the gut microbiome. FEBS letters 588, 4244–4249 (2014).
OpenUrl CrossRef PubMed
[19].↵
Williams, B. L., Hornig, M., Parekh, T. & Lipkin, W. I. Application of novel pcr-based methods for detection, quantitation, and phylogenetic characterization of sutterella species in intestinal biopsy samples from children with autism and gastrointestinal disturbances. MBio 3, e00261–11 (2012).
OpenUrl PubMed
[20].↵
Wang, L. et al. Increased abundance of sutterella spp. and ruminococcus torques in feces of children with autism spectrum disorder. Molecular autism 4, 1 (2013).
OpenUrl
[21].↵
Gevers, D. et al. The treatment-naive microbiome in new-onset crohns disease. Cell Host 'I' Microbe 15, 382–392 (2014).
OpenUrl
[22].
El Mouzan, M. et al. Fungal microbiota profile in newly diagnosed treatment-näive children with crohns disease. Journal of Crohn's and Colitis 1–7 (2017).
[23].↵
Gilbert, J. A. et al. Microbiome-wide association studies link dynamic microbial consortia to disease. Nature 535, 94–103 (2016).
OpenUrl CrossRef
[24].
Son, J. S. et al. Comparison of fecal microbiota in children with autism spectrum disorders and neurotypical siblings in the simons simplex collection. PloS ONE 10, e0137725 (2015).
OpenUrl CrossRef
[25].↵
Wang, F. et al. Detecting microbial dysbiosis associated with pediatric crohn disease despite the high variability of the gut microbiota. Cell Reports 14, 945–955 (2016).
OpenUrl
[26].↵
De Cruz, P. et al. Characterization of the gastrointestinal microbiota in health and inflammatory bowel disease. Infiammatory bowel diseases 18, 372–390 (2012).
OpenUrl
[27].↵
Coyte, K. Z., Schluter, J. & Foster, K. R. The ecology of the microbiome: networks, competition, and stability. Science 350, 663–666 (2015).
OpenUrl Abstract/FREE Full Text
[28].
Rakoff-Nahoum, S., Foster, K. R. & Comstock, L. E. The evolution of cooperation within the gut microbiota. Nature 533, 255–259 (2016).
OpenUrl CrossRef PubMed
[29].
Flint, H. J., Duncan, S. H., Scott, K. P. & Louis, P. Interactions and competition within the microbial community of the human colon: links between diet and health. Environmental microbiology 9, 1101–1111 (2007).
OpenUrl CrossRef PubMed Web of Science
[30].↵
Bashan, A. et al. Universality of human microbial dynamics. Nature 534, 259–262 (2016).
OpenUrl CrossRef
[31].
Faust, K. et al. Microbial co-occurrence relationships in the human microbiome. PLoS Comput Biol 8, e1002606 (2012).
OpenUrl CrossRef PubMed
[32].
Magnúsdóttir, S. et al. Generation of genome-scale metabolic reconstructions for 773 members of the human gut microbiota. Nature Biotechnology 35, 81–89 (2017).
OpenUrl CrossRef
[33].
Chu, J. et al. Discovery of mrsa active antibiotics using primary sequence from the human microbiome. Nature Chemical Biology 12, 1004–1006 (2016).
OpenUrl
[34].
Riley, M. A., Goldstone, C., Wertz, J. & Gordon, D. A phylogenetic approach to assessing the targets of microbial warfare. Journal of evolutionary biology 16, 690–697 (2003).
OpenUrl CrossRef PubMed
[35].
Czárán, T. L., Hoekstra, R. F. & Pagie, L. Chemical warfare between microbes promotes biodiversity. Proceedings of the National Academy of Sciences 99, 786–790 (2002).
OpenUrl Abstract/FREE Full Text
[36].
Dethlefsen, L., Eckburg, P. B., Bik, E. M. & Relman, D. A. Assembly of the human intestinal microbiota. Trends in ecology & evolution 21, 517–523 (2006).
OpenUrl CrossRef PubMed
[37].↵
Mackie, R. I. Gut environment and evolution of mutualistic fermentative digestion. In Gastrointestinal microbiology, 13–35 (Springer, 1997).
[38].↵
Stein, R. R., Marks, D. S. & Sander, C. Inferring pairwise interactions from biological data using maximum-entropy probability models. PLoS Comput Biol 11, e1004182 (2015).
OpenUrl
[39].↵
Plischke, M. & Bergersen, B. Equilibrium statistical physics (World Scientific Publishing Co Inc, 1994).
[40].↵
Schneidman, E., Berry, M. J., Segev, R. & Bialek, W. Weak pairwise correlations imply strongly correlated network states in a neural population. Nature 440, 1007–1012 (2006).
OpenUrl CrossRef PubMed Web of Science
[41].↵
Volkov, I., Banavar, J. R., Hubbell, S. P. & Maritan, A. Inferring species interactions in tropical forests. Proceedings of the National Academy of Sciences 106, 13854–13859 (2009).
OpenUrl Abstract/FREE Full Text
[42].↵
Mora, T. et al. Local equilibrium in bird flocks. Nature Physics 12, 1153–1157 (2016).
OpenUrl
[43].↵
Morcos, F. et al. Direct-coupling analysis of residue coevolution captures native contacts across many protein families. Proceedings of the National Academy of Sciences 108, E1293–E1301 (2011).
OpenUrl Abstract/FREE Full Text
[44].↵
Dahirel, V. et al. Coordinate linkage of hiv evolution reveals regions of immunological vulnerability. Proceedings of the National Academy of Sciences 108, 11530–11535 (2011)
OpenUrl Abstract/FREE Full Text
[45].↵
Fisher, C. K., Mora, T. & Walczak, A. M. Variable habitat conditions drive species covariation in the human microbiota. PLOS Computational Biology 13, e1005435 (2017).
OpenUrl
[46].↵
Friedman, J. & Alm, E. J. Inferring correlation networks from genomic survey data. PLoS computational biology 8, e1002687 (2012).
OpenUrl
[47].
Aitchison, J. The statistical analysis of compositional data (1986).
[48].↵
Pawlowsky-Glahn, V. & Buccianti, A. Compositional data analysis: Theory and applications (John Wiley & Sons, 2011).
[49].↵
Arumugam, M. et al. Enterotypes of the human gut microbiome. nature 473, 174–180 (2011).
OpenUrl CrossRef PubMed Web of Science
[50].
Weiser, M. et al. Molecular classi?cation of crohn's disease reveals two clinically relevant subtypes. Gut gutjnl–2016 (2016).
[51].
Cleynen, I. et al. Inherited determinants of crohn's disease and ulcerative colitis phenotypes: a genetic association study. The Lancet 387, 156–167 (2016).
OpenUrl
[52].
Jeraldo, P. et al. Quantification of the relative roles of niche and neutral processes in structuring gastrointestinal microbiomes. Proceedings of the National Academy of Sciences 109, 9692–9698 (2012).
OpenUrl Abstract/FREE Full Text
[53].↵
Pepper, J. W. & Rosenfeld, S. The emerging medical ecology of the human gut microbiome. Trends in ecology & evolution 27, 381–384 (2012).
OpenUrl
[54].↵
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the royal statistical society. Series B (Methodological) 289–300 (1995).
[55].↵
Storey, J. D. & Tibshirani, R. Statistical significance for genomewide studies. Proceedings of the National Academy of Sciences 100, 9440–9445 (2003).
OpenUrl Abstract/FREE Full Text
[56].↵
Power, R. A., Parkhill, J. & de Oliveira, T. Microbial genome-wide association studies: lessons from human gwas. Nature Reviews Genetics (2016).
[57].↵
Voorman, A., Lumley, T., McKnight, B. & Rice, K. Behavior of qq-plots and genomic control in studies of gene-environment interaction. PloS one 6, e19416 (2011).
OpenUrl CrossRef PubMed
[58].↵
Ho, T. K. Random decision forests. In Document Analysis and Recognition, 1995., Proceedings of the Third vol. 1, 278–282 (IEEE, 1995).
OpenUrl
[59].↵
Breiman, L. Random forests. Machine learning 45, 5–32 (2001).
OpenUrl CrossRef Web of Science
[60].↵
Cortes, C. & Vapnik, V. Support-vector networks. Machine learning 20, 273–297 (1995).
OpenUrl CrossRef Web of Science
[61].↵
Walker, S. H. & Duncan, D. B. Estimation of the probability of an event as a function of several independent variables. Biometrika 54, 167–179 (1967).
OpenUrl CrossRef PubMed Web of Science
[62].
Cox, D. R. The regression analysis of binary sequences. Journal of the Royal Statistical Society. Series B (Methodological) 215–242 (1958).
[63].↵
Tibshirani, R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological) 267–288 (1996).
[64].↵
Jostins, L. et al. Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease. Nature 491, 119–124 (2012).
OpenUrl CrossRef PubMed Web of Science
[65].↵
Xavier, R. & Podolsky, D. Unravelling the pathogenesis of inflammatory bowel disease. Nature 448, 427–434 (2007).
OpenUrl CrossRef PubMed Web of Science
[66].↵
Morgan, X. C. et al. Dysfunction of the intestinal microbiome in inflammatory bowel disease and treatment. Genome biology 13, 1 (2012).
OpenUrl CrossRef
[67].
Machiels, K. et al. A decrease of the butyrate-producing species roseburia hominis and faecalibacterium prausnitzii defines dysbiosis in patients with ulcerative colitis. Gut gutjnl–2013 (2013).
[68].↵
Morgan, X. C. et al. Dysfunction of the intestinal microbiome in inflammatory bowel disease and treatment. Genome biology 13, 1 (2012).
OpenUrl CrossRef
[69].↵
Travis, A. J., Kelly, D., Flint, H. J. & Aminov, R. I. Complete genome sequence of the human gut symbiont roseburia hominis. Genome announcements 3, e01286–15 (2015).
OpenUrl
[70].
Forbes, J. D., Van Domselaar, G. & Bernstein, C. N. The gut microbiota in immune-mediated inflammatory diseases. Frontiers in Microbiology 7, 1081 (2016).
OpenUrl
[71].
Joossens, M. et al. Dysbiosis of the faecal microbiota in patients with crohn's disease and their unaffected relatives. Gut 60, 631–637 (2011).
OpenUrl Abstract/FREE Full Text
[72].
Sokol, H. et al. Low counts of faecalibacterium prausnitzii in colitis microbiota. inflammatory bowel diseases 15, 1183–1189 (2009).
OpenUrl CrossRef PubMed Web of Science
[73].↵
Takahashi, K. et al. Reduced abundance of butyrate-producing bacteria species in the fecal microbial community in crohn's disease. Digestion 93, 59–65 (2016).
OpenUrl CrossRef
[74].↵
Sokol, H. et al. Faecalibacterium prausnitzii is an anti-inflammatory commensal bacterium identified by gut microbiota analysis of crohn disease patients. Proceedings of the National Academy of Sciences 105, 16731–16736 (2008).
OpenUrl Abstract/FREE Full Text
[75].
Zhang, M. et al. Faecalibacterium prausnitzii inhibits interleukin-17 to ameliorate colorectal colitis in rats. PloS one 9, e109146 (2014).
OpenUrl CrossRef PubMed
[76].
Qiu, X., Zhang, M., Yang, X., Hong, N. & Yu, C. Faecalibacterium prausnitzii upregulates regulatory t cells and anti-inflammatory cytokines in treating tnbs-induced colitis. Journal of Crohn's and Colitis 7, e558–e568 (2013).
OpenUrl
[77].
Forbes, J. D., Van Domselaar, G. & Bernstein, C. N. The gut microbiota in immune-mediated inflammatory diseases. Frontiers in Microbiology 7 (2016).
[78].↵
Scharek, L., Hartmann, L., Heinevetter, L. & Blaut, M. Bifidobacterium adolescentis modulates the specific immune response to another human gut bacterium, bacteroides thetaiotaomicron, in gnotobiotic rats. Immunobiology 202, 429–441 (2000).
OpenUrl PubMed
[79].↵
Oyetayo, V. O. & Oyetayo, F. L. Review-potential of probiotics as biotherapeutic agents targeting the innate immune system. African Journal of Biotechnology 4, 123–127 (2005).
OpenUrl
[80].↵
Duranti, S. et al. Evaluation of genetic diversity among strains of the human gut commensal bifidobacterium adolescentis. Scientific reports 6 (2016).
[81].↵
Sonomoto, K. & Yokota, A. Lactic acid bacteria and bifidobacteria: current progress in advanced research (Horizon Scientific Press, 2011).
[82].↵
Louis, P. & Flint, H. J. Formation of propionate and butyrate by the human colonic micro-biota. Environmental Microbiology 19, 29–41 (2016).
OpenUrl
[83].↵
Jeraldo, P. et al. Capturing one of the human gut microbiomes most wanted: Reconstructing the genome of a novel butyrate-producing, clostridial scavenger from metagenomic sequence data. Frontiers in Microbiology 7 (2016).
[84].↵
Carbonero, F., Benefiel, A. C. & Gaskins, H. R. Contributions of the microbial hydrogen economy to colonic homeostasis. Nature Reviews Gastroenterology and Hepatology 9, 504–518 (2012).
OpenUrl
[85].↵
Louis, P., Hold, G. L. & Flint, H. J. The gut microbiota, bacterial metabolites and colorectal cancer. Nature reviews. Microbiology 12, 661 (2014).
OpenUrl
[86].↵
Kettle, H., Louis, P., Holtrop, G., Duncan, S. H. & Flint, H. J. Modelling the emergent dynamics and major metabolites of the human colonic microbiota. Environmental microbiology 17, 1615–1630 (2015).
OpenUrl CrossRef
[87].↵
Eeckhaut, V. et al. Butyrate production in phylogenetically diverse firmicutes isolated from the chicken caecum. Microbial biotechnology 4, 503–512 (2011).
OpenUrl
[88].
Louis, P. & Flint, H. J. Diversity, metabolism and microbial ecology of butyrate-producing bacteria from the human large intestine. FEMS microbiology letters 294, 1–8 (2009).
OpenUrl CrossRef PubMed Web of Science
[89].↵
Gophna, U., Konikoff, T. & Nielsen, H. B. Oscillospira and related bacteria–from metagenomic species to metabolic features. Environmental microbiology 19, 835–841 (2017).
OpenUrl CrossRef
[90].↵
Haberman, Y. et al. Pediatric crohn disease patients exhibit specific ileal transcriptome and microbiome signature. The Journal of clinical investigation 124, 3617 (2014).
OpenUrl CrossRef PubMed
[91].↵
Kaakoush, N. O. et al. Microbial dysbiosis in pediatric patients with crohn's disease. Journal of clinical microbiology 50, 3258–3266 (2012).
OpenUrl Abstract/FREE Full Text
[92].↵
Walters, W. A., Xu, Z. & Knight, R. Meta-analyses of human gut microbes associated with obesity and ibd. FEBS letters 588, 4223–4233 (2014).
OpenUrl CrossRef PubMed Web of Science
[93].↵
Verdam, F. J. et al. Human intestinal microbiota composition is associated with local and systemic infiammation in obesity. Obesity 21 (2013).
[94].
Tims, S. et al. Microbiota conservation and bmi signatures in adult monozygotic twins. The ISME journal 7, 707 (2013).
OpenUrl
[95].↵
Zhu, L. et al. Characterization of gut microbiomes in nonalcoholic steatohepatitis (nash) patients: a connection between endogenous alcohol and nash. Hepatology 57, 601–609 (2013).
OpenUrl CrossRef PubMed Web of Science
[96].↵
Keren, N. et al. Interactions between the intestinal microbiota and bile acids in gallstones patients. Environmental microbiology reports 7, 874–880 (2015).
OpenUrl
[97].↵
Kohl, K. D., Amaya, J., Passement, C. A., Dearing, M. D. & McCue, M. D. Unique and shared responses of the gut microbiota to prolonged fasting: a comparative study across five classes of vertebrate hosts. FEMS microbiology ecology 90, 883–894 (2014).
OpenUrl CrossRef PubMed
[98].↵
Milani, C. et al. Gut microbiota composition and clostridium difficile infection in hospitalized elderly individuals: a metagenomic study. Scientific reports 6 (2016).
[99].↵
Gu, S. et al. Identification of key taxa that favor intestinal colonization of clostridium difficile in an adult chinese population. Microbes and infection 18, 30–38 (2016).
OpenUrl CrossRef
[100].↵
Minamoto, Y. et al. Alteration of the fecal microbiota and serum metabolite profiles in dogs with idiopathic inflammatory bowel disease. Gut microbes 6, 33–47 (2015).
OpenUrl CrossRef PubMed
[101].↵
Werner, T. et al. Depletion of luminal iron alters the gut microbiota and prevents crohn's disease-like ileitis. Gut gut–2010 (2010).
[102].↵
Presley, L. L., Wei, B., Braun, J. & Borneman, J. Bacteria associated with immunoregulatory cells in mice. Applied and environmental microbiology 76, 936–941 (2010).
OpenUrl Abstract/FREE Full Text
[103].↵
Schwarz, R. S., Moran, N. A. & Evans, J. D. Early gut colonizers shape parasite susceptibility and microbiota composition in honey bee workers. Proceedings of the National Academy of Sciences 113, 9345–9350 (2016).
OpenUrl Abstract/FREE Full Text
[104].↵
Raja, M. & Fajar Ummer, C. Aggregatibacter actinomycetemcomitans–a tooth killer? Journal of clinical and diagnostic research: JCDR 8, ZE13 (2014).
OpenUrl
[105].
Kamma, J., Nakou, M. & Manti, F. Predominant microflora of severe, moderate and minimal periodontal lesions in young adults with rapidly progressive periodontitis. Journal of periodontal research 30, 66–72 (1995).
OpenUrl CrossRef PubMed Web of Science
[106].↵
Cassini, M. et al. Periodontal bacteria in the genital tract: are they related to adverse pregnancy outcome? International journal of immunopathology and pharmacology 26, 931–939 (2013).
[107].↵
Sokol, H. et al. Fungal microbiota dysbiosis in ibd. Gut gutjnl–2015 (2016).
[108].↵
Lavelle, A. et al. Spatial variation of the colonic microbiota in patients with ulcerative colitis and control volunteers. Gut gutjnl–2014 (2015).
[109].↵
Mangin, I. et al. Molecular inventory of faecal microflora in patients with crohn's disease. FEMS microbiology ecology 50, 25–36 (2004).
OpenUrl CrossRef PubMed Web of Science
[110].
Gophna, U., Sommerfeld, K., Gophna, S., Doolittle, W. F. & van Zanten, S. J. V. Differences between tissue-associated intestinal microfloras of patients with crohn's disease and ulcerative colitis. Journal of clinical microbiology 44, 4136–4141 (2006).
OpenUrl Abstract/FREE Full Text
[111].↵
Tyler, A. D. et al. Characterization of the gut-associated microbiome in inflammatory pouch complications following ileal pouch-anal anastomosis. PloS one 8, e66934 (2013).
OpenUrl CrossRef PubMed
[112].↵
Hansen, R. et al. The microaerophilic microbiota of de-novo paediatric inflammatory bowel disease: the biscuit study. PLoS One 8, e58825 (2013).
OpenUrl CrossRef PubMed
[113].↵
Hiippala, K., Kainulainen, V., Kalliomäki, M., Arkkila, P. & Satokari, R. Mucosal prevalence and interactions with the epithelium indicate commensalism of sutterella spp. Frontiers in microbiology 7 (2016).
[114].↵
Mukhopadhya, I. et al. A comprehensive evaluation of colonic mucosal isolates of sutterella wadsworthensis from inflammatory bowel disease. PLoS One 6, e27076 (2011).
OpenUrl CrossRef PubMed
[115].↵
Biagi, E. et al. Gut microbiome in down syndrome. PLoS one 9, e112023 (2014).
OpenUrl CrossRef PubMed
[116].↵
Pedregosa, F. et al. Scikit-learn: Machine learning in python. Journal of Machine Learning Research 12, 2825–2830 (2011).
OpenUrl Web of Science
[117].↵
Stewart, G. W. On the early history of the singular value decomposition. SIAM review 35, 551–566 (1993).
OpenUrl
[118].↵
DeSantis, T. Z. et al. Greengenes, a chimera-checked 16s rrna gene database and workbench compatible with arb. Applied and environmental microbiology 72, 5069–5072 (2006).
OpenUrl Abstract/FREE Full Text
[119].↵
Caporaso, J. G. et al. Qiime allows analysis of high-throughput community sequencing data. Nature methods 7, 335–336 (2010).
OpenUrl CrossRef PubMed Web of Science

View the discussion thread.

Posted August 15, 2017.

Download PDF

Citation Tools

Subject Area

Microbiology

Subject Areas

All Articles

Animal Behavior and Cognition (5210)
Biochemistry (11740)
Bioengineering (8750)
Bioinformatics (29189)
Biophysics (14967)
Cancer Biology (12093)
Cell Biology (17410)
Clinical Trials (138)
Developmental Biology (9420)
Ecology (14178)
Epidemiology (2067)
Evolutionary Biology (18301)
Genetics (12239)
Genomics (16797)
Immunology (11865)
Microbiology (28070)
Molecular Biology (11583)
Neuroscience (60953)
Paleontology (451)
Pathology (1870)
Pharmacology and Toxicology (3238)
Physiology (4957)
Plant Biology (10425)
Scientific Communication and Education (1683)
Synthetic Biology (2884)
Systems Biology (7338)
Zoology (1651)

[1] [1].↵
Turnbaugh, P. J. et al. The human microbiome project: exploring the microbial part of ourselves in a changing world. Nature 449, 804 (2007).
OpenUrl CrossRef PubMed Web of Science

[2] [2].
Yatsunenko, T. et al. Human gut microbiome viewed across age and geography. Nature 486, 222–227 (2012).
OpenUrl CrossRef PubMed Web of Science

[3] [3].
Cho, I. & Blaser, M. J. The human microbiome: at the interface of health and disease. Nature Reviews Genetics 13, 260–270 (2012).
OpenUrl CrossRef PubMed

[4] [4].
Ding, T. & Schloss, P. D. Dynamics and associations of microbial community types across the human body. Nature 509, 357–360 (2014).
OpenUrl CrossRef PubMed Web of Science

[5] [5].
Gilbert, J. A., Jansson, J. K. & Knight, R. The earth microbiome project: successes and aspirations. BMC biology 12, 69 (2014).
OpenUrl

[6] [6].
Bakken, J. S. et al. Treating clostridium difficile infection with fecal microbiota transplantation. Clinical Gastroenterology and Hepatology 9, 1044–1049 (2011).
OpenUrl

[7] [7].
Suez, J. et al. Artificial sweeteners induce glucose intolerance by altering the gut microbiota. Nature 514, 181–186 (2014).
OpenUrl CrossRef PubMed Web of Science

[8] [8].↵
Jumpertz, R. et al. Energy-balance studies reveal associations between gut microbes, caloric load, and nutrient absorption in humans. The American journal of clinical nutrition 94, 58–65 (2011).
OpenUrl Abstract/FREE Full Text

[9] [9].↵
Turnbaugh, P. J. et al. An obesity-associated gut microbiome with increased capacity for energy harvest. nature 444, 1027–131 (2006).
OpenUrl CrossRef PubMed Web of Science

[10] [10].
Messaoudi, M. et al. Assessment of psychotropic-like properties of a probiotic formulation (lactobacillus helveticus r0052 and bifidobacterium longum r0175) in rats and human subjects. British Journal of Nutrition 105, 755–764 (2011).
OpenUrl CrossRef PubMed

[11] [11].
Cryan, J. F. & OMahony, S. The microbiome-gut-brain axis: from bowel to behavior. Neurogastroenterology & Motility 23, 187–192 (2011).
OpenUrl CrossRef PubMed Web of Science

[12] [12].↵
Palm, N. W. et al. Immunoglobulin a coating identifies colitogenic bacteria in inflammatory bowel disease. Cell 158, 1000–1010 (2014).
OpenUrl CrossRef PubMed Web of Science

[13] [13].↵
Sampson, T. R. et al. Gut microbiota regulate motor deficits and neuroinflammation in a model of parkinsons disease. Cell 167, 1469–1480 (2016).
OpenUrl CrossRef PubMed

[14] [14].↵
Qin, J. et al. A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature 490, 55–60 (2012).
OpenUrl CrossRef PubMed Web of Science

[15] [15].
Kostic, A. D. et al. The dynamics of the human infant gut microbiome in development and in progression toward type 1 diabetes. Cell host & microbe 17, 260–273 (2015).
OpenUrl CrossRef PubMed

[16] [16].
Giongo, A. et al. Toward defining the autoimmune microbiome for type 1 diabetes. The ISME journal 5, 82–91 (2011).
OpenUrl

[17] [17].
Brusca, S. B., Abramson, S. B. & Scher, J. U. Microbiome and mucosal inflammation as extra-articular triggers for rheumatoid arthritis and autoimmunity. Current opinion in rheumatology 26, 101 (2014).
OpenUrl CrossRef PubMed

[18] [18].
Taneja, V. Arthritis susceptibility and the gut microbiome. FEBS letters 588, 4244–4249 (2014).
OpenUrl CrossRef PubMed

[19] [19].↵
Williams, B. L., Hornig, M., Parekh, T. & Lipkin, W. I. Application of novel pcr-based methods for detection, quantitation, and phylogenetic characterization of sutterella species in intestinal biopsy samples from children with autism and gastrointestinal disturbances. MBio 3, e00261–11 (2012).
OpenUrl PubMed

[20] [20].↵
Wang, L. et al. Increased abundance of sutterella spp. and ruminococcus torques in feces of children with autism spectrum disorder. Molecular autism 4, 1 (2013).
OpenUrl

[21] [21].↵
Gevers, D. et al. The treatment-naive microbiome in new-onset crohns disease. Cell Host 'I' Microbe 15, 382–392 (2014).
OpenUrl

[22] [22].
El Mouzan, M. et al. Fungal microbiota profile in newly diagnosed treatment-näive children with crohns disease. Journal of Crohn's and Colitis 1–7 (2017).

[23] [23].↵
Gilbert, J. A. et al. Microbiome-wide association studies link dynamic microbial consortia to disease. Nature 535, 94–103 (2016).
OpenUrl CrossRef

[24] [24].
Son, J. S. et al. Comparison of fecal microbiota in children with autism spectrum disorders and neurotypical siblings in the simons simplex collection. PloS ONE 10, e0137725 (2015).
OpenUrl CrossRef

[25] [25].↵
Wang, F. et al. Detecting microbial dysbiosis associated with pediatric crohn disease despite the high variability of the gut microbiota. Cell Reports 14, 945–955 (2016).
OpenUrl

[26] [26].↵
De Cruz, P. et al. Characterization of the gastrointestinal microbiota in health and inflammatory bowel disease. Infiammatory bowel diseases 18, 372–390 (2012).
OpenUrl

[27] [27].↵
Coyte, K. Z., Schluter, J. & Foster, K. R. The ecology of the microbiome: networks, competition, and stability. Science 350, 663–666 (2015).
OpenUrl Abstract/FREE Full Text

[28] [28].
Rakoff-Nahoum, S., Foster, K. R. & Comstock, L. E. The evolution of cooperation within the gut microbiota. Nature 533, 255–259 (2016).
OpenUrl CrossRef PubMed

[29] [29].
Flint, H. J., Duncan, S. H., Scott, K. P. & Louis, P. Interactions and competition within the microbial community of the human colon: links between diet and health. Environmental microbiology 9, 1101–1111 (2007).
OpenUrl CrossRef PubMed Web of Science

[30] [30].↵
Bashan, A. et al. Universality of human microbial dynamics. Nature 534, 259–262 (2016).
OpenUrl CrossRef

[31] [31].
Faust, K. et al. Microbial co-occurrence relationships in the human microbiome. PLoS Comput Biol 8, e1002606 (2012).
OpenUrl CrossRef PubMed

[32] [32].
Magnúsdóttir, S. et al. Generation of genome-scale metabolic reconstructions for 773 members of the human gut microbiota. Nature Biotechnology 35, 81–89 (2017).
OpenUrl CrossRef

[33] [33].
Chu, J. et al. Discovery of mrsa active antibiotics using primary sequence from the human microbiome. Nature Chemical Biology 12, 1004–1006 (2016).
OpenUrl

[34] [34].
Riley, M. A., Goldstone, C., Wertz, J. & Gordon, D. A phylogenetic approach to assessing the targets of microbial warfare. Journal of evolutionary biology 16, 690–697 (2003).
OpenUrl CrossRef PubMed

[35] [35].
Czárán, T. L., Hoekstra, R. F. & Pagie, L. Chemical warfare between microbes promotes biodiversity. Proceedings of the National Academy of Sciences 99, 786–790 (2002).
OpenUrl Abstract/FREE Full Text

[36] [36].
Dethlefsen, L., Eckburg, P. B., Bik, E. M. & Relman, D. A. Assembly of the human intestinal microbiota. Trends in ecology & evolution 21, 517–523 (2006).
OpenUrl CrossRef PubMed

[37] [37].↵
Mackie, R. I. Gut environment and evolution of mutualistic fermentative digestion. In Gastrointestinal microbiology, 13–35 (Springer, 1997).

[38] [38].↵
Stein, R. R., Marks, D. S. & Sander, C. Inferring pairwise interactions from biological data using maximum-entropy probability models. PLoS Comput Biol 11, e1004182 (2015).
OpenUrl

[39] [39].↵
Plischke, M. & Bergersen, B. Equilibrium statistical physics (World Scientific Publishing Co Inc, 1994).

[40] [40].↵
Schneidman, E., Berry, M. J., Segev, R. & Bialek, W. Weak pairwise correlations imply strongly correlated network states in a neural population. Nature 440, 1007–1012 (2006).
OpenUrl CrossRef PubMed Web of Science

[41] [41].↵
Volkov, I., Banavar, J. R., Hubbell, S. P. & Maritan, A. Inferring species interactions in tropical forests. Proceedings of the National Academy of Sciences 106, 13854–13859 (2009).
OpenUrl Abstract/FREE Full Text

[42] [42].↵
Mora, T. et al. Local equilibrium in bird flocks. Nature Physics 12, 1153–1157 (2016).
OpenUrl

[43] [43].↵
Morcos, F. et al. Direct-coupling analysis of residue coevolution captures native contacts across many protein families. Proceedings of the National Academy of Sciences 108, E1293–E1301 (2011).
OpenUrl Abstract/FREE Full Text

[44] [44].↵
Dahirel, V. et al. Coordinate linkage of hiv evolution reveals regions of immunological vulnerability. Proceedings of the National Academy of Sciences 108, 11530–11535 (2011)
OpenUrl Abstract/FREE Full Text

[45] [45].↵
Fisher, C. K., Mora, T. & Walczak, A. M. Variable habitat conditions drive species covariation in the human microbiota. PLOS Computational Biology 13, e1005435 (2017).
OpenUrl

[46] [46].↵
Friedman, J. & Alm, E. J. Inferring correlation networks from genomic survey data. PLoS computational biology 8, e1002687 (2012).
OpenUrl

[47] [47].
Aitchison, J. The statistical analysis of compositional data (1986).

[48] [48].↵
Pawlowsky-Glahn, V. & Buccianti, A. Compositional data analysis: Theory and applications (John Wiley & Sons, 2011).

[49] [49].↵
Arumugam, M. et al. Enterotypes of the human gut microbiome. nature 473, 174–180 (2011).
OpenUrl CrossRef PubMed Web of Science

[50] [50].
Weiser, M. et al. Molecular classi?cation of crohn's disease reveals two clinically relevant subtypes. Gut gutjnl–2016 (2016).

[51] [51].
Cleynen, I. et al. Inherited determinants of crohn's disease and ulcerative colitis phenotypes: a genetic association study. The Lancet 387, 156–167 (2016).
OpenUrl

[52] [52].
Jeraldo, P. et al. Quantification of the relative roles of niche and neutral processes in structuring gastrointestinal microbiomes. Proceedings of the National Academy of Sciences 109, 9692–9698 (2012).
OpenUrl Abstract/FREE Full Text

[53] [53].↵
Pepper, J. W. & Rosenfeld, S. The emerging medical ecology of the human gut microbiome. Trends in ecology & evolution 27, 381–384 (2012).
OpenUrl

[54] [54].↵
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the royal statistical society. Series B (Methodological) 289–300 (1995).

[55] [55].↵
Storey, J. D. & Tibshirani, R. Statistical significance for genomewide studies. Proceedings of the National Academy of Sciences 100, 9440–9445 (2003).
OpenUrl Abstract/FREE Full Text

[56] [56].↵
Power, R. A., Parkhill, J. & de Oliveira, T. Microbial genome-wide association studies: lessons from human gwas. Nature Reviews Genetics (2016).

[57] [57].↵
Voorman, A., Lumley, T., McKnight, B. & Rice, K. Behavior of qq-plots and genomic control in studies of gene-environment interaction. PloS one 6, e19416 (2011).
OpenUrl CrossRef PubMed

[58] [58].↵
Ho, T. K. Random decision forests. In Document Analysis and Recognition, 1995., Proceedings of the Third vol. 1, 278–282 (IEEE, 1995).
OpenUrl

[59] [59].↵
Breiman, L. Random forests. Machine learning 45, 5–32 (2001).
OpenUrl CrossRef Web of Science

[60] [60].↵
Cortes, C. & Vapnik, V. Support-vector networks. Machine learning 20, 273–297 (1995).
OpenUrl CrossRef Web of Science

[61] [61].↵
Walker, S. H. & Duncan, D. B. Estimation of the probability of an event as a function of several independent variables. Biometrika 54, 167–179 (1967).
OpenUrl CrossRef PubMed Web of Science

[62] [62].
Cox, D. R. The regression analysis of binary sequences. Journal of the Royal Statistical Society. Series B (Methodological) 215–242 (1958).

[63] [63].↵
Tibshirani, R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological) 267–288 (1996).

[64] [64].↵
Jostins, L. et al. Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease. Nature 491, 119–124 (2012).
OpenUrl CrossRef PubMed Web of Science

[65] [65].↵
Xavier, R. & Podolsky, D. Unravelling the pathogenesis of inflammatory bowel disease. Nature 448, 427–434 (2007).
OpenUrl CrossRef PubMed Web of Science

[66] [66].↵
Morgan, X. C. et al. Dysfunction of the intestinal microbiome in inflammatory bowel disease and treatment. Genome biology 13, 1 (2012).
OpenUrl CrossRef

[67] [67].
Machiels, K. et al. A decrease of the butyrate-producing species roseburia hominis and faecalibacterium prausnitzii defines dysbiosis in patients with ulcerative colitis. Gut gutjnl–2013 (2013).

[68] [68].↵
Morgan, X. C. et al. Dysfunction of the intestinal microbiome in inflammatory bowel disease and treatment. Genome biology 13, 1 (2012).
OpenUrl CrossRef

[69] [69].↵
Travis, A. J., Kelly, D., Flint, H. J. & Aminov, R. I. Complete genome sequence of the human gut symbiont roseburia hominis. Genome announcements 3, e01286–15 (2015).
OpenUrl

[70] [70].
Forbes, J. D., Van Domselaar, G. & Bernstein, C. N. The gut microbiota in immune-mediated inflammatory diseases. Frontiers in Microbiology 7, 1081 (2016).
OpenUrl

[71] [71].
Joossens, M. et al. Dysbiosis of the faecal microbiota in patients with crohn's disease and their unaffected relatives. Gut 60, 631–637 (2011).
OpenUrl Abstract/FREE Full Text

[72] [72].
Sokol, H. et al. Low counts of faecalibacterium prausnitzii in colitis microbiota. inflammatory bowel diseases 15, 1183–1189 (2009).
OpenUrl CrossRef PubMed Web of Science

[73] [73].↵
Takahashi, K. et al. Reduced abundance of butyrate-producing bacteria species in the fecal microbial community in crohn's disease. Digestion 93, 59–65 (2016).
OpenUrl CrossRef

[74] [74].↵
Sokol, H. et al. Faecalibacterium prausnitzii is an anti-inflammatory commensal bacterium identified by gut microbiota analysis of crohn disease patients. Proceedings of the National Academy of Sciences 105, 16731–16736 (2008).
OpenUrl Abstract/FREE Full Text

[75] [75].
Zhang, M. et al. Faecalibacterium prausnitzii inhibits interleukin-17 to ameliorate colorectal colitis in rats. PloS one 9, e109146 (2014).
OpenUrl CrossRef PubMed

[76] [76].
Qiu, X., Zhang, M., Yang, X., Hong, N. & Yu, C. Faecalibacterium prausnitzii upregulates regulatory t cells and anti-inflammatory cytokines in treating tnbs-induced colitis. Journal of Crohn's and Colitis 7, e558–e568 (2013).
OpenUrl

[77] [77].
Forbes, J. D., Van Domselaar, G. & Bernstein, C. N. The gut microbiota in immune-mediated inflammatory diseases. Frontiers in Microbiology 7 (2016).

[78] [78].↵
Scharek, L., Hartmann, L., Heinevetter, L. & Blaut, M. Bifidobacterium adolescentis modulates the specific immune response to another human gut bacterium, bacteroides thetaiotaomicron, in gnotobiotic rats. Immunobiology 202, 429–441 (2000).
OpenUrl PubMed

[79] [79].↵
Oyetayo, V. O. & Oyetayo, F. L. Review-potential of probiotics as biotherapeutic agents targeting the innate immune system. African Journal of Biotechnology 4, 123–127 (2005).
OpenUrl

[80] [80].↵
Duranti, S. et al. Evaluation of genetic diversity among strains of the human gut commensal bifidobacterium adolescentis. Scientific reports 6 (2016).

[81] [81].↵
Sonomoto, K. & Yokota, A. Lactic acid bacteria and bifidobacteria: current progress in advanced research (Horizon Scientific Press, 2011).

[82] [82].↵
Louis, P. & Flint, H. J. Formation of propionate and butyrate by the human colonic micro-biota. Environmental Microbiology 19, 29–41 (2016).
OpenUrl

[83] [83].↵
Jeraldo, P. et al. Capturing one of the human gut microbiomes most wanted: Reconstructing the genome of a novel butyrate-producing, clostridial scavenger from metagenomic sequence data. Frontiers in Microbiology 7 (2016).

[84] [84].↵
Carbonero, F., Benefiel, A. C. & Gaskins, H. R. Contributions of the microbial hydrogen economy to colonic homeostasis. Nature Reviews Gastroenterology and Hepatology 9, 504–518 (2012).
OpenUrl

[85] [85].↵
Louis, P., Hold, G. L. & Flint, H. J. The gut microbiota, bacterial metabolites and colorectal cancer. Nature reviews. Microbiology 12, 661 (2014).
OpenUrl

[86] [86].↵
Kettle, H., Louis, P., Holtrop, G., Duncan, S. H. & Flint, H. J. Modelling the emergent dynamics and major metabolites of the human colonic microbiota. Environmental microbiology 17, 1615–1630 (2015).
OpenUrl CrossRef

[87] [87].↵
Eeckhaut, V. et al. Butyrate production in phylogenetically diverse firmicutes isolated from the chicken caecum. Microbial biotechnology 4, 503–512 (2011).
OpenUrl

[88] [88].
Louis, P. & Flint, H. J. Diversity, metabolism and microbial ecology of butyrate-producing bacteria from the human large intestine. FEMS microbiology letters 294, 1–8 (2009).
OpenUrl CrossRef PubMed Web of Science

[89] [89].↵
Gophna, U., Konikoff, T. & Nielsen, H. B. Oscillospira and related bacteria–from metagenomic species to metabolic features. Environmental microbiology 19, 835–841 (2017).
OpenUrl CrossRef

[90] [90].↵
Haberman, Y. et al. Pediatric crohn disease patients exhibit specific ileal transcriptome and microbiome signature. The Journal of clinical investigation 124, 3617 (2014).
OpenUrl CrossRef PubMed

[91] [91].↵
Kaakoush, N. O. et al. Microbial dysbiosis in pediatric patients with crohn's disease. Journal of clinical microbiology 50, 3258–3266 (2012).
OpenUrl Abstract/FREE Full Text

[92] [92].↵
Walters, W. A., Xu, Z. & Knight, R. Meta-analyses of human gut microbes associated with obesity and ibd. FEBS letters 588, 4223–4233 (2014).
OpenUrl CrossRef PubMed Web of Science

[93] [93].↵
Verdam, F. J. et al. Human intestinal microbiota composition is associated with local and systemic infiammation in obesity. Obesity 21 (2013).

[94] [94].
Tims, S. et al. Microbiota conservation and bmi signatures in adult monozygotic twins. The ISME journal 7, 707 (2013).
OpenUrl

[95] [95].↵
Zhu, L. et al. Characterization of gut microbiomes in nonalcoholic steatohepatitis (nash) patients: a connection between endogenous alcohol and nash. Hepatology 57, 601–609 (2013).
OpenUrl CrossRef PubMed Web of Science

[96] [96].↵
Keren, N. et al. Interactions between the intestinal microbiota and bile acids in gallstones patients. Environmental microbiology reports 7, 874–880 (2015).
OpenUrl

[97] [97].↵
Kohl, K. D., Amaya, J., Passement, C. A., Dearing, M. D. & McCue, M. D. Unique and shared responses of the gut microbiota to prolonged fasting: a comparative study across five classes of vertebrate hosts. FEMS microbiology ecology 90, 883–894 (2014).
OpenUrl CrossRef PubMed

[98] [98].↵
Milani, C. et al. Gut microbiota composition and clostridium difficile infection in hospitalized elderly individuals: a metagenomic study. Scientific reports 6 (2016).

[99] [99].↵
Gu, S. et al. Identification of key taxa that favor intestinal colonization of clostridium difficile in an adult chinese population. Microbes and infection 18, 30–38 (2016).
OpenUrl CrossRef

[100] [100].↵
Minamoto, Y. et al. Alteration of the fecal microbiota and serum metabolite profiles in dogs with idiopathic inflammatory bowel disease. Gut microbes 6, 33–47 (2015).
OpenUrl CrossRef PubMed

[101] [101].↵
Werner, T. et al. Depletion of luminal iron alters the gut microbiota and prevents crohn's disease-like ileitis. Gut gut–2010 (2010).

[102] [102].↵
Presley, L. L., Wei, B., Braun, J. & Borneman, J. Bacteria associated with immunoregulatory cells in mice. Applied and environmental microbiology 76, 936–941 (2010).
OpenUrl Abstract/FREE Full Text

[103] [103].↵
Schwarz, R. S., Moran, N. A. & Evans, J. D. Early gut colonizers shape parasite susceptibility and microbiota composition in honey bee workers. Proceedings of the National Academy of Sciences 113, 9345–9350 (2016).
OpenUrl Abstract/FREE Full Text

[104] [104].↵
Raja, M. & Fajar Ummer, C. Aggregatibacter actinomycetemcomitans–a tooth killer? Journal of clinical and diagnostic research: JCDR 8, ZE13 (2014).
OpenUrl

[105] [105].
Kamma, J., Nakou, M. & Manti, F. Predominant microflora of severe, moderate and minimal periodontal lesions in young adults with rapidly progressive periodontitis. Journal of periodontal research 30, 66–72 (1995).
OpenUrl CrossRef PubMed Web of Science

[106] [106].↵
Cassini, M. et al. Periodontal bacteria in the genital tract: are they related to adverse pregnancy outcome? International journal of immunopathology and pharmacology 26, 931–939 (2013).

[107] [107].↵
Sokol, H. et al. Fungal microbiota dysbiosis in ibd. Gut gutjnl–2015 (2016).

[108] [108].↵
Lavelle, A. et al. Spatial variation of the colonic microbiota in patients with ulcerative colitis and control volunteers. Gut gutjnl–2014 (2015).

[109] [109].↵
Mangin, I. et al. Molecular inventory of faecal microflora in patients with crohn's disease. FEMS microbiology ecology 50, 25–36 (2004).
OpenUrl CrossRef PubMed Web of Science

[110] [110].
Gophna, U., Sommerfeld, K., Gophna, S., Doolittle, W. F. & van Zanten, S. J. V. Differences between tissue-associated intestinal microfloras of patients with crohn's disease and ulcerative colitis. Journal of clinical microbiology 44, 4136–4141 (2006).
OpenUrl Abstract/FREE Full Text

[111] [111].↵
Tyler, A. D. et al. Characterization of the gut-associated microbiome in inflammatory pouch complications following ileal pouch-anal anastomosis. PloS one 8, e66934 (2013).
OpenUrl CrossRef PubMed

[112] [112].↵
Hansen, R. et al. The microaerophilic microbiota of de-novo paediatric inflammatory bowel disease: the biscuit study. PLoS One 8, e58825 (2013).
OpenUrl CrossRef PubMed

[113] [113].↵
Hiippala, K., Kainulainen, V., Kalliomäki, M., Arkkila, P. & Satokari, R. Mucosal prevalence and interactions with the epithelium indicate commensalism of sutterella spp. Frontiers in microbiology 7 (2016).

[114] [114].↵
Mukhopadhya, I. et al. A comprehensive evaluation of colonic mucosal isolates of sutterella wadsworthensis from inflammatory bowel disease. PLoS One 6, e27076 (2011).
OpenUrl CrossRef PubMed

[115] [115].↵
Biagi, E. et al. Gut microbiome in down syndrome. PLoS one 9, e112023 (2014).
OpenUrl CrossRef PubMed

[116] [116].↵
Pedregosa, F. et al. Scikit-learn: Machine learning in python. Journal of Machine Learning Research 12, 2825–2830 (2011).
OpenUrl Web of Science

[117] [117].↵
Stewart, G. W. On the early history of the singular value decomposition. SIAM review 35, 551–566 (1993).
OpenUrl

[118] [118].↵
DeSantis, T. Z. et al. Greengenes, a chimera-checked 16s rrna gene database and workbench compatible with arb. Applied and environmental microbiology 72, 5069–5072 (2006).
OpenUrl Abstract/FREE Full Text

[119] [119].↵
Caporaso, J. G. et al. Qiime allows analysis of high-throughput community sequencing data. Nature methods 7, 335–336 (2010).
OpenUrl CrossRef PubMed Web of Science