Crowdsourced study of children with autism and their typically developing siblings identifies differences in taxonomic and predicted function for stool-associated microbes using exact sequence variant analysis

Maude M David; Christine Tataru; Jena Daniels; Jessey Schwartz; Jessica Keating; Jarrad Hampton-Marcell; Neil Gottel; Jack A. Gilbert; Dennis P. Wall

doi:10.1101/319236

SHORT ABSTRACT

Autism Spectrum Disorder (ASD) affects 1 in 59 children in the United States and is impacted by both genetic and environmental factors, including the gut microbiome. To investigate the link between microbiome functionality and ASD, this study analyzes behavioral data (with home video and questionnaires) and 16S-amplicons crowd-sourced from age-matched sibling pairs (2-7 yo) where one sibling has an ASD diagnosis and the other does not. We identified 21 exact sequence variants (ESVs) that are significantly differentially abundant between the two cohorts. ESVs from the families Ruminococcaceae and Bacteroidaceae were found preferentially in the ASD cohort, while ESVs from the genera Bifidobacterium, Porphyromonas, Slackia, Desulfovibrio, species Acinetobacter johnsonii, and the Lachnospiraceae family, were specific to the neurotypical cohort. Predicting KEGG Orthologs from these ESVs, we found both cohorts harbor butyragenic pathways, but the ESVs enriched in the ASD cohort can use 4-aminobutanoate as precursor, potentially impacting the availability of neurotransmitters.

Background The existence of a link between the gut microbiome and autism spectrum disorder (ASD) is well established in mice, but in human populations efforts to identify microbial biomarkers have been limited due to problems stratifying participants within the broad phenotype of ASD and a lack of appropriately matched controls. To overcome these limitations and investigate the relationship between ASD and the gut microbiome, we ran a crowdsourced study of families 2-7 year old sibling pairs, where one child of the pair had a diagnosis of ASD and the other child did not.

Methods Parents of age-matched sibling pairs electronically consented and completed study procedures via a secure web portal (microbiome.stanford.edu). Parents collected stool samples from each child, responded to behavioral questionnaires about the ASD child’s typical behavior, and whenever possible provided a home video of their ASD child’s natural social behavior. We performed DNA extraction and 16S rRNA amplicon sequencing on 117 stool samples (60 ASD and 57 NT) that met all study design eligibility criteria, and filtered sequences that did not pass quality control tests. Using DADA2, Exact Sequence Variants (ESVs) were identified as taxonomic units, and three statistical tests were performed on ESV abundance counts: (1) permutation test to determine differences between sibling pairs, (2) differential abundance test using a zero-inflated gaussian mixture model to account for the sparse abundance matrix, and (3) differential abundance test after modeling under a negative binomial distribution. The potential functional gene abundance for each sample was also inferred from the 16S rRNA data, providing KEGG Ortholog (KO) proportions, which were analyzed for differential abundance using an enrichment algorithm.

Results In total, 21 ESVs had significantly differentially proportions in stool of children with ASD and their neurotypical siblings. Of these 21 ESVs, 11 were enriched in neurotypical children and ten were enriched in children with ASD. ESVs enriched in the ASD cohort were predominantly associated with Ruminococcaceae and Bacteroidaceae; while those enriched in controls were more diverse including taxa associated with Bifidobacterium, Porphyromonas, Slackia, Desulfovibrio, Acinetobacter johnsonii, and Lachnospiraceae. Exact Variant Analysis suggested that Lachnospiraceae was specific to the control cohort, while Ruminococcaceae, Tissierellaceae and Bacteroidaceae were significantly enriched in children with ASD. Metabolic gene predictions determined that while both cohorts harbor the butyrogenic pathway, the ASD cohort was more likely to use the 4-aminobutanoate (4Ab) pathway, while the control cohort was more likely to use the pyruvate pathway. The 4Ab pathway releases harmful by-products like ammonia and can shunt glutamate, affecting its availability as an excitatory neurotransmitter. Finally, we observed differences in the carbohydrate uptake capabilities of various ESVs identified between the two cohorts.

Conclusions Using crowdsourcing methods, we successfully recruited a sufficient sample of sibling pairs close in age while limiting environmental confounding factors that can impact microbiotic composition of the gut. Our findings support previously identified taxa that link the gut microbiota to children with ASD and also identifies several novel ESVs specific to each cohort, including clades within the Clostridiales order. Predicted functional metabolic differences suggest a potential impact on microbial neurotransmitter production. In addition to furthering our understanding of the underlying mechanisms of the associations between gut microbiome compositions and ASD, we have potentially identified novel ASD biomarkers that may help with future therapeutic interventions.

INTRODUCTION

Autism spectrum disorder (ASD) is a heterogeneous developmental disorder affecting social and behavioral functioning in 1 out of 59 children in the United States (Baio et al., 2018). Recent studies have identified several environmental factors associated with ASD etiology and susceptibility, including prenatal infection (Hall et al., 2012), zinc deficiency (Lakshmi Priya and Geetha, 2011), maternal diabetes (Gardener et al., 2009), toxins and pesticides (Moore et al., 2000), and advanced parental age (Parner et al., 2012). Individuals with ASD have demonstrated a high prevalence of gastrointestinal (GI) and immunologic abnormalities pertaining to GI motility and intestinal permeability (Boukthir et al., 2010; de Magistris et al., 2010). Additionally, ASD-typified behavioral traits are more severe in children with both ASD and GI disturbances (Adams et al., 2011). These factors have also been shown to influence or be influenced by the intestinal microbiome (Spor et al., 2011), which could suggest a role for the intestinal microbiota in mediating ASD and ASD-typified behavioral traits.

The legitimacy of the proposed microbiome-ASD connection is supported by recent research on ASD phenotype mouse models and the microbiota compositions of human individuals with ASD (Finegold et al., 2002; Song et al., 2004; Desbonnet et al., 2008; Bercik et al., 2011; Bravo et al., 2011; Messaoudi et al., 2011; Hsiao et al., 2013; Kang et al., 2013). Hsiao et al. (2013) found that administrating Bacteroides fragilis to ASD mouse models improved ASD-typified behavioral traits by reducing anxiety, restoring communicative behaviors, and improving sensorimotor gating (Hsiao et al., 2013). Bacterial taxa, such as members of Lactobacillus and a genus of Bifidobacterium, have demonstrated microbially-induced behavioral modulation in both rats and humans (Desbonnet et al., 2008; Bercik et al., 2011; Bravo et al., 2011; Messaoudi et al., 2011; Kang et al., 2013). Moreover, several studies have identified microbial trends amongst the ASD population such as an increased abundance of Clostridium (Finegold et al., 2002; Song et al., 2004). More recently, a study involving Fecal Microbiome Transplant between neurotypical controls and children with ASD demonstrated a significant improvement in both GI and neurobehavioral symptoms following the treatment (Kang et al., 2017a). This study particularly demonstrates a potential causative relationship between the gut microbiome and ASD symptoms. While the data on the microbiome-ASD symptom link is compelling, studies attempting to identify the specific microbes responsible have maintained small sample sizes, single time point sampling, limited phenotype scoring and sampling, and a lack of bacterial phylogenetic resolution, factors that may impact the reproducibility of the results.

The present study aims to determine the specific intestinal microbiota that associate with behavioral traits in children with ASD. We recruited families with age-matched neurotypical and ASD siblings via crowdsourcing to reach a sufficient sample size. (Krippendorff, 2004; Behrend et al., 2011; Swan, 2012; Comber et al., 2013; Jeong et al., 2013; Hong et al., 2015; David et al., 2016). Recruited families had a child clinically diagnosed with ASD and a neurotypical sibling who were both between the ages of 2-7 and no more than 2 years apart in age. The crowdsourcing recruitment methodology enabled us to recruit a large cohort of families disbursed across the United States. Each family completed behavioral and dietary questionnaires online and collected a stool sample from each child at home via sampling kits shipped to each family by the research team. This approach facilitated the collection of diverse and pertinent metadata regarding allergies, diet, supplement usage, gastrointestinal abnormalities, gestational age, and antibiotic and probiotic treatment (Swan, 2012; Jeong et al., 2013; Hong et al., 2015; Walters et al., 2015; David et al., 2016). We confirmed the self-reported autism diagnosis of each child by leveraging validated machine-learning classification tools that assess ASD-typified features obtained from parent reports and home video showcasing social interactions (Wall et al., 2012; Duda et al., 2014; Fusaro et al., 2014; Kosmicki et al., 2015; Duda et al., 2016; Levy et al., 2017).

MATERIALS AND METHODS

Crowdsourcing Recruitment and Data Collection

Data were collected from March 2015 to September 2017 under an approved Stanford University Institutional Review Board protocol (eProtocol 30205). To target and inform the autism community of the study, we crowdsourced study subject recruitment via popular social media networking platforms including Twitter, Facebook, autism-focused Yahoo Groups, and a press released article from National Public Radio. In addition, we engaged with non-profit and for-profit companies who informed their community of the study via their social media platforms and email lists.

Parents of eligible participants completed the online component of study procedures via a secure, HIPAA-compliant web-based platform (https://microbiome.stanford.edu) where they provided electronic consent, responded to behavioral, demographic, and dietary surveys on behalf of their children, and uploaded a home video of the child with autism. Metadata were collected using RedCap (Harris et al., 2009; Lowe et al., 2009).

Sampling Kits

After they completed the online surveys, research staff mailed sampling kits to families for at-home stool collection. Each sampling kit included two sets of collections tubes and swabs to collect stool samples, for both the child with ASD and his or her neurotypically developing sibling, from soiled toilet paper, instructions on how to collect the samples, and a detailed, 53-question dietary questionnaire for each child (see Supporting Information SI 1). Participants returned the samples to the research staff via prepaid and pre-labelled packaging (McDonald et al., 2018).

ASD Diagnosis Confirmation

To confirm the parent-provided ASD diagnosis of a child-subject, we applied two machine learning classifiers, one based on a parent-directed questionnaire (Duda et al., 2016)Wall:2012gv} and one based on a home video of the child with ASD (Wall et al., 2012; Duda et al., 2014; Levy et al., 2017). The parent-directed questionnaire is described below as “Mobile Autism Risk Assessment or MARA” and the video-based classifier is referred to as “video classifier. Due to both privacy and technical barriers to video upload, we made the video upload optional but required that all subjects complete the MARA as a strict inclusion criterion. We confirmed the self-reported diagnosis of the child using MARA and, when available, both the MARA and the video classifier. When both were available, concordance in outcome from both classifiers with the self-reported diagnosis was required for a sample to be included in our study.

Mobile Autism Risk Assessment

Participants electronically completed the clinically validated Mobile Autism Risk Assessment (MARA) (Duda et al., 2016)Wall:2012gv}. This system uses a set of 7 behavioral features developed through machine learning for rapid screening for autism. The 7-feature set is measured through parent-report in a questionnaire on a mobile device. Each feature is scored by the parent on a scale from 0 to 4, 0 being most impaired and 4 being least impaired. The features focus on the child’s language ability, make-believe play, social activity, restricted and repetitive behaviors, general signs of developmental delays by or before age 3, and eye contact. The responses generate a score that classifies the child as either “ASD” (positive score) or “no ASD” (negative score).

Video Analysis

In addition, we requested (as optional) a home video of their child with ASD via our secure study website. For a video to be eligible for analysis, we asked that it include social interaction, use or play with objects in the video, be at least two minutes long, and clearly show the child’s face and hands. The specific details of scoring and the validation of the classifier are described in previous publications (Wall et al., 2012; Duda et al., 2014; Levy et al., 2017). For the purposes of this study we had 3 video raters who were blind to diagnosis independently tag the specific behavioral features that our video classifier requires to produce a risk score. We took the majority consensus diagnosis as the outcome for comparison with the caregiver’s self-reported diagnosis.

To safeguard against ascertainment bias due to increasing familiarity, we required our video analysts to score an unlabeled mixture of ASD participant videos and similar home videos of neurotypical children ages 2-7 years mined from YouTube’s publicly available video repository. The responses to each question were scored on a scale from 0 to 4, generating a classification of the child as either “ASD” or “no ASD”. Similar to the MARA, the outcome is a probability score that indicates both class (ASD or non-ASD) as well as severity of phenotype.

Lifestyle and Dietary Practices

Participants electronically completed dietary and lifestyle questionnaires on behalf of their children, using a 5-point frequency-based Likert scale and categorical answers (Supplementary Information SI 1). Questions covered dietary habits (e.g., How many servings of vegetables does your child eat in a typical week?), lifestyle habits (e.g., How many times a week does your child exercise?), and other pertinent information (e.g., Was your child born by C-section?).

We investigated systematic differences in the dietary habits of lifestyles of children with ASD as compared to NT children. Categorical data items were assigned either 1 or 0, and Likert scale items were assigned a value from 1 to 5 (1 = “Never,” 2 = “Occasionally,” 3 = “Sometimes,” 4 = “Often,” and 5 = “Always”). Differences of qualities with ordinal values were investigated using a linear-by-linear association test, and qualities with categorical values were tested using a chi-squared test. To verify that family relation was a practical criterion to ensure similarity between case and control lifestyle and dietary habits, we performed a permutation test (999 permutations) on the Euclidean distances between participants’ numerical questionnaire responses. Data were standardized to mean as 0 and variance as 1 to account for the differences in scale between categorical and Likert scale numerical values.

DNA Extraction, Amplification and Sequencing

Microbiome samples were processed according to the procedures outlined by the American Gut Project Protocol Apprill:2015gb, (Caporaso et al., 2011; 2012; Parada et al., 2015). DNA was extracted using the 96-well Powersoil DNA Isolation Kit (MO BIO, Carlsbad, CA). We utilized the manufacturer’s protocol with the following modification: after the addition of the sample and solution C1, we partially submerged the sealed extraction plates in a water bath for 10 minutes at 65°C. We amplified the extracted DNA using the 5PRIME MasterMix (5 PRIME, Inc, Gaithersburg, MD) and the 515F/806R primers for a final concentration of 0.2µM per primer. Thermocycler settings for generating amplicons were 3 minutes at 94°C, then 35 cycles at 94°C for 45 seconds, 50°C for 1 minute, and 72°C for 1.5 minutes, with a final extension for 10 minutes at 72°C. After PCR, we quantified the DNA concentration of each sample using the Quant-iT PicoGreen dsDNA Assay kit and then pooled to 70 ng DNA per sample. We generated clean pools using the QIAquick PCR Purification Kit (QIAGEN, Hilden, Germany). The clean pools were then submitted to the Environmental Sample Preparation and Sequencing Facility at Argonne National Laboratory to be sequenced on an Illumina MiSeq using V4 chemistry.

Sequence Filtering, Chimera Removal, Taxonomic Assignment and Phylogenic Tree

Raw sequences were processed using the workflow available in the software package DADA2 (Callahan et al., 2016), which models and corrects amplicon errors. Reads were trimmed to include base pairs 10 through 140 and truncated at the first instance of a Phred quality score less than 20. Reads with more than two expected errors were filtered out. Reads were then de-replicated and de-noised. Forward and reverse reads were merged and chimeras were removed. Taxonomy was assigned to each Exact Sequence Variant (ESV) generated by this pipeline by running the Ribosomal Database Project’s (RDP) naive Bayesian classifier (Wang et al., 2007), implemented in DADA2, against the GreenGenes dataset maintained in DADA2 package (DeSantis et al., 2006).

The phylogenetic tree was rooted using an archea sequence from Halorhabdus rudnickae as an outgroup (see available github code): all the sequences were aligned using the phangorn package and a Neighbor-Joining Tree was built (SAITOU and NEI, 1987) using ape. The tree was bootstrapped 100 times with phangorn (Schliep, 2011).

Statistical Analyses

We performed statistical analyses with R version 3.4.2 (2017-09-28) using R Studio Integrated development environment for R v1.0.136 (open source software, Boston, MA). We used the following packages in R: DESeq2_1.18.0, SummarizedExperiment_1.8.0, DelayedArray_0.4.1, matrixStats_0.52.2, GenomicRanges_1.30.0, GenomeInfoDb_1.14.0, IRanges_2.12.0, S4Vectors_0.16.0, BiocInstaller_1.28.0, gtable_0.2.0, cowplot_0.8.0, lattice_0.20-35, gridExtra_2.3, scales_0.5.0, metagenomeSeq_1.20.0, RColorBrewer_1.1-2, glmnet_2.0-13, foreach_1.4.3, Matrix_1.2-11, limma_3.34.0, Biobase_2.38.0, BiocGenerics_0.24.0, gage_2.28.0, readr_1.1.1, igraph_1.1.2, ggplot2_2.2.1, reshape2_1.4.2, structSSI_1.1.1, dplyr_0.7.4, ape_5.0, phyloseq_1.22.3. All code used for this work is publicly available: https://github.com/walllab/ASD_microbiome16s_public/. The raw fastq files can be found at (Kang et al., 2013; 2017b). For a workflow diagram, see SI 9.

Analysis of Alpha-Diversity Differences

We calculated alpha-diversity for each sample using Shannon-Weiner diversity, a traditional metric that takes into account richness and evenness of taxonomic species, and Phylogenetic Diversity, a metric that measures the total length of phylogenetic branches necessary to span the set of taxa in a sample (Vuong and Hsiao, 2017). We then used a Wilcoxon rank sum test (Finegold, 2011) to quantify the significance of differences observed between the two cohorts. Next, we performed 1000 bootstrap simulations to calculate the variance of diversity metrics observed in each cohort and again used a rank sum test to quantify significance of the difference in variances.

Identification of Dietary and Lifestyle Habits Influencing the Microbial Community

To determine whether or not the parent-reported dietary and lifestyle questionnaires contained influential data and insight into the microbial communities observed in our samples, we used a PERMANOVA test (ADONIS function in vegan package) on Bray-Curtis distances between microbial compositions (Kang et al., 2013). For each dietary or lifestyle factor, samples are broken into K clusters where K is the number of possible responses to the question. The distance between sample microbial compositions was calculated, and the within-cluster distances compared to the between-cluster distances over random permutations of cluster membership. We also performed a test to measure the homogeneity of the dispersion (PERMDISP2 procedure) of each cluster in order to be more confident that the cluster-specific centroids were robustly different rather than due to disparities in cohort dispersions (Kang et al., 2017a).

Permutation Test on Sibling Pair Differentials

In addition to community level trends, we investigated differential abundances of specific taxa. We ran a permutation test on mean taxa abundance differences between sibling pairs to determine if any taxa were systematically enriched or depleted in our ASD samples when compared to our NT controls. We selected DeSeq2 as a normalization method to minimize batch effect and noise from differences in sampling depth while maintaining the expected dataset properties (Faith and Baker, 2006). We expected that the microbiome compositions of age-matched sibling pairs would be closer to each other than to any other samples in the cohort, due to the environmental and genetic similarities shared by young siblings living within the same household. Additionally, we resampled and resequenced samples from eight individuals, with at least six months in between samplings, to examine the similarities of the results and confirm that the quality of the samples was maintained over time. Just as in the case of siblings, we expected samples from the same individual to contain microbiome compositions very similar to each other. We investigated the suitability of three commonly used normalization techniques: CSS normalization, DeSeq2 normalization, and log transformation, and found that DeSeq2 best matched above stated expectations (SI 2). This method performs variance stabilization to normalize counts with respect to library size and heteroskedasticity (Wilcoxon et al., 1963).

We first excluded all samples that were singletons resulting from quality control or from third siblings (sibling furthest in age was removed). We were left with 55 sibling pairs, each sequenced on the same day, on the same instrument and to similar depths. Using DeSeq2 normalized taxa abundances, counts in the NT siblings were subtracted from those in the ASD siblings and averaged across all samples for each taxon. For each taxon, we simulated a null distribution of the average sibling differences by repeating the above procedure with permutated sibling pairs and phenotype assignments 10000 times. Lastly, we calculated a p-value as the number of times the null hypotheses produced a value more extreme than the actual value observed (see SI 8).

Null distributions remained stable well before 10000 permutations. We assessed stability by comparing the value of the ks test statistic to the null distribution shapes at increments of 500 simulations. At 10,000 simulations, the maximal ks test statistic over all taxa (when increasing from 9000 to 10000 simulations) was 4.8 * 10^-3.

Models to Maximize the Likelihood of Detecting Low Abundance Species

Given the sparsity of 16S sequencing due to sequencing depth limitations, we used differential ribosomal analysis based on the negative binomial distribution and zero inflated Gaussian analysis, to estimate log-fold changes of taxa abundances between our ASD and NT groups (Dixon, 2003).

Differential Ribosomal Analysis Based on the Negative Binomial Distribution

We performed differential analysis of taxa counts between groups by modeling taxa abundances under a negative binomial model using the DESeq2 framework. This method performs variance stabilization on taxa counts and then fits a generalized linear model with a log link on normalized count data. Coefficients representing the log fold changes of taxa between groups are then extracted and shrunk toward zero using an empirical Bayes model that effects taxa with low counts more severely. The method then calculates each shrunken log fold change’s standard error from the curvature of its posterior and performs a Wald test (Wald, 1943) to determine whether the log fold change of any one taxa is significantly different from zero. The p-values associated with each taxon are corrected for multiple hypotheses using false discovery rate (FDR) (Benjamini and Hochberg, 1995).

Zero Inflated Gaussian Analysis

To account for sparsity due to under-sampling, we used the method developed by Paulson et al.: a mixture model that uses a zero-inflated gaussian distribution to account for varying depths of coverage Paulson:2013hk}. To model data appropriately under a zero-inflated Gaussian model, it was necessary to normalize data in a way that does not change the distribution of the variance of taxa across samples. We used cumulative sum scaling (CSS) (Paulson et al., 2013). to account for under-sampling and increase the sensitivity and specificity of identifiable taxa. CSS is a technique that mitigates bias coming from features that are preferentially amplified in a sample-specific manner. CSS divides the feature counts for each sample by the sum of feature counts with values less than that of the median (or chosen percentile), rather than by the total counts in that sample (as in total-sum normalization). Counts are then multiplied by a normalization constant that is the same across samples to ensure normalized counts have interpretable units. The method models each taxon count as a mixture model of a point mass at zero and a normal distribution parameterized by the observed taxa distribution.

Functional Profile Prediction using Piphillan

We inferred metabolic activity using Piphillan (Iwai et al., 2016), a bioinformatics software package designed to predict metagenome functional content from marker gene (16S) surveys. Piphillan aligns 16S sequences to sequences in the GreenGenes database (DeSantis et al., 2006), and assigns functional profiles based on 97% match of 16S sequences. 16S sequences that do not match a database entry are assigned the functional profile of their nearest neighbor. We chose this software over other alternatives, such as PiCRUST (Langille et al., 2013) and Tax4Fun (Aßhauer et al., 2015), because of its high performance on clinical samples and its usage of the most current GreenGenes database. Piphillan was run on DESeq2 normalized data to produce estimates of KEGG ortholog abundances (Kanehisa and Goto, 2000; Kanehisa et al., 2016; 2017). We then performed a modified gene enrichment analysis: a set was considered a metabolic pathway as defined by the KEGG Brite database and an element was considered a KEGG ortholog (Subramanian et al., 2005).

Gene Set Enrichment Analysis Using KEGG Orthologs

Using KO prediction provided by Piphillan, we distilled the data to define functional pathways as described by the KEGG database <http://www.kegg.jp/kegg/pathway.html> (Kanehisa and Goto, 2000). As some KOs contribute to multiple pathways, we first divided the abundance of each KO in a sample by the number of pathways the KO participates in, so that a single functional unit’s activity or output may contribute to only one pathway at a time (as a rule of thumb). Then we used the Gage implementation of Gene Set Enrichment Analysis (GSEA) (Luo et al., 2009) to perform a modified gene enrichment analysis: a set was considered a metabolic pathway as defined by the KEGG Brite database and an element was considered a KEGG ortholog(Subramanian et al., 2005).

Power Calculation

We modeled microbial abundances as a Dirichlet-Multinomial, a model which has been proven to successfully reflect the abundances seen in naturally occurring microbial communities (La Rosa et al., 2012). Under this model, we estimated Method-of-Moments (MoM) parameters for each taxon and determined the stability of those estimates by comparing likelihood-ratio-test statistics over 1000 Monte-Carlo simulations (Mooney, 1997). From this simulation, we can determine the number of samples necessary to reach a given level of power when estimating parameter values. At a rejection threshold value of 0.05, n=70 ASD child-subjects and n=70 NT subjects were required to obtain power above 0.99, and n=45 ASD child-subjects and n=45 NT child-subjects was sufficient to provide a power greater than 0.95. Though we do not explicitly use the Dirichlet-Multinomial model in further analyses, a high power in this context implies a high power in more complex down-stream analyses that are not able to be simulated.

RESULTS

Crowd Sourcing Recruitment and Participant Demographics

Between March 2015 and September 2017, 20,478 unique users visited our study website, 1,953 were electronically screened for eligibility by survey, and 297 of them met our study inclusion criteria. 194 users electronically consented to participate, and 164 began responding to the online surveys. Of 164 participants, 100 completed the online component and were mailed sampling kits. 71 families, or parents of 142 sibling pairs, completed the online and at-home sampling procedures for the study, and 117 child-subjects (60 ASD and 57 NT) met all eligibility criteria, including the required confirmation of diagnosis obtained from the MARA and video classifier, when submitted. Of the 117 child-subjects, there were 55 sibling pairs, two sibling pairs were accompanied by a third sibling with autism, and 5 were singleton samples. The ASD cohort comprised 72% male participants (n = 43), as compared to 55% of the NT cohort (n=27). Dietary and lifestyle questionnaires were completed, in entirety, for 106 of the 117 participants. Among the 106 child-subjects, 66% (n=79) identified as Caucasian, 7.5% (n=8) identified as Asian or Pacific Islander, 3.8% (n=4) identified as African American, and 7.5% (n=8) identified as Hispanic (participants were also given the option to select more than one identifying ethnicity, not reported here). Participant age was not significantly different between the ASD and NT cohorts. Additional demographic data are in Supplementary Information SI 4.

ASD Diagnosis Confirmation using the Mobile Autism Risk Assessment (MARA) and Video Classifier

All child-subjects with ASD that completed the MARA and of these 29 provided scorable video (including one family with two siblings with ASD and one NT sibling meeting the age criteria). There was a 100% agreement in class assignment between the MARA and the video classifier in all 37 cases (SI 3). In 12 instances, the output from either or both classifier (2 supported by the video classifier) did not confirm the parent-reported ASD diagnosis. These participants were therefore excluded from analysis. The results reported hereafter include only the remaining 60 child-subjects with confirmed ASD.

Diet Differences between Children with ASD and Neurotypical Siblings

We found three categorical factors (supplements, dairy intolerance, and dietary restrictions) to be significantly different between the two cohorts according to a chi-square test (Table 1). Nutritional/herbal supplements showed significant differences between the two cohorts, with 63.6% (n=35) ASD child-subjects taking an herbal supplement as compared to 35.3% (n=18) NT child-subjects (qval 2.3e-2). Dairy intolerance was also more prevalent in the ASD cohort (n=1 NT child-subjects versus n=16 ASD child-subjects) (qval 2.6e-3), which correlates with a statistically significant deviation in the frequency of consumption of both milk/cheese and milk substitutes: only 33.3% (n=18) of ASD child-subjects consumed milk/cheese on a “regular” or “daily” basis as compared to 72 % (n= 36) of NT child-subjects (qval 2.3e-3). Finally, gluten intolerance was found to be more prevalent in the ASD cohort (qval 3.5e-4). Additionally, n=20 ASD child-subjects had other special dietary restrictions apart from dairy and gluten constraints, compared to only n=6 NT child-subjects (qval 2.3e-3). Refer to Table 1 and Supplementary Information SI 4 for a summary of the remaining reported data.

View this table:

Table 1: Clinical Characteristics for ASD and NT Participants with significant difference between the cohorts.

Dietary and Lifestyle Habits Influencing the Microbial Community

SI 12 detailed the five variables that seems to significantly influence the microbial community: Probiotics, Multi-vitamin, sugary sweet, olive oil and sequecing batch. Constraint PcoA were used to identify the ESVs for which the abundance was the most influenced by these variables (SI 13).

Similarity between Sibling Lifestyles

As hypothesized, sibling lifestyles, as measured by dietary choices, supplement intake, exercise, allergies, and other factors, were significantly more similar to each other than to other participants (p << .01). We used the Euclidean distance between lifestyle description vectors to perform a permutation test (999 permutations), which confirmed the similar sibling lifestyle hypothesis. (SI 5).

Reported Gastrointestinal Symptoms

As reported above, we observed significant differences in gluten and dairy intolerances, which imply greater propensity for GI abnormalities among the ASD cohort. We did not, however, observe any significant differences between our cohorts regarding the reported gastrointestinal motility (SI 6) nor the frequency distribution of bowel movements (two-sample Wilcoxon signed-rank, p-value = 0.8313). Grouping samples into stool categories “Frequent”(> once a day), “Typical” (once a day) and “Sparse”(< once a day) did not result in any significant interdependence of phenotype and bowel movement frequency. When samples were agglomerated into two categories, typical bowel movement (one per day) and abnormal bowel movement (less or more than one per day), we did observe a slight, though not statistically significant, trend in the ASD cohort towards increase abnormal bowel movement frequency (chi-square p= 0.17).

Microbial Alpha-Diversity

We calculated the phylogenetic diversity (PD) and Shannon diversity metric for each sample (Cadotte et al., 2010). Grouping diversity measurements into “low”, “medium” and “high” categories based on observed standard deviation, we see a significant relationship between phenotype and diversity (fisher-exact p = 0.01), with high diversity associated with ASD (Figure 1). However, performing a rank sum test using each metric, we found no significant difference between cohorts. Although the variance of diversity (distribution of scores) in the ASD cohort was significantly greater than the NT cohort (bootstrap p < .001; Figure 1). Shannon diversity was also significantly related to bowel movement quality (fisher-exact p = .02), with low diversity associated with diarrhea, but not significantly related to bowel movement frequency (fisher-exact p = .17)

Figure 1: Phylogenetic Diversity and Shannon Diversity used as estimators of microbial alpha-diversity

The variance of diversity (distribution of scores) in the ASD cohort was significantly greater than the NT cohort (bootstrap p < .001) with both Diversity Estimator. Shannon diversity was also significantly related to bowel movement quality (fisher-exact p = .02), with low diversity associated with diarrhea, but not significantly related to bowel movement frequency (fisher-exact p = .17).

Permutation Test on Sibling Pairs to determine ESVs that differentiate between ASD and NT

Ten ESVs were determined to be differentially abundant between sibling pairs (ASD vs. NT) as determined by a permutation test with FDR correction (Table 2; SI 7). The mean differential abundance drawn from the null distribution was never more extreme than the actual differential abundance mean, and all p-values were 0, and increased to 9.36 × 10⁻³ upon correction (see distribution plot in SI 8). The genera Aggregatibacter, Anaerococcus and Oscillospira were significantly enriched in the ASD cohort, while Porphyromonas, Slackia, Desulfovibrio, Clostridium colinum, and Acinetobacter johnsonii were enriched in the NT cohort.

View this table:

Table 2: Candidate 16S biomarkers enriched and depleted in the autism cohort and their annotation

This table indicates the Exact Sequence Variants (ESVs) identified using 3 analysis methods: Permutation Test on Sibling Pair Differentials (Pair), Differential Ribosomal Analysis Based on the Negative Binomial Distribution (Deseq2), and Zero Inflated Gaussian Analysis (ZIG). The annotation was performed using Ribosomal Database Project’s naive Bayesian classifier with GreenGenes database 13.8. A blast was also performed using IMG’s most recent database (January 2018), and the perfect match (100% similarity on full length of the query) is indicated in the last column.

Models to Maximize the Likelihood of Detecting Low Abundance Species

We implemented a mixture model using a zero-inflated Gaussian (ZIG) distribution of mean group abundance for each ESV in metagenomeSeq (Paulson et al., 2013), in order to quantify the fold change in taxa between the ASD cohort and the NT cohort. This analysis again revealed 10 ESVs differentially present in the two cohorts: four were enriched in the ASD cohort (Table 2), and six were enriched in the NT cohort. The NT cohort was enriched in the Lachnospiraceae (five of six ESVs), including Coprococcus catus, Clostridium colinum, and the genera Bifidobacterium. In comparison, the ASD cohort was enriched in Ruminococcus and Holdemania, as well as the species Bacteroides uniformis and Clostridium celatum.

We also used a Negative Binomial Distribution Analysis to Identify ESV between ASD and NT, through which we identified a single ESV, from the genus Bacteroides (ESV1), enriched in the ASD cohort. Among the aforementioned statistical analyses, the microbial genus types identified in more than one statistical abundance test in the ASD cohort included the family Ruminococcaceae (by three ESVs including the genera Oscillospira and Ruminococcus), and the genus Bacteroides (by two ESVs). The NT cohort presents six ESVs belonging to the family Lachnospiraceae.

Functional Profile Prediction

The software Piphillan predicted ~6900 active KEGG Orthologs (KO) that were part of ~170 metabolic pathways as defined by KEGG Brite. Overall, we were able to associate 105 ESVs with full genome annotations. From the predicted KOs that were present in these genomes, we observed 17 predicted metabolic pathways with significantly differential abundance between ASD and NT. Two pathways were significantly enriched in the ASD cohort: Flagellar assembly (ko02040), and Aminoacyl-tRNA biosynthesis (ko00970) (Figure 2). Fifteen pathways were significantly enriched in the NT cohort, including Butanoate metabolism (ko00650), Propanoate metabolism (ko00640), Sulfur metabolism (ko00920), Phosphotransferase system (ko02060), and microbial metabolism in diverse environments (ko01120). A full list of significantly differential pathways is in Figure 2.

FIGURE 2: Pathway analyses derived from Exact Sequence Variants Analysis

Panel A: Butanoate metabolism: Detailed analysis of the butanoate pathway, the color of the arrow reflecting the cohort in which the ESV carrying the KEGG ortholog was detected

Panel B: KEGG pathways enriched in each cohort: List of the 17 pathways enriched in the Gene Set Enrichment Analysis using genomes and abundances estimate from the ribosomal analysis

Panel C: Microbial Metabolism in Diverse Environments: Detailed analysis of the pathway microbial metabolism in diverse environments and metabolites of interest for the gut-brain axis interaction.

DISCUSSION

Crowdsourcing Recruitment

By targeting the Internet-active autism community, we were able to crowdsource study subject recruitment and reach our targeted sample size for each cohort in a short amount of time. This methodology allowed us to collect data efficiently and effectively and to recruit participants from diverse geographical areas.

Lifestyle, Dietary Practices and GI symptoms

This study highlights ASD biomarker candidates while minimizing the impact of confounding environmental factors by crowdsourcing recruitment of ASD child-subjects who have age-matched NT siblings to act as study controls. By recruiting only sibling pairs who are within 2 years of age of one another, living in similar home environments, and eating similar diets (see SI 5), we successfully controlled for diet and lifestyle among our two cohorts. Using this approach of working with sibling cohorts, other studies have also showed very similar microbial structure between the two cohorts (Gondalia et al., 2012) to the point of not being able to identify taxa specific to one or the other cohort.

We observed no overlap between factors that heavily influence the microbial structure (SI 12) and the factors that were significantly different between the ASD and the NT cohorts (Table 1). Therefore, it is unlikely that the ESVs identified as ASD or NT biomarkers were differentially abundant due to diet or lifestyle as confounding influences.

We did not observe significant differences in the GI motility or stool quality between cohorts, however, there was an increased prevalence of dairy and gluten sensitivities among ASD child-subjects which may imply a propensity for GI distress. Dairy and gluten sensitivities have been previously found to be associated with children with ASD (Williams et al., 2011; Lau et al., 2013; Vuong and Hsiao, 2017). We also observed an increase in special dietary restrictions among our ASD child-subjects, which could imply that many parents had already implemented limitations to their child’s diet to alleviate any potential or previously identified gastrointestinal issues. Perhaps due to high parent involvement, we did not observe the expected differences in GI motility or gastrointestinal abnormalities. Failure to detect systematic GI distress in ASD could also be attributed to differences in study populations, as older siblings with wider intervals in age as were included in Son et al. are more likely to have more variable lifestyles and therefore greater differences in GI state.

Microbial Community Diversity

There is much debate in the literature as to whether microbial diversity is significantly different in children with ASD versus neurotypically developing children. While we found no significant rank sum relationship between alpha diversity of the microbiota and ASD diagnosis, we observed a significant relationship when considering samples as “High”, “Medium”, or “Low” diversity. This finding implies that more or less gut microbiome diversity does not directly translate to more or less benefit in the case of ASD, but rather that diversity should be viewed more broadly as a general contributing metric. The variance in diversity scores was significantly greater in the ASD cohort compared with the NT cohort, which may explain some of the discrepancies seen in smaller cohort studies such as those conducted by Finegold et al. (2010), Kang et al. (2017), and Hsiao et. al. (2013), which respectively report increased, decreased, and unchanged microbial diversity in an ASD cohort. Notably, in our ASD cohort, low diversity seemed strongly related to parent-reported diarrhea occurrences. Therefore, it is possible that studies that specifically enrich for significant gut abnormalities when recruiting ASD subjects may unwittingly enrich for decreased ASD microbial diversity. This finding of greater variance in the diversity of ASD microbiota suggests that perhaps the ‘Anna Karenina principle’ is at work in ASD, whereby there are more ways to be dysbiotic than non-dysbiotic, hence it is more probable to identify a greater range of alpha diversity scores in dysbiotic individuals Zaneveld:2017ii}.

Differential abundance analysis of microbial species

This study was designed to include controls that match as exactly as possible the lifestyle and environment of the autism samples in order to allow for better reproducibility and more robust association exploration. There is a concern that fecal bacteria from ASD cases could transfer to neurotypical control siblings, thus obscuring signal and not allowing us to observe differences between the ASD typified gut and the neurotypical gut (Finegold et al., 2010). This may be the case; however, this type of contamination only serves to obscure signal, not to create spurious results. Therefore, while there may be true biological associations not reported here, the associations observed from this cohort are not likely to be spurious.

To improve the taxonomic resolution of the analyses, this study relied exclusively on ESVs, meaning the taxonomic comparisons were performed without any clustering of 16S rRNA amplicon sequences (Table 2) (Callahan et al., 2016). Importantly it also produces single sequence variants which can be reproducibly detected between studies and across sequencing runs, reducing potential batch effects and improving future meta-analyses (Callahan et al., 2017).

It should be noted that while increased resolution can help improve reproducibility, no singular species, genus, or family can be considered homogeneous, and the functioning of any microbial group can vary widely based on circumstances including most recent diet, influence of other microbial community members, and strain level variation. The same species can manifest different effects on host physiology, but we report here cohort-wide associations that appear robust in a large subset of our population.

ESVs enriched in the ASD cohort

We provided in Table 3 & Table 4 a comprehensive list of biological information and information related to other reports on each bacterial associations with ASD and other pertinent phenotypes. Our findings agree overall with literature that states that Bacteroides genus and families such as Erysipelotrichaceae and Clostridiaceae (member of Clostridial cluster I) have already been widely reported as enriched in ASD (see Table 3). We also observed that members of Clostridial cluster IV (genus Ruminococcus), and ESV5 which belongs to the family Pasteurellaleae, are both enriched in the ASD cohort. This entire family was previously reported as being depleted in ASD participants (Kang et al., 2017b); this discrepancy could be explained by the study’s aggregation of all Pasteurellaleae, while we implicate a single member of the family. Pasteurellaleae was also detected as one of the most abundant bacterial family in children with developmental disabilities in Japan (Naka et al., 2009), supporting its potential association with atypical behavioral phenotypes. Finally, we also found that the genus Anaerococcus (ESV5) was enriched in the ASD cohort which to our knowledge, has not yet been reported in previous research literature.

View this table:

Table 3: Taxa Enriched in the ASD Cohort

View this table:

Table 4: Taxa Enriched in the Neurotypical Cohort

ESVs depleted in the ASD cohort

Our analysis identified five ESVs belonging to the Lachnospiraceae family that are depleted in the ASD cohort and enriched in the neurotypical cohort. This family overall has already been reported as associated with autism phenotype (see Table 3). Of these ESVs, the RDP classifier was only able to assign two species names, and we were only able to identify three genomes carrying similar ribosomal sequences (Table 2), indicating that we may have identified novel variants from our analyses (Euzéby, 2010; Yoon et al., 2017). These homogeneous phylogenetic groups of ESVs seem especially interesting as they cluster near each other on the 16S rRNA phylogenic tree (Figure 3). Additionally, members from the Clostridial cluster IV were associated with ASD, while members from the Clostridial cluster XIVa were associated with the control cohort. We could hypothesize that these two families exhibit metabolic pathways that are distinct among their functional redundancies. As microbes from this genus are some of human gut-associated microbiomes main butyrate producers, we examine the differences in butyrate production pathways of these two clusters in our pathway analysis below.

FIGURE 3: Selected order from V4 ribosomal derived phylogenetic tree

Ribosomal derived phylogenetic tree (V4 alignments) of the order carrying Exact Sequence Variants (ESV) of interest. The ESV in blue are enriched in Neurotypical cohort and the ones in blue in the Autism cohort.

The genera Desulfovibrio and Bifidobacterium were also depleted in the ASD cohort, consistent with results from at least 3 other studies (see table 3). Bifidobacterium has been characterized for its ability to normalize gut permeability (Desbonnet et al., 2008), and lack of this genus has been hypothesized to facilitate translocation of harmful microbial metabolites from the gut to the blood. Our analysis also pinpoints the possible importance of the genus Slackia in the neurotypical cohort, which was depleted in our ASD cohort. Finally, we identified two more families depleted in the ASD cohort, Porphyromonadaceae and Moraxellaceae, which respectively produce butyrate or use it as the sole source of carbon.

Pathway analysis

Using the pipeline Piphillin (Iwai et al., 2016) to infer KOs from our ESVs, and performing a GSEA using the KOs, we identified 17 predicted pathways associated with either cohort. Predicted pathway abundance in this analysis is defined by the relative potential capacity of a present bacterial genome to produce any active enzymes in a given biological pathway. It should be noted that connecting enriched predicted pathways to the potential biomarker ESVs reported can be tenuous, as Piphillin was only able to match 6 of our 21 markers to full genomes and extract their associated KOs. Notable differentially predicted pathways include (1) butanoate metabolism, glycolysis and pyruvate metabolism, (2) propanoate metabolism, (3) sulfur metabolism, (4) aminoacyl-tRNA biosynthesis, (5) the phosphotransferase system, and (6) microbial metabolisms in diverse environments. Additionally predicted pathways comprised more general functions (e.g., biosynthesis of antibiotics, biosynthesis of secondary metabolites, carbon metabolism, and two-component system pathways) or were only detected in one ESV biomarker (e.g. flagellar assembly).

Butyrate Production Pathway

The potential role of short chain fatty acids (SCFAs) in autism has been discussed in multiple studies. Wang et al. reported elevated SCFA concentration in children with ASD (Wang et al., 2012), while two other studies reported the opposite trend when looking at total SCFAs (Adams et al., 2011; De Angelis et al., 2013). MacFabe et al. found that intravenous administration of the SCFA propionate induced ASD typified behavior in mouse models, though it is likely that propionate injected intravenously may have a different effect compared with propionate originating from GI-microbial fermentation (Macfabe, 2013). Butyrate, in particular, has been proposed as a potential major mediator of the gut-brain axis either through modulation of the density of cholinergic enteric neurons through epigenetic mechanisms, or through direct modulation of the vagus nerve and hypothalamus (Liu et al., 2018). In our cohort, we observe an enrichment of microbial genomes capable of butyrate metabolism, implying that butyrate production and consumption pathways, in the stool-microbiome of NT participants (Figure 2). Butyrate production pathways in commensal microbial species and pathogens are thought to have evolved divergently; there are 4 pathways for butyrate production each branching from a different initial substrate: Pyruvate, 4-aminobutyrate, Glutarate, and Lysine (Anand et al., 2016). The by-products and influences of these major butyrogenic pathways could be relevant to host physiology. In our cohort, the predicted bacterial genetic potential in NT samples showed an enrichment for KOs associated with butyrate production from pyruvate, while in the ASD samples, the predicted functional potential was enriched for butyrate production via the 4-aminobutanoate (4Ab) pathway (Figure C). 4Ab is a neurotransmitter and its biosynthesis can directly interfere with the amount of available glutamate (Anand et al., 2016). This pathway can also potentially release harmful by-products such as ammonia (Anand et al., 2016), which were found elevated in feces of children with ASD (Wang et al., 2012). Genes identified as part of the pyruvate biosynthesis pathway were either found in both cohorts or only in predicted genomes associated with biomarkers from the NT cohort (Figure 2) (SI 10 panel B).

Propionate pathway: The predicted pathways for the synthesis of propionate, another SCFA, appears depleted in the ASD cohort (Figure C). This pathway has the potential to generate isopropanol, which has recently been found at a significantly greater concentration in the feces of children with autism (Kang et al., 2017b).

Sulfur Pathway: The association between the predicted sulfur pathway and the NT cohort, though somewhat surprising, parallels our finding of Acinetobacter and Desulfovibrio enrichment in the NT cohort. The reactions detailed in SI 10 panel D suggest an imbalanced within the sulfur cycle, which has already been hypothesized as possible route modulating the gut-brain interaction in autism (Midtvedt, 2012).

Aminoacyl-tRNA Biosynthesis Pathway: As expected, the vast majority of the aminoacyl-tRNA biosynthesis pathway is predicted to be present in all identified ESVs. Some enzymes from this pathway, specifically L-glutamine amido-ligase directly affect the availability of neurotransmitter precursors (SI 10 panel F) (YIP and KNOX, 1970).

Phosphotransferase system: We also observed differential abundance of predicted carbohydrate uptake pathways within the ESVs associated with each cohort: the ESVs associated with ASD seemed to show a much greater variety of carbohydrate transporters (SI 10 panel G).

Microbial metabolism in diverse environments: This high level category comprises many different pathways, which were not individually found enriched in either cohort. It is however interesting to note that this KEGG category is associated with several metabolites already known in the literature as enriched or depleted in subjects with ASD (Figure 2). Among them were p-cresol (Kang et al., 2017b), and ammonia (De Angelis et al., 2013) that have been found in greater abundance in the feces of children with autism; SCFAs (propanoate and acetate), which have mixed reports associating them with either children with ASD or controls (Adams et al., 2011; Wang et al., 2011; De Angelis et al., 2013); and neurotransmitters such as L-glutamate and GABA, which tend to be respectively greater and lower in feces of children with ASD, respectively (Kang et al., 2017b). Glutamine, found in greater levels in the plasma of ASD participants (Kang et al., 2017b), also belongs to this KEGG pathway, as do several metabolites such as nicotinate and aspartate, another neurotransmitter (Yap et al., 2010; De Angelis et al., 2013; Kang et al., 2017b).

Power Calculation and Sample Size

Likelihood-ratio-test statistics for a Dirichlet-Multinomial parameter test comparison showed that to reach an acceptable power (>0.9), we needed to include a minimum of 45 child-subjects per cohort. We were successful at screening, recruiting, sequencing, and analyzing 60 ASD and 57 NT child-subjects (SI 11). While this analysis does not calculate the power for each of the tests we performed, the Dirichlet-Multinomial distribution does allow power calculations for experimental design and population parameter estimations using a fully parametric approach. And though we cannot relate the verification of sufficient sample size to the non-parametric permutation test, we can conclude that our sample size is sufficient for reproducibility in the results from the zero-inflated Gaussian and DESeq2 models.

Limitations

While these results show promising microbial differences between autism and typically developing children, potential limitations included reliance on self-reported information, limited identification of species or strain level variants, limited single time-point sampling, and lack of consideration of host genetic variation.

While we safeguarded against self-report bias through two validated machine-learning algorithms that adapt well to mobile testing, there remains bias in self-report of diagnosis may remain. In particular because we only required MARA for the child with ASD, we could not confirm the typical development of their siblings. In addition, the compliance with the optional request for video was slightly under <50% of the cohort studied. While it was encouraging to see perfect alignment between the MARA and the video classifier outcomes, bolstering confidence in the confirmation of self-report, it would be better to require this dual check for all participants in future work.

Although widely used (Kang et al., 2017a), self-reported GI symptoms can also suffer some discrepancies when compared to a pediatric gastroenterologist reported data (Frye et al., 2015). Furthermore, while we observed physiological distinctions between the microbiomes of the cohorts on the level of exact sequence variants, it was often not possible to assign a taxonomic annotation or full genome to these sequences because of incomplete coverage in public databases. As the predicted pathways discussed were highly dependent on availabilities of full genome information, further metagenome and multi-Omics analyses in this space will be needed to confirm the metabolic hypotheses presented.

Finally, this study only collects one microbiome sample from each participant child and does not consider the influence of genetic variation between subjects or cohorts. A prospective and longitudinal study, described in Future Work, will ameliorate these limitations and significantly contribute to our understanding of the gut-brain interactions by accounting for the host genotype, gut microbiome, phenome, and metabolome of more than 200 age-matched sibling pairs with and without ASD.

Future Work

This study has provided potential microbial biomarkers, both taxonomic and functional that associate the stool-associated microbiome with the ASD phenome. These findings may be due in part to the fact that we were able build a larger sample than previous studies on the microbiome and ASD, however, it will be necessary to continue sampling this modality in a larger and even more diverse cohort. Our study to confirms that crowdsourcing is a viable and cost effective way to do so. Thus our future work will take a similar angle but on an expanded population and will additionally move from single to multi-time-point sampling of the subjects to control for unrelated environmental influences on the gut microbiome. In addition, future work should combine the microbiome and phenotype, with the genome to enable more precise stratification of the relationship between microbiota and the Autism Spectrum and improve our understanding of the host-microbiome interaction as well as facilitate the discovery of more clinically useful autism biomarkers. This study phase has paved the way towards creating a more robust study where we will incorporate the three aforementioned modalities. We will also aim to validate whether specific microbial taxa and metabolisms are causally associated with ASD through improved characterization of the microbiome (e.g. metagenomics, metabolomics, etc.) in human longitudinal studies, animal studies to demonstrate causation, and human interventional studies that target the microbiome

Conclusion

The aim of this study was to explore the association between the composition of the gut microbiome and the ASD phenotype in order to predict mechanism of association and to identify taxonomic and functional biomarkers and targets for future therapeutic research. Using a novel crowd-sourcing approach, we recruited 71 ASD/NT young sibling pairs, thereby limiting the confounding factors of age, lifestyle, diet, and genetics. This improves our confidence in the observed differences in gut bacterial taxa associated with the ASD phenotype. We observed systematic differences in the abundance of specific microbes between the ASD and NT cohorts, including a depletion of five ESVs from the Lachnospiraceae family and two ESVs associated with the genera Desulfovibrio and Bifidobacterium in the ASD cohort, and a difference in membership of clostridial organisms between the cohorts. Taxonomic assignments for short 16S rRNA fragments cannot be used to predict the full genetic functional potential of a microbiome, but using conservative methods we predicted the general functional potential of these microbiota and observed possible differences in the pathways associated with butyrate synthesis, potential harmful by-product and asssociated neurotransmitter production, that warrants further examination. The observed predicted differences in stool-associated microbial metabolic potential between ASD and NT siblings are worthy of future investigation into causality, and could represent opportunities for therapeutic intervention.

Competing interests

JAG is the cofounder and chief scientific advisor for Gusto Global LCC, in which he owns equity. DPW is cofounder of Cognoa, a company focused on digital methods for healthy child development. MMD is the co-founder of ENOVEO a company specialized in environmental microbiology.

Acknowledgements

The work was supported in part by funds to DPW from NIH (1R01EB025025-01 & 1R21HD091500-01), The Hartwell Foundation, Bill and Melinda Gates Foundation, Coulter Foundation, Lucile Packard Foundation, and program grants from Stanford’s Precision Health and Integrated Diagnostics Center (PHIND), Beckman Center, Bio-X Center, Predictives and Diagnostics Accelerator (SPADA) Spectrum, and Child Health Research Institute. We also acknowledge generous support from David Orr, Imma Calvo, Bobby Dekesyer and Peter Sullivan.

References

↵
Adams JB, Johansen LJ, Powell LD, Quig D, Rubin RA. Gastrointestinal flora and gastrointestinal status in children with autism--comparisons to typical children and correlation with autism severity. BMC Gastroenterol. 2011;11:22. PMCID: PMC3072352
OpenUrl CrossRef PubMed
Adams JB, Romdalvik J, Levine KE. Mercury in first-cut baby hair of children with autism versus typically-developing children. Toxicological & …. 2008.
↵
Anand S, Kaur H, Mande SS. Comparative In silico Analysis of Butyrate Production Pathways in Gut Commensals and Pathogens. Front Microbiol. Frontiers; 2016;7(e0142038):1945. PMCID: PMC5133246
OpenUrl
↵
Aßhauer K, Bernd W, Rolf D, Meinicke P. OP-CBIO150298 2882..2884. Hofacker I, editor. Bioinformatics [Internet]. 2015 Aug 19;31(17):2882–4. Retrieved from: https://www-ncbi-nlm-nih-gov.stanford.idm.oclc.org/pmc/articles/PMC4547618/pdf/btv287.pdf
↵
Baio J, Wiggins L, Christensen DL, Maenner MJ, Daniels J, Warren Z, et al. Prevalence of Autism Spectrum Disorder Among Children Aged 8 Years - Autism and Developmental Disabilities Monitoring Network, 11 Sites, United States, 2014. MMWR Surveill Summ. 2018 Apr 27;67(6):1–23. PMCID: PMC5919599
OpenUrl CrossRef PubMed
↵
Behrend TS, Sharek DJ, Meade AW, Wiebe EN. The viability of crowdsourcing for survey research. Behav Res Methods. 2011 Sep;43(3):800–13.
OpenUrl CrossRef PubMed Web of Science
↵
Benjamini, Hochberg Y. Controlling the False Discovery Rate: A practical and powerfu approach to Multiple Testing. Journal of the royal Statistical Society. 1995 May 17;:289–300.
↵
Bercik P, Park AJ, Sinclair D, Khoshdel A, Lu J, Huang X, et al. The anxiolytic effect of Bifidobacterium longum NCC3001 involves vagal pathways for gut-brain communication. Neurogastroenterol. Motil. 2011 Dec;23(12):1132–9. PMCID: PMC3413724
OpenUrl CrossRef PubMed
↵
Boukthir S, Matoussi N, Belhadj A, Mammou S, Dlala SB, Helayem M, et al. [Abnormal intestinal permeability in children with autism]. Tunis Med. 2010 Sep;88(9):685–6.
OpenUrl PubMed
↵
Bravo JA, Forsythe P, Chew MV, Escaravage E, Savignac HM, Dinan TG, et al. Ingestion of Lactobacillus strain regulates emotional behavior and central GABA receptor expression in a mouse via the vagus nerve. Proc. Natl. Acad. Sci. U.S.A. 2011 Sep 20;108(38):16050–5. PMCID: PMC3179073
OpenUrl Abstract/FREE Full Text
Bruce-Keller AJ, Salbaum JM, Luo M, Blanchard E, Taylor CM, Welsh DA, et al. Obese-type gut microbiota induce neurobehavioral changes in the absence of obesity. Biol. Psychiatry. 2015 Apr 1;77(7):607–15. PMCID: PMC4297748
OpenUrl CrossRef PubMed
↵
Cadotte MW, Jonathan Davies T, Regetz J, Kembel SW, Cleland E, Oakley TH. Phylogenetic diversity metrics for ecological communities: integrating species richness, abundance and evolutionary history. Ecol. Lett. Blackwell Publishing Ltd; 2010 Jan;13(1):96–105.
OpenUrl
↵
Callahan BJ, McMurdie PJ, Holmes SP. Exact sequence variants should replace operational taxonomic units in marker-gene data analysis. ISME J. Nature Publishing Group; 2017 Dec;11(12):2639–43. PMCID: PMC5702726
OpenUrl
↵
Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJA, Holmes SP. DADA2: High-resolution sample inference from Illumina amplicon data. Nat. Methods. Nature Publishing Group; 2016 May 23;13(7):581–3. PMCID: PMC4927377
OpenUrl
↵
Caporaso JG, Caporaso JG, Lauber CL, Lauber CL, Walters WA, Walters WA, et al. Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proc. Natl. Acad. Sci. U.S.A. [Internet]. 2011 Mar 15;108(Supplement_1):4516–22. Retrieved from: http://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=20534432&retmode=ref&cmd=prlinks
OpenUrl
↵
Caporaso JG, Lauber CL, Walters WA, Berg-Lyons D, Huntley J, Fierer N, et al. Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms. ISME J. 2012 Aug;6(8): 1621–4. PMCID: PMC3400413
OpenUrl CrossRef PubMed Web of Science
↵
Comber AJ, Brunsdon C, See LM, Fritz S, McCallum I. Comparing Expert and Non-expert Conceptualisations of the Land - An Analysis of Crowdsourced Land Cover Data. COSIT. Cham: Springer International Publishing; 2013;8116(Chapter 14):243–60.
OpenUrl
↵
David MM, Babineau BA, Wall DP. Can we accelerate autism discoveries through crowdsourcing? Research in Autism Spectrum Disorders. 2016 Dec;32:80–3.
OpenUrl
↵
De Angelis M, Piccolo M, Vannini L, Siragusa S, De Giacomo A, Serrazzanetti DI, et al. Fecal microbiota and metabolome of children with autism and pervasive developmental disorder not otherwise specified. Heimesaat MM, editor. PLoS ONE. Public Library of Science; 2013;8(10):e76993. PMCID: PMC3793965
OpenUrl
↵
de Magistris L, Familiari V, Pascotto A, Sapone A, Frolli A, Iardino P, et al. Alterations of the intestinal barrier in patients with autism spectrum disorders and in their first-degree relatives. J. Pediatr. Gastroenterol. Nutr. 2010 Oct;51(4):418–24.
OpenUrl CrossRef PubMed
DeAngelis KM, D’haeseleer P, Chivian D, Simmons B, Arkin AP, Mavromatis K, et al. Metagenomes of tropical soil-derived anaerobic switchgrass-adapted consortia with and without iron. Standards in genomic sciences [Internet]. 2013;7(3):382–98. Retrieved from: http://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=24019987&retmode=ref&cmd=prlinks
OpenUrl
↵
DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, et al. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl. Environ. Microbiol. 2006 Jul;72(7):5069–72. PMCID: PMC1489311
OpenUrl Abstract/FREE Full Text
↵
Desbonnet L, Garrett L, Clarke G, Bienenstock J, Dinan TG. The probiotic Bifidobacteria infantis: An assessment of potential antidepressant properties in the rat. J Psychiatr Res. 2008 Dec;43(2):164–74.
OpenUrl CrossRef PubMed Web of Science
Di Gioia D, Aloisio I, Mazzola G, Biavati B. Bifidobacteria: their impact on gut microbiota composition and their applications as probiotics in infants. Appl Microbiol Biotechnol. Springer Berlin Heidelberg; 2014 Jan;98(2):563–77.
OpenUrl
↵
Dixon P. VEGAN, a package of R functions for community ecology. Journal of Vegetation Science. 2003 Dec;14(6):927–30.
OpenUrl CrossRef Web of Science
Downs R, Perna J, Vitelli A, Cook D, Dhurjati P. Model-based hypothesis of gut microbe populations and gut/brain barrier permeabilities in the development of regressive autism. Med. Hypotheses. 2014 Dec;83(6):649–55.
OpenUrl
↵
Duda M, Daniels J, Wall DP. Clinical Evaluation of a Novel and Mobile Autism Risk Assessment. J Autism Dev Disord. 2016 Jun;46(6):1953–61. PMCID: PMC4860199
OpenUrl
↵
Duda M, Kosmicki JA, Wall DP. Testing the accuracy of an observation-based classifier for rapid detection of autism risk. Transl Psychiatry. 2014;4:e424. PMCID: PMC4150240
OpenUrl
↵
Euzéby J. List of new names and new combinations previously effectively, but not validly, published. Int. J. Syst. Evol. Microbiol. Microbiology Society; 2010 May;60(Pt 5):1009–10.
OpenUrl
Ezaki T, Kawamura Y, Li N, Shu S, Li ZY, Zhao L. Proposal of the genera Anaerococcus gen. nov., Peptoniphilus gen. nov. and Gallicola gen. nov. for members of the genus Peptostreptococcus. Int. J. Syst. Evol. Microbiol. [Internet]. 2001 Jul 1;51(4):1521–8. Retrieved from: http://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=11491354&retmode=ref&cmd=prlinks
OpenUrl
↵
Faith DP, Baker AM. Phylogenetic diversity (PD) and biodiversity conservation: some bioinformatics challenges. Evol. Bioinform. Online. Libertas Academica Ltd; 2006;2:121–8. PMCID: PMC2674678
OpenUrl
↵
Finegold SM. State of the art; microbiology in health and disease. Intestinal bacterial flora in autism. Anaerobe. 2011 Dec;17(6):367–8.
OpenUrl
↵
Finegold SM, Dowd SE, Gontcharova V, Liu C, Henley KE, Wolcott RD, et al. Pyrosequencing study of fecal microflora of autistic and control children. Anaerobe. 2010 Aug;16(4):444–53.
OpenUrl CrossRef PubMed Web of Science
↵
Finegold SM, Molitoris D, Song Y, Liu C, Vaisanen M-L, Bolte E, et al. Gastrointestinal microflora studies in late-onset autism. Clin. Infect. Dis. 2002 Sep 1;35(Suppl 1):S6–S16.
OpenUrl CrossRef PubMed Web of Science
Frank DN, St Amand AL, Feldman RA, Boedeker EC, Harpaz N, Pace NR. Molecular-phylogenetic characterization of microbial community imbalances in human inflammatory bowel diseases. Proc. Natl. Acad. Sci. U.S.A. 2007 Aug 21;104(34):13780–5. PMCID: PMC1959459
OpenUrl Abstract/FREE Full Text
Friedman SD, Shaw DW, Artru AA, Richards TL, Gardner J, Dawson G, et al. Regional brain chemical alterations in young children with autism spectrum disorder. Neurology. 2003;60(1):100–7.
OpenUrl CrossRef PubMed
↵
Frye RE, Rose S, Slattery J, Macfabe DF. Gastrointestinal dysfunction in autism spectrum disorder: the role of the mitochondria and the enteric microbiome. Microb. Ecol. Health Dis. Taylor & Francis; 2015;26(0):27458. PMCID: PMC4425813
OpenUrl
Fukuda S, Toh H, Hase K, Oshima K, Nakanishi Y, Yoshimura K, et al. Bifidobacteria can protect from enteropathogenic infection through production of acetate. Nature. Nature Publishing Group; 2011 Jan 27;469(7331):543–7.
OpenUrl
↵
Fusaro VA, Daniels J, Duda M, Deluca TF, D’Angelo O, Tamburello J, et al. The potential of accelerating early detection of autism through content analysis of YouTube videos. PLoS ONE. 2014;9(4):e93533. PMCID: PMC3989176
OpenUrl
↵
Gardener H, Spiegelman D, Buka SL. Prenatal risk factors for autism: comprehensive meta-analysis. Br J Psychiatry. 2009 Jul;195(1):7–14. PMCID: PMC3712619
OpenUrl Abstract/FREE Full Text
Gilbert JA, Krajmalnik-Brown R, Porazinska DL, Weiss SJ, Knight R. Toward effective probiotics for autism and other neurodevelopmental disorders. Cell. 2013 Dec 19;155(7):1446–8. PMCID: PMC4166551
OpenUrl CrossRef PubMed
Golubeva AV, Joyce SA, Moloney G, Burokas A, Sherwin E, Arboleya S, et al. Microbiota-related Changes in Bile Acid & Tryptophan Metabolism are Associated with Gastrointestinal Dysfunction in a Mouse Model of Autism. EBioMedicine. 2017 Oct;24:166–78. PMCID: PMC5652137
OpenUrl
↵
Gondalia SV, Palombo EA, Knowles SR, Cox SB, Meyer D, Austin DW. Molecular Characterisation of Gastrointestinal Microbiota of Children With Autism (With and Without Gastrointestinal Dysfunction) and Their Neurotypical Siblings. Autism Res. 2012 Dec 1;5(6):419–27.
OpenUrl CrossRef PubMed Web of Science
Haas KN, Blanchard JL. Kineothrix alysoides, gen. nov., sp. nov., a saccharolytic butyrate-producer within the family Lachnospiraceae. Int. J. Syst. Evol. Microbiol. Microbiology Society; 2017 Feb;67(2):402–10.
OpenUrl
↵
Hall D, Huerta MF, McAuliffe MJ, Farber GK. Sharing heterogeneous data: the national database for autism research. Neuroinformatics. 2012 Oct;10(4):331–9. PMCID: PMC4219200
OpenUrl CrossRef PubMed
↵
Harris PA, Taylor R, Thielke R, Payne J, Gonzalez N, Conde JG. Research electronic data capture (REDCap)-A metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform. 2009 Apr;42(2):377–81. PMCID: PMC2700030
OpenUrl CrossRef PubMed Web of Science
↵
Hong H, Gilbert E, Abowd GD, Arriaga RI. In-group Questions and Out-group Answers - Crowdsourcing Daily Living Advice for Individuals with Autism. CHI. 2015.
↵
Hsiao EY, McBride SW, Hsien S, Sharon G, Hyde ER, McCue T, et al. Microbiota modulate behavioral and physiological abnormalities associated with neurodevelopmental disorders. Cell. 2013 Dec 19;155(7):1451–63. PMCID: PMC3897394
OpenUrl CrossRef PubMed Web of Science
↵
Iwai S, Weinmaier T, Schmidt BL, Albertson DG, Poloso NJ, Dabbagh K, et al. Piphillin: Improved Prediction of Metagenomic Content by Direct Inference from Human Microbiomes. PLoS ONE. 2016;11(11):e0166104. PMCID: PMC5098786
OpenUrl CrossRef
↵
Jeong J-W, Morris MR, Teevan J, Liebling DJ. A Crowd-Powered Socially Embedded Search Engine. ICWSM. 2013.
Jiang Y, Qi H, Zhang XM. Co-biodegradation of anthracene and naphthalene by the bacterium Acinetobacter johnsonii. J Environ Sci Health A Tox Hazard Subst Environ Eng. 2018 Jan 4;7(386):1–9.
OpenUrl
Kaakoush NO. Insights into the Role of Erysipelotrichaceae in the Human Host. Front Cell Infect Microbiol. 2015;5.
↵
Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Research. 2017 Jan 4;45(D1):D353–61. PMCID: PMC5210567
OpenUrl CrossRef PubMed
↵
Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Research [Internet]. 2000 Jan 1;28(1):27–30. Retrieved from: http://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=10592173&retmode=ref&cmd=prlinks. PMCID: PMC102409
OpenUrl
↵
Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Research. 2016 Jan 4;44(D1):D457–62. PMCID: PMC4702792
OpenUrl CrossRef PubMed
↵
Kang D-W, Adams JB, Gregory AC, Borody T, Chittick L, Fasano A, et al. Microbiota Transfer Therapy alters gut ecosystem and improves gastrointestinal and autism symptoms: an open-label study. Microbiome. BioMed Central; 2017a;5(1). PMCID: PMC5264285
↵
Kang D-W, Ilhan ZE, Isern NG, Hoyt DW, Howsmon DP, Shaffer M, et al. Differences in fecal microbial metabolites and microbiota of children with autism spectrum disorders. Anaerobe. 2017b Dec 22;49:121–31.
OpenUrl
↵
Kang D-W, Park JG, Ilhan ZE, Wallstrom G, LaBaer J, Adams JB, et al. Reduced Incidence of Prevotella and Other Fermenters in Intestinal Microflora of Autistic Children. PLoS ONE. 2013;8(7). PMCID: PMC3700858
↵
Kosmicki JA, Sochat V, Duda M, Wall DP. Searching for a minimal set of behaviors for autism detection through feature selection-based machine learning. Transl Psychiatry. 2015;5:e514. PMCID: PMC4445756
OpenUrl
↵
Krippendorff K. Reliability in Content Analysis: Some Common Misconceptions and Recommendations. Human Communication Research. 2004 Jul 1;30(3):411–33.
OpenUrl CrossRef Web of Science
Krogius-Kurikka L, Kassinen A, Paulin L, Corander J, Mäkivuokko H, Tuimala J, et al. Sequence analysis of percent G+C fraction libraries of human faecal bacterial DNA reveals a high number of Actinobacteria. BMC Microbiol. BioMed Central; 2009 Apr 8;9(1):68. PMCID: PMC2679024
OpenUrl
↵
La Rosa PS, Brooks JP, Deych E, Boone EL, Edwards DJ, Wang Q, et al. Hypothesis testing and power calculations for taxonomic-based human microbiome data. PLoS ONE. Public Library of Science; 2012;7(12):e52078. PMCID: PMC3527355
OpenUrl
↵
Lakshmi Priya MD, Geetha A. Level of trace elements (copper, zinc, magnesium and selenium) and toxic elements (lead and mercury) in the hair and nail of children with autism. Biol Trace Elem Res. 2011 Aug;142(2):148–58.
OpenUrl CrossRef PubMed Web of Science
↵
Langille MGI, Zaneveld J, Caporaso JG, McDonald D, Knights D, Reyes JA, et al. Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nat. Biotechnol. 2013 Sep;31(9):814–21. PMCID: PMC3819121
OpenUrl CrossRef PubMed
↵
Lau NM, Green PHR, Taylor AK, Hellberg D, Ajamian M, Tan CZ, et al. Markers of Celiac Disease and Gluten Sensitivity in Children with Autism. PLoS ONE. 2013;8(6):e66155. PMCID: PMC3688832
OpenUrl CrossRef PubMed
↵
Levy S, Duda M, Haber N, Wall DP. Sparsifying machine learning models identify stable subsets of predictive features for behavioral detection of autism. Mol Autism. BioMed Central; 2017;8(1):65. PMCID: PMC5735531
OpenUrl
↵
Liu H, Wang J, He T, Becker S, Zhang G, Li D, et al. Butyrate: A Double-Edged Sword for Health? Adv Nutr. 2018 Jan 1;9(1):21–9.
OpenUrl
↵
Lowe HJ, Ferris TA, Hernandez PM, Weber SC. STRIDE--An integrated standards-based translational research informatics platform. AMIA Annu Symp Proc. American Medical Informatics Association; 2009 Nov 14;2009:391–5. PMCID: PMC2815452
OpenUrl
↵
Luo W, Friedman MS, Shedden K, Hankenson KD, Woolf PJ. GAGE: generally applicable gene set enrichment for pathway analysis. BMC Bioinformatics. 2009; 10(1):161–17.
OpenUrl CrossRef PubMed
↵
Macfabe D. Autism: metabolism, mitochondria, and the microbiome. Glob Adv Health Med. 2013 Nov;2(6):52–66. PMCID: PMC3865378
OpenUrl CrossRef PubMed
↵
Macfabe DF, Cain DP, Rodriguez-Capote K, Franklin AE, Hoffman JE, Boon F, et al. Neurobiological effects of intraventricular propionic acid in rats: possible role of short chain fatty acids on the pathogenesis and characteristics of autism spectrum disorders. Behav. Brain Res. 2007 Jan 10;176(1):149–69.
OpenUrl CrossRef PubMed Web of Science
Magnusson KR, Hauck L, Jeffrey BM, Elias V, Humphrey A, Nath R, et al. Relationships between diet-related changes in the gut microbiome and cognitive flexibility. Neuroscience. 2015 Aug 6;300:128–40.
OpenUrl CrossRef PubMed
Malinen E, Krogius-Kurikka L, Lyra A, Nikkilä J, Jääskeläinen A, Rinttilä T, et al. Association of symptoms with gastrointestinal microbiota in irritable bowel syndrome. World J. Gastroenterol. Baishideng Publishing Group Inc; 2010 Sep 28;16(36):4532–40. PMCID: PMC2945484
OpenUrl
↵
McDonald D, Hyde ER, Debelius JW, Morton JT, González A, Ackermann G, et al. American Gut: an Open Platform for Citizen-Science Microbiome Research. bioRxiv. Cold Spring Harbor Laboratory; 2018 Mar 6;: 1–80.
↵
Messaoudi M, Violle N, Bisson J-F, Desor D, Javelot H, Rougeot C. Beneficial psychological effects of a probiotic formulation (Lactobacillus helveticus R0052 and Bifidobacterium longum R0175) in healthy human volunteers. Gut Microbes. 2011 Jul;2(4):256–61.
OpenUrl CrossRef PubMed
↵
Midtvedt T. The gut: a triggering place for autism - possibilities and challenges. Microb. Ecol. Health Dis. 2012;23(0):237. PMCID: PMC3747739
OpenUrl
Miller DC, Fung M, Carbo A. A Furry Friend’s Dirty Mouth: Brain Abscess Due to Aggregatibacter (Haemophilus) aphrophilus. Am. J. Med. 2017 Oct;130(10):e447–8.
OpenUrl
Montaña S, Schramm STJ, Traglia GM, Chiem K, Parmeciano Di Noto G, Almuzara M, et al. The Genetic Analysis of an Acinetobacter johnsonii Clinical Strain Evidenced the Presence of Horizontal Genetic Transfer. Hall R, editor. PLoS ONE. 2016;11(8):e0161528. PMCID: PMC4993456
OpenUrl
↵
Mooney CZ. Monte Carlo Simulation. SAGE; 1997.
↵
Moore SJ, Turnpenny P, Quinn A, Glover S, Lloyd DJ, Montgomery T, et al. A clinical study of 57 children with fetal anticonvulsant syndromes. J. Med. Genet. 2000 Jul;37(7):489–97. PMCID: PMC1734633
OpenUrl Abstract/FREE Full Text
Mudd AT, Berding K, Wang M, Donovan SM, Dilger RN. Serum cortisol mediates the relationship between fecal Ruminococcus and brain N-acetylaspartate in the young pig. Gut Microbes. Taylor & Francis; 2017 Nov 2;8(6):589–600. PMCID: PMC5730385
OpenUrl
↵
Naka S, Yamana A, Nakano K, Okawa R, Fujita K, Kojima A, et al. Distribution of periodontopathic bacterial species in Japanese children with developmental disabilities. BMC Oral Health. BioMed Central; 2009 Sep 23;9(1):24. PMCID: PMC2758840
OpenUrl
Nichols FC, Housley WJ, O’Conor CA, Manning T, Wu S, Clark RB. Unique lipids from a common human bacterium represent a new class of Toll-like receptor 2 ligands capable of enhancing autoimmunity. Am. J. Pathol. 2009 Dec; 175(6):2430–8. PMCID: PMC2789629
OpenUrl CrossRef PubMed
Nørskov-Lauritsen N. Classification, identification, and clinical significance of Haemophilus and Aggregatibacter species with host specificity for humans. Clin. Microbiol. Rev. American Society for Microbiology; 2014 Apr;27(2):214–40. PMCID: PMC3993099
OpenUrl
↵
Parada AE, Needham DM, Fuhrman JA. Every base matters: assessing small subunit rRNA primers for marine microbiomes with mock communities, time series and global field samples. Environ. Microbiol. [Internet]. 2015 Oct 14;18(5):1403–14. Retrieved from: http://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=26271760&retmode=ref&cmd=prlinks
OpenUrl
↵
Parner ET, Baron-Cohen S, Lauritsen MB, Jørgensen M, Schieve LA, Yeargin-Allsopp M, et al. Parental age and autism spectrum disorders. Ann Epidemiol. 2012 Mar;22(3):143–50.
OpenUrl CrossRef PubMed
Parracho H, Bingham MO, Gibson GR, McCartney AL. Differences between the gut microflora of children with autistic spectrum disorders and that of healthy children. J. Med. Microbiol. 2005 Oct;54(10):987–91.
OpenUrl CrossRef PubMed Web of Science
↵
Paulson JN, Stine OC, Bravo HC, Pop M. Differential abundance analysis for microbial marker-gene surveys. Nat. Methods. BioMed Central; 2013 Sep 29;10(12):1200–2.
OpenUrl
↵
Saitou N, Nei M. The Neighbor-Joining Method - a New Method for Reconstructing Phylogenetic Trees. Mol. Biol. Evol. 1987 Jul;4(4):406–25.
OpenUrl CrossRef PubMed Web of Science
Sandler RH, Finegold SM, Bolte ER, Buchanan CP, Maxwell AP, Väisänen ML, et al. Short-term benefit from oral vancomycin treatment of regressive-onset autism. J. Child Neurol. 5 ed. 2000 Jul;15(7):429–35.
OpenUrl
↵
Schliep KP. phangorn: phylogenetic analysis in R. Bioinformatics. 2011;27(4):592–3. PMCID: PMC3035803
OpenUrl CrossRef PubMed Web of Science
Shaw W. Increased urinary excretion of a 3-(3-hydroxyphenyl)-3-hydroxypropionic acid (HPHPA), an abnormal phenylalanine metabolite of Clostridia spp. in the gastrointestinal tract, in urine samples from patients with autism and schizophrenia. Nutr Neurosci. 10 ed. Taylor & Francis; 2010 Jun;13(3):135–43.
OpenUrl CrossRef PubMed
↵
Song Y, Liu C, Finegold SM. Real-time PCR quantitation of clostridia in feces of autistic children. Appl. Environ. Microbiol. 2004 Nov;70(11):6459–65. PMCID: PMC525120
OpenUrl Abstract/FREE Full Text
Spooner R, Weigel KM, Harrison PL, Lee K, Cangelosi GA, Yilmaz Ö. In Situ Anabolic Activity of Periodontal Pathogens Porphyromonas gingivalis and Filifactor alocis in Chronic Periodontitis. Sci Rep. Nature Publishing Group; 2016 Sep 19;6(1):33638. PMCID: PMC5027532
OpenUrl
↵
Spor A, Koren O, Ley R. Unravelling the effects of theenvironment and host genotypeon the gut microbiome. Nat. Rev. Microbiol. Nature Publishing Group; 2011 Apr 1;9(4):279–90.
OpenUrl
↵
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. U.S.A. 2005 Oct 25;102(43):15545–50. PMCID: PMC1239896
OpenUrl Abstract/FREE Full Text
↵
Swan M. Crowdsourced health research studies: an important emerging complement to clinical trials in the public health research ecosystem. J Med Internet Res [Internet]. 2012 Mar 7;14(2):e46. Retrieved from: http://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=22397809&retmode=ref&cmd=prlinks. PMCID: PMC3376509
OpenUrl
Van Veen HW, Abee T, Kleefsman AW, Melgers B, Kortstee GJ, Konings WN, et al. Energetics of alanine, lysine, and proline transport in cytoplasmic membranes of the polyphosphate-accumulating Acinetobacter johnsonii strain 210A. J. Bacteriol. American Society for Microbiology (ASM); 1994 May;176(9):2670–6. PMCID: PMC205407
OpenUrl
Vital M, Howe AC, Tiedje JM. Revealing the bacterial butyrate synthesis pathways by analyzing (meta)genomic data. MBio. 2014 Apr 22;5(2):e00889–e00889–14. PMCID: PMC3994512
OpenUrl CrossRef PubMed
↵
Vuong HE, Hsiao EY. Emerging Roles for the Gut Microbiome in Autism Spectrum Disorder. Biol. Psychiatry. 2017;81(5):411–23. PMCID: PMC5285286
OpenUrl CrossRef PubMed
↵
Wald A. Tests of Statistical Hypotheses Concerning Several Parameters When the Number of Observations is Large. Transactions of the American Mathematical Society. 1943 Nov;54(3):426–82.
OpenUrl CrossRef Web of Science
↵
Wall DP, Kosmicki J, Deluca TF, Harstad E, Fusaro VA. Use of machine learning to shorten observation-based screening and diagnosis of autism. Transl Psychiatry. 2012;2:e100. PMCID: PMC3337074
OpenUrl
↵
Walters WA, Hyde ER, Berg-Lyons D, Ackermann GL, Humphrey G, Parada A, et al. Improved Bacterial 16S rRNA Gene (V4 and V4-5) and Fungal Internal Transcribed Spacer Marker Gene Primers for Microbial Community Surveys. American Society for Microbiology mSystem. 2015 Dec 16;1(1):1–10.
OpenUrl
↵
Wang L, Christophersen CT, Sorich MJ, Gerber JP, Angley MT, Conlon MA. Low Relative Abundances of the Mucolytic Bacterium Akkermansia muciniphila and Bifidobacterium spp. in Feces of Children with Autism. Appl. Environ. Microbiol. American Society for Microbiology; 2011 Sep;77(18):6718–21. PMCID: PMC3187122
OpenUrl
↵
Wang L, Christophersen CT, Sorich MJ, Gerber JP, Angley MT, Conlon MA. Elevated Fecal Short Chain Fatty Acid and Ammonia Concentrations in Children with Autism Spectrum Disorder. Dig. Dis. Sci. 2012 Aug;57(8):2096–102.
OpenUrl CrossRef PubMed
Wang L, Christophersen CT, Sorich MJ, Gerber JP, Angley MT, Conlon MA. Increased abundance of Sutterella spp. and Ruminococcus torques in feces of children with autism spectrum disorder. Mol Autism. BioMed Central; 2013 Nov 4;4(1):42. PMCID: PMC3828002
OpenUrl
↵
Wang Q, Garrity GM, Tiedje JM, Cole JR. Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl. Environ. Microbiol. 2007 Aug;73(16):5261–7. PMCID: PMC1950982
OpenUrl Abstract/FREE Full Text
Wang Y, Kasper LH. The role of microbiome in central nervous system disorders. Brain Behav. Immun. 2014 May;38:1–12.
OpenUrl CrossRef PubMed
↵
Wilcoxon F, Katti SK, Wilcox RA. Critical Values and Probability Levels for the Wilcoxon Rank Sum Test and the Wilcoxon Signed Rank Test. 1963.
↵
Williams BL, Hornig M, Buie T, Bauman ML, Paik MC, Wick I, et al. Impaired Carbohydrate Digestion and Transport and Mucosal Dysbiosis in the Intestines of Children with Autism and Gastrointestinal Disturbances. PLoS ONE. 2011;6(9). PMCID: PMC3174969
↵
Yap IKS, Angley M, Veselkov KA, Holmes E, Lindon JC, Nicholson JK. Urinary Metabolic Phenotyping Differentiates Children with Autism from Their Unaffected Siblings and Age-Matched Controls. J. Proteome Res. American Chemical Society; 2010 Jun;9(6):2996–3004.
OpenUrl
↵
Yip M, Knox WE. Glutamine-Dependent Carbamyl Phosphate Synthetase - Properties and Distribution in Normal and Neoplastic Rat Tissues. J. Biol. Chem. 1970;245(9):2199–&.
OpenUrl Abstract/FREE Full Text
↵
Yoon S-H, Ha S-M, Kwon S, Lim J, Kim Y, Seo H, et al. Introducing EzBioCloud: a taxonomically united database of 16S rRNA gene sequences and whole-genome assemblies. Int. J. Syst. Evol. Microbiol. 2017 May;67(5):1613–7. PMCID: PMC5563544
OpenUrl CrossRef
Ze X, Duncan SH, Louis P, Flint HJ. Ruminococcus bromii is a keystone species for the degradation of resistant starch in the human colon. ISME J. Nature Publishing Group; 2012 Aug;6(8):1535–43. PMCID: PMC3400402
OpenUrl

View the discussion thread.

Posted May 10, 2018.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Microbiology

Subject Areas

All Articles

Animal Behavior and Cognition (5214)
Biochemistry (11745)
Bioengineering (8751)
Bioinformatics (29195)
Biophysics (14971)
Cancer Biology (12095)
Cell Biology (17411)
Clinical Trials (138)
Developmental Biology (9421)
Ecology (14178)
Epidemiology (2067)
Evolutionary Biology (18306)
Genetics (12245)
Genomics (16801)
Immunology (11867)
Microbiology (28083)
Molecular Biology (11592)
Neuroscience (60965)
Paleontology (451)
Pathology (1870)
Pharmacology and Toxicology (3238)
Physiology (4959)
Plant Biology (10427)
Scientific Communication and Education (1683)
Synthetic Biology (2885)
Systems Biology (7339)
Zoology (1651)

[1] ↵
Adams JB, Johansen LJ, Powell LD, Quig D, Rubin RA. Gastrointestinal flora and gastrointestinal status in children with autism--comparisons to typical children and correlation with autism severity. BMC Gastroenterol. 2011;11:22. PMCID: PMC3072352
OpenUrl CrossRef PubMed

[2] Adams JB, Romdalvik J, Levine KE. Mercury in first-cut baby hair of children with autism versus typically-developing children. Toxicological & …. 2008.

[3] ↵
Anand S, Kaur H, Mande SS. Comparative In silico Analysis of Butyrate Production Pathways in Gut Commensals and Pathogens. Front Microbiol. Frontiers; 2016;7(e0142038):1945. PMCID: PMC5133246
OpenUrl

[4] ↵
Aßhauer K, Bernd W, Rolf D, Meinicke P. OP-CBIO150298 2882..2884. Hofacker I, editor. Bioinformatics [Internet]. 2015 Aug 19;31(17):2882–4. Retrieved from: https://www-ncbi-nlm-nih-gov.stanford.idm.oclc.org/pmc/articles/PMC4547618/pdf/btv287.pdf

[5] ↵
Baio J, Wiggins L, Christensen DL, Maenner MJ, Daniels J, Warren Z, et al. Prevalence of Autism Spectrum Disorder Among Children Aged 8 Years - Autism and Developmental Disabilities Monitoring Network, 11 Sites, United States, 2014. MMWR Surveill Summ. 2018 Apr 27;67(6):1–23. PMCID: PMC5919599
OpenUrl CrossRef PubMed

[6] ↵
Behrend TS, Sharek DJ, Meade AW, Wiebe EN. The viability of crowdsourcing for survey research. Behav Res Methods. 2011 Sep;43(3):800–13.
OpenUrl CrossRef PubMed Web of Science

[7] ↵
Benjamini, Hochberg Y. Controlling the False Discovery Rate: A practical and powerfu approach to Multiple Testing. Journal of the royal Statistical Society. 1995 May 17;:289–300.

[8] ↵
Bercik P, Park AJ, Sinclair D, Khoshdel A, Lu J, Huang X, et al. The anxiolytic effect of Bifidobacterium longum NCC3001 involves vagal pathways for gut-brain communication. Neurogastroenterol. Motil. 2011 Dec;23(12):1132–9. PMCID: PMC3413724
OpenUrl CrossRef PubMed

[9] ↵
Boukthir S, Matoussi N, Belhadj A, Mammou S, Dlala SB, Helayem M, et al. [Abnormal intestinal permeability in children with autism]. Tunis Med. 2010 Sep;88(9):685–6.
OpenUrl PubMed

[10] ↵
Bravo JA, Forsythe P, Chew MV, Escaravage E, Savignac HM, Dinan TG, et al. Ingestion of Lactobacillus strain regulates emotional behavior and central GABA receptor expression in a mouse via the vagus nerve. Proc. Natl. Acad. Sci. U.S.A. 2011 Sep 20;108(38):16050–5. PMCID: PMC3179073
OpenUrl Abstract/FREE Full Text

[11] Bruce-Keller AJ, Salbaum JM, Luo M, Blanchard E, Taylor CM, Welsh DA, et al. Obese-type gut microbiota induce neurobehavioral changes in the absence of obesity. Biol. Psychiatry. 2015 Apr 1;77(7):607–15. PMCID: PMC4297748
OpenUrl CrossRef PubMed

[12] ↵
Cadotte MW, Jonathan Davies T, Regetz J, Kembel SW, Cleland E, Oakley TH. Phylogenetic diversity metrics for ecological communities: integrating species richness, abundance and evolutionary history. Ecol. Lett. Blackwell Publishing Ltd; 2010 Jan;13(1):96–105.
OpenUrl

[13] ↵
Callahan BJ, McMurdie PJ, Holmes SP. Exact sequence variants should replace operational taxonomic units in marker-gene data analysis. ISME J. Nature Publishing Group; 2017 Dec;11(12):2639–43. PMCID: PMC5702726
OpenUrl

[14] ↵
Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJA, Holmes SP. DADA2: High-resolution sample inference from Illumina amplicon data. Nat. Methods. Nature Publishing Group; 2016 May 23;13(7):581–3. PMCID: PMC4927377
OpenUrl

[15] ↵
Caporaso JG, Caporaso JG, Lauber CL, Lauber CL, Walters WA, Walters WA, et al. Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proc. Natl. Acad. Sci. U.S.A. [Internet]. 2011 Mar 15;108(Supplement_1):4516–22. Retrieved from: http://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=20534432&retmode=ref&cmd=prlinks
OpenUrl

[16] ↵
Caporaso JG, Lauber CL, Walters WA, Berg-Lyons D, Huntley J, Fierer N, et al. Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms. ISME J. 2012 Aug;6(8): 1621–4. PMCID: PMC3400413
OpenUrl CrossRef PubMed Web of Science

[17] ↵
Comber AJ, Brunsdon C, See LM, Fritz S, McCallum I. Comparing Expert and Non-expert Conceptualisations of the Land - An Analysis of Crowdsourced Land Cover Data. COSIT. Cham: Springer International Publishing; 2013;8116(Chapter 14):243–60.
OpenUrl

[18] ↵
David MM, Babineau BA, Wall DP. Can we accelerate autism discoveries through crowdsourcing? Research in Autism Spectrum Disorders. 2016 Dec;32:80–3.
OpenUrl

[19] ↵
De Angelis M, Piccolo M, Vannini L, Siragusa S, De Giacomo A, Serrazzanetti DI, et al. Fecal microbiota and metabolome of children with autism and pervasive developmental disorder not otherwise specified. Heimesaat MM, editor. PLoS ONE. Public Library of Science; 2013;8(10):e76993. PMCID: PMC3793965
OpenUrl

[20] ↵
de Magistris L, Familiari V, Pascotto A, Sapone A, Frolli A, Iardino P, et al. Alterations of the intestinal barrier in patients with autism spectrum disorders and in their first-degree relatives. J. Pediatr. Gastroenterol. Nutr. 2010 Oct;51(4):418–24.
OpenUrl CrossRef PubMed

[21] DeAngelis KM, D’haeseleer P, Chivian D, Simmons B, Arkin AP, Mavromatis K, et al. Metagenomes of tropical soil-derived anaerobic switchgrass-adapted consortia with and without iron. Standards in genomic sciences [Internet]. 2013;7(3):382–98. Retrieved from: http://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=24019987&retmode=ref&cmd=prlinks
OpenUrl

[22] ↵
DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, et al. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl. Environ. Microbiol. 2006 Jul;72(7):5069–72. PMCID: PMC1489311
OpenUrl Abstract/FREE Full Text

[23] ↵
Desbonnet L, Garrett L, Clarke G, Bienenstock J, Dinan TG. The probiotic Bifidobacteria infantis: An assessment of potential antidepressant properties in the rat. J Psychiatr Res. 2008 Dec;43(2):164–74.
OpenUrl CrossRef PubMed Web of Science

[24] Di Gioia D, Aloisio I, Mazzola G, Biavati B. Bifidobacteria: their impact on gut microbiota composition and their applications as probiotics in infants. Appl Microbiol Biotechnol. Springer Berlin Heidelberg; 2014 Jan;98(2):563–77.
OpenUrl

[25] ↵
Dixon P. VEGAN, a package of R functions for community ecology. Journal of Vegetation Science. 2003 Dec;14(6):927–30.
OpenUrl CrossRef Web of Science

[26] Downs R, Perna J, Vitelli A, Cook D, Dhurjati P. Model-based hypothesis of gut microbe populations and gut/brain barrier permeabilities in the development of regressive autism. Med. Hypotheses. 2014 Dec;83(6):649–55.
OpenUrl

[27] ↵
Duda M, Daniels J, Wall DP. Clinical Evaluation of a Novel and Mobile Autism Risk Assessment. J Autism Dev Disord. 2016 Jun;46(6):1953–61. PMCID: PMC4860199
OpenUrl

[28] ↵
Duda M, Kosmicki JA, Wall DP. Testing the accuracy of an observation-based classifier for rapid detection of autism risk. Transl Psychiatry. 2014;4:e424. PMCID: PMC4150240
OpenUrl

[29] ↵
Euzéby J. List of new names and new combinations previously effectively, but not validly, published. Int. J. Syst. Evol. Microbiol. Microbiology Society; 2010 May;60(Pt 5):1009–10.
OpenUrl

[30] Ezaki T, Kawamura Y, Li N, Shu S, Li ZY, Zhao L. Proposal of the genera Anaerococcus gen. nov., Peptoniphilus gen. nov. and Gallicola gen. nov. for members of the genus Peptostreptococcus. Int. J. Syst. Evol. Microbiol. [Internet]. 2001 Jul 1;51(4):1521–8. Retrieved from: http://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=11491354&retmode=ref&cmd=prlinks
OpenUrl

[31] ↵
Faith DP, Baker AM. Phylogenetic diversity (PD) and biodiversity conservation: some bioinformatics challenges. Evol. Bioinform. Online. Libertas Academica Ltd; 2006;2:121–8. PMCID: PMC2674678
OpenUrl

[32] ↵
Finegold SM. State of the art; microbiology in health and disease. Intestinal bacterial flora in autism. Anaerobe. 2011 Dec;17(6):367–8.
OpenUrl

[33] ↵
Finegold SM, Dowd SE, Gontcharova V, Liu C, Henley KE, Wolcott RD, et al. Pyrosequencing study of fecal microflora of autistic and control children. Anaerobe. 2010 Aug;16(4):444–53.
OpenUrl CrossRef PubMed Web of Science

[34] ↵
Finegold SM, Molitoris D, Song Y, Liu C, Vaisanen M-L, Bolte E, et al. Gastrointestinal microflora studies in late-onset autism. Clin. Infect. Dis. 2002 Sep 1;35(Suppl 1):S6–S16.
OpenUrl CrossRef PubMed Web of Science

[35] Frank DN, St Amand AL, Feldman RA, Boedeker EC, Harpaz N, Pace NR. Molecular-phylogenetic characterization of microbial community imbalances in human inflammatory bowel diseases. Proc. Natl. Acad. Sci. U.S.A. 2007 Aug 21;104(34):13780–5. PMCID: PMC1959459
OpenUrl Abstract/FREE Full Text

[36] Friedman SD, Shaw DW, Artru AA, Richards TL, Gardner J, Dawson G, et al. Regional brain chemical alterations in young children with autism spectrum disorder. Neurology. 2003;60(1):100–7.
OpenUrl CrossRef PubMed

[37] ↵
Frye RE, Rose S, Slattery J, Macfabe DF. Gastrointestinal dysfunction in autism spectrum disorder: the role of the mitochondria and the enteric microbiome. Microb. Ecol. Health Dis. Taylor & Francis; 2015;26(0):27458. PMCID: PMC4425813
OpenUrl

[38] Fukuda S, Toh H, Hase K, Oshima K, Nakanishi Y, Yoshimura K, et al. Bifidobacteria can protect from enteropathogenic infection through production of acetate. Nature. Nature Publishing Group; 2011 Jan 27;469(7331):543–7.
OpenUrl

[39] ↵
Fusaro VA, Daniels J, Duda M, Deluca TF, D’Angelo O, Tamburello J, et al. The potential of accelerating early detection of autism through content analysis of YouTube videos. PLoS ONE. 2014;9(4):e93533. PMCID: PMC3989176
OpenUrl

[40] ↵
Gardener H, Spiegelman D, Buka SL. Prenatal risk factors for autism: comprehensive meta-analysis. Br J Psychiatry. 2009 Jul;195(1):7–14. PMCID: PMC3712619
OpenUrl Abstract/FREE Full Text

[41] Gilbert JA, Krajmalnik-Brown R, Porazinska DL, Weiss SJ, Knight R. Toward effective probiotics for autism and other neurodevelopmental disorders. Cell. 2013 Dec 19;155(7):1446–8. PMCID: PMC4166551
OpenUrl CrossRef PubMed

[42] Golubeva AV, Joyce SA, Moloney G, Burokas A, Sherwin E, Arboleya S, et al. Microbiota-related Changes in Bile Acid & Tryptophan Metabolism are Associated with Gastrointestinal Dysfunction in a Mouse Model of Autism. EBioMedicine. 2017 Oct;24:166–78. PMCID: PMC5652137
OpenUrl

[43] ↵
Gondalia SV, Palombo EA, Knowles SR, Cox SB, Meyer D, Austin DW. Molecular Characterisation of Gastrointestinal Microbiota of Children With Autism (With and Without Gastrointestinal Dysfunction) and Their Neurotypical Siblings. Autism Res. 2012 Dec 1;5(6):419–27.
OpenUrl CrossRef PubMed Web of Science

[44] Haas KN, Blanchard JL. Kineothrix alysoides, gen. nov., sp. nov., a saccharolytic butyrate-producer within the family Lachnospiraceae. Int. J. Syst. Evol. Microbiol. Microbiology Society; 2017 Feb;67(2):402–10.
OpenUrl

[45] ↵
Hall D, Huerta MF, McAuliffe MJ, Farber GK. Sharing heterogeneous data: the national database for autism research. Neuroinformatics. 2012 Oct;10(4):331–9. PMCID: PMC4219200
OpenUrl CrossRef PubMed

[46] ↵
Harris PA, Taylor R, Thielke R, Payne J, Gonzalez N, Conde JG. Research electronic data capture (REDCap)-A metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform. 2009 Apr;42(2):377–81. PMCID: PMC2700030
OpenUrl CrossRef PubMed Web of Science

[47] ↵
Hong H, Gilbert E, Abowd GD, Arriaga RI. In-group Questions and Out-group Answers - Crowdsourcing Daily Living Advice for Individuals with Autism. CHI. 2015.

[48] ↵
Hsiao EY, McBride SW, Hsien S, Sharon G, Hyde ER, McCue T, et al. Microbiota modulate behavioral and physiological abnormalities associated with neurodevelopmental disorders. Cell. 2013 Dec 19;155(7):1451–63. PMCID: PMC3897394
OpenUrl CrossRef PubMed Web of Science

[49] ↵
Iwai S, Weinmaier T, Schmidt BL, Albertson DG, Poloso NJ, Dabbagh K, et al. Piphillin: Improved Prediction of Metagenomic Content by Direct Inference from Human Microbiomes. PLoS ONE. 2016;11(11):e0166104. PMCID: PMC5098786
OpenUrl CrossRef

[50] ↵
Jeong J-W, Morris MR, Teevan J, Liebling DJ. A Crowd-Powered Socially Embedded Search Engine. ICWSM. 2013.

[51] Jiang Y, Qi H, Zhang XM. Co-biodegradation of anthracene and naphthalene by the bacterium Acinetobacter johnsonii. J Environ Sci Health A Tox Hazard Subst Environ Eng. 2018 Jan 4;7(386):1–9.
OpenUrl

[52] Kaakoush NO. Insights into the Role of Erysipelotrichaceae in the Human Host. Front Cell Infect Microbiol. 2015;5.

[53] ↵
Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Research. 2017 Jan 4;45(D1):D353–61. PMCID: PMC5210567
OpenUrl CrossRef PubMed

[54] ↵
Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Research [Internet]. 2000 Jan 1;28(1):27–30. Retrieved from: http://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=10592173&retmode=ref&cmd=prlinks. PMCID: PMC102409
OpenUrl

[55] ↵
Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Research. 2016 Jan 4;44(D1):D457–62. PMCID: PMC4702792
OpenUrl CrossRef PubMed

[56] ↵
Kang D-W, Adams JB, Gregory AC, Borody T, Chittick L, Fasano A, et al. Microbiota Transfer Therapy alters gut ecosystem and improves gastrointestinal and autism symptoms: an open-label study. Microbiome. BioMed Central; 2017a;5(1). PMCID: PMC5264285

[57] ↵
Kang D-W, Ilhan ZE, Isern NG, Hoyt DW, Howsmon DP, Shaffer M, et al. Differences in fecal microbial metabolites and microbiota of children with autism spectrum disorders. Anaerobe. 2017b Dec 22;49:121–31.
OpenUrl

[58] ↵
Kang D-W, Park JG, Ilhan ZE, Wallstrom G, LaBaer J, Adams JB, et al. Reduced Incidence of Prevotella and Other Fermenters in Intestinal Microflora of Autistic Children. PLoS ONE. 2013;8(7). PMCID: PMC3700858

[59] ↵
Kosmicki JA, Sochat V, Duda M, Wall DP. Searching for a minimal set of behaviors for autism detection through feature selection-based machine learning. Transl Psychiatry. 2015;5:e514. PMCID: PMC4445756
OpenUrl

[60] ↵
Krippendorff K. Reliability in Content Analysis: Some Common Misconceptions and Recommendations. Human Communication Research. 2004 Jul 1;30(3):411–33.
OpenUrl CrossRef Web of Science

[61] Krogius-Kurikka L, Kassinen A, Paulin L, Corander J, Mäkivuokko H, Tuimala J, et al. Sequence analysis of percent G+C fraction libraries of human faecal bacterial DNA reveals a high number of Actinobacteria. BMC Microbiol. BioMed Central; 2009 Apr 8;9(1):68. PMCID: PMC2679024
OpenUrl

[62] ↵
La Rosa PS, Brooks JP, Deych E, Boone EL, Edwards DJ, Wang Q, et al. Hypothesis testing and power calculations for taxonomic-based human microbiome data. PLoS ONE. Public Library of Science; 2012;7(12):e52078. PMCID: PMC3527355
OpenUrl

[63] ↵
Lakshmi Priya MD, Geetha A. Level of trace elements (copper, zinc, magnesium and selenium) and toxic elements (lead and mercury) in the hair and nail of children with autism. Biol Trace Elem Res. 2011 Aug;142(2):148–58.
OpenUrl CrossRef PubMed Web of Science

[64] ↵
Langille MGI, Zaneveld J, Caporaso JG, McDonald D, Knights D, Reyes JA, et al. Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nat. Biotechnol. 2013 Sep;31(9):814–21. PMCID: PMC3819121
OpenUrl CrossRef PubMed

[65] ↵
Lau NM, Green PHR, Taylor AK, Hellberg D, Ajamian M, Tan CZ, et al. Markers of Celiac Disease and Gluten Sensitivity in Children with Autism. PLoS ONE. 2013;8(6):e66155. PMCID: PMC3688832
OpenUrl CrossRef PubMed

[66] ↵
Levy S, Duda M, Haber N, Wall DP. Sparsifying machine learning models identify stable subsets of predictive features for behavioral detection of autism. Mol Autism. BioMed Central; 2017;8(1):65. PMCID: PMC5735531
OpenUrl

[67] ↵
Liu H, Wang J, He T, Becker S, Zhang G, Li D, et al. Butyrate: A Double-Edged Sword for Health? Adv Nutr. 2018 Jan 1;9(1):21–9.
OpenUrl

[68] ↵
Lowe HJ, Ferris TA, Hernandez PM, Weber SC. STRIDE--An integrated standards-based translational research informatics platform. AMIA Annu Symp Proc. American Medical Informatics Association; 2009 Nov 14;2009:391–5. PMCID: PMC2815452
OpenUrl

[69] ↵
Luo W, Friedman MS, Shedden K, Hankenson KD, Woolf PJ. GAGE: generally applicable gene set enrichment for pathway analysis. BMC Bioinformatics. 2009; 10(1):161–17.
OpenUrl CrossRef PubMed

[70] ↵
Macfabe D. Autism: metabolism, mitochondria, and the microbiome. Glob Adv Health Med. 2013 Nov;2(6):52–66. PMCID: PMC3865378
OpenUrl CrossRef PubMed

[71] ↵
Macfabe DF, Cain DP, Rodriguez-Capote K, Franklin AE, Hoffman JE, Boon F, et al. Neurobiological effects of intraventricular propionic acid in rats: possible role of short chain fatty acids on the pathogenesis and characteristics of autism spectrum disorders. Behav. Brain Res. 2007 Jan 10;176(1):149–69.
OpenUrl CrossRef PubMed Web of Science

[72] Magnusson KR, Hauck L, Jeffrey BM, Elias V, Humphrey A, Nath R, et al. Relationships between diet-related changes in the gut microbiome and cognitive flexibility. Neuroscience. 2015 Aug 6;300:128–40.
OpenUrl CrossRef PubMed

[73] Malinen E, Krogius-Kurikka L, Lyra A, Nikkilä J, Jääskeläinen A, Rinttilä T, et al. Association of symptoms with gastrointestinal microbiota in irritable bowel syndrome. World J. Gastroenterol. Baishideng Publishing Group Inc; 2010 Sep 28;16(36):4532–40. PMCID: PMC2945484
OpenUrl

[74] ↵
McDonald D, Hyde ER, Debelius JW, Morton JT, González A, Ackermann G, et al. American Gut: an Open Platform for Citizen-Science Microbiome Research. bioRxiv. Cold Spring Harbor Laboratory; 2018 Mar 6;: 1–80.

[75] ↵
Messaoudi M, Violle N, Bisson J-F, Desor D, Javelot H, Rougeot C. Beneficial psychological effects of a probiotic formulation (Lactobacillus helveticus R0052 and Bifidobacterium longum R0175) in healthy human volunteers. Gut Microbes. 2011 Jul;2(4):256–61.
OpenUrl CrossRef PubMed

[76] ↵
Midtvedt T. The gut: a triggering place for autism - possibilities and challenges. Microb. Ecol. Health Dis. 2012;23(0):237. PMCID: PMC3747739
OpenUrl

[77] Miller DC, Fung M, Carbo A. A Furry Friend’s Dirty Mouth: Brain Abscess Due to Aggregatibacter (Haemophilus) aphrophilus. Am. J. Med. 2017 Oct;130(10):e447–8.
OpenUrl

[78] Montaña S, Schramm STJ, Traglia GM, Chiem K, Parmeciano Di Noto G, Almuzara M, et al. The Genetic Analysis of an Acinetobacter johnsonii Clinical Strain Evidenced the Presence of Horizontal Genetic Transfer. Hall R, editor. PLoS ONE. 2016;11(8):e0161528. PMCID: PMC4993456
OpenUrl

[79] ↵
Mooney CZ. Monte Carlo Simulation. SAGE; 1997.

[80] ↵
Moore SJ, Turnpenny P, Quinn A, Glover S, Lloyd DJ, Montgomery T, et al. A clinical study of 57 children with fetal anticonvulsant syndromes. J. Med. Genet. 2000 Jul;37(7):489–97. PMCID: PMC1734633
OpenUrl Abstract/FREE Full Text

[81] Mudd AT, Berding K, Wang M, Donovan SM, Dilger RN. Serum cortisol mediates the relationship between fecal Ruminococcus and brain N-acetylaspartate in the young pig. Gut Microbes. Taylor & Francis; 2017 Nov 2;8(6):589–600. PMCID: PMC5730385
OpenUrl

[82] ↵
Naka S, Yamana A, Nakano K, Okawa R, Fujita K, Kojima A, et al. Distribution of periodontopathic bacterial species in Japanese children with developmental disabilities. BMC Oral Health. BioMed Central; 2009 Sep 23;9(1):24. PMCID: PMC2758840
OpenUrl

[83] Nichols FC, Housley WJ, O’Conor CA, Manning T, Wu S, Clark RB. Unique lipids from a common human bacterium represent a new class of Toll-like receptor 2 ligands capable of enhancing autoimmunity. Am. J. Pathol. 2009 Dec; 175(6):2430–8. PMCID: PMC2789629
OpenUrl CrossRef PubMed

[84] Nørskov-Lauritsen N. Classification, identification, and clinical significance of Haemophilus and Aggregatibacter species with host specificity for humans. Clin. Microbiol. Rev. American Society for Microbiology; 2014 Apr;27(2):214–40. PMCID: PMC3993099
OpenUrl

[85] ↵
Parada AE, Needham DM, Fuhrman JA. Every base matters: assessing small subunit rRNA primers for marine microbiomes with mock communities, time series and global field samples. Environ. Microbiol. [Internet]. 2015 Oct 14;18(5):1403–14. Retrieved from: http://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=26271760&retmode=ref&cmd=prlinks
OpenUrl

[86] ↵
Parner ET, Baron-Cohen S, Lauritsen MB, Jørgensen M, Schieve LA, Yeargin-Allsopp M, et al. Parental age and autism spectrum disorders. Ann Epidemiol. 2012 Mar;22(3):143–50.
OpenUrl CrossRef PubMed

[87] Parracho H, Bingham MO, Gibson GR, McCartney AL. Differences between the gut microflora of children with autistic spectrum disorders and that of healthy children. J. Med. Microbiol. 2005 Oct;54(10):987–91.
OpenUrl CrossRef PubMed Web of Science

[88] ↵
Paulson JN, Stine OC, Bravo HC, Pop M. Differential abundance analysis for microbial marker-gene surveys. Nat. Methods. BioMed Central; 2013 Sep 29;10(12):1200–2.
OpenUrl

[89] ↵
Saitou N, Nei M. The Neighbor-Joining Method - a New Method for Reconstructing Phylogenetic Trees. Mol. Biol. Evol. 1987 Jul;4(4):406–25.
OpenUrl CrossRef PubMed Web of Science

[90] Sandler RH, Finegold SM, Bolte ER, Buchanan CP, Maxwell AP, Väisänen ML, et al. Short-term benefit from oral vancomycin treatment of regressive-onset autism. J. Child Neurol. 5 ed. 2000 Jul;15(7):429–35.
OpenUrl

[91] ↵
Schliep KP. phangorn: phylogenetic analysis in R. Bioinformatics. 2011;27(4):592–3. PMCID: PMC3035803
OpenUrl CrossRef PubMed Web of Science

[92] Shaw W. Increased urinary excretion of a 3-(3-hydroxyphenyl)-3-hydroxypropionic acid (HPHPA), an abnormal phenylalanine metabolite of Clostridia spp. in the gastrointestinal tract, in urine samples from patients with autism and schizophrenia. Nutr Neurosci. 10 ed. Taylor & Francis; 2010 Jun;13(3):135–43.
OpenUrl CrossRef PubMed

[93] ↵
Song Y, Liu C, Finegold SM. Real-time PCR quantitation of clostridia in feces of autistic children. Appl. Environ. Microbiol. 2004 Nov;70(11):6459–65. PMCID: PMC525120
OpenUrl Abstract/FREE Full Text

[94] Spooner R, Weigel KM, Harrison PL, Lee K, Cangelosi GA, Yilmaz Ö. In Situ Anabolic Activity of Periodontal Pathogens Porphyromonas gingivalis and Filifactor alocis in Chronic Periodontitis. Sci Rep. Nature Publishing Group; 2016 Sep 19;6(1):33638. PMCID: PMC5027532
OpenUrl

[95] ↵
Spor A, Koren O, Ley R. Unravelling the effects of theenvironment and host genotypeon the gut microbiome. Nat. Rev. Microbiol. Nature Publishing Group; 2011 Apr 1;9(4):279–90.
OpenUrl

[96] ↵
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. U.S.A. 2005 Oct 25;102(43):15545–50. PMCID: PMC1239896
OpenUrl Abstract/FREE Full Text

[97] ↵
Swan M. Crowdsourced health research studies: an important emerging complement to clinical trials in the public health research ecosystem. J Med Internet Res [Internet]. 2012 Mar 7;14(2):e46. Retrieved from: http://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=22397809&retmode=ref&cmd=prlinks. PMCID: PMC3376509
OpenUrl

[98] Van Veen HW, Abee T, Kleefsman AW, Melgers B, Kortstee GJ, Konings WN, et al. Energetics of alanine, lysine, and proline transport in cytoplasmic membranes of the polyphosphate-accumulating Acinetobacter johnsonii strain 210A. J. Bacteriol. American Society for Microbiology (ASM); 1994 May;176(9):2670–6. PMCID: PMC205407
OpenUrl

[99] Vital M, Howe AC, Tiedje JM. Revealing the bacterial butyrate synthesis pathways by analyzing (meta)genomic data. MBio. 2014 Apr 22;5(2):e00889–e00889–14. PMCID: PMC3994512
OpenUrl CrossRef PubMed

[100] ↵
Vuong HE, Hsiao EY. Emerging Roles for the Gut Microbiome in Autism Spectrum Disorder. Biol. Psychiatry. 2017;81(5):411–23. PMCID: PMC5285286
OpenUrl CrossRef PubMed

[101] ↵
Wald A. Tests of Statistical Hypotheses Concerning Several Parameters When the Number of Observations is Large. Transactions of the American Mathematical Society. 1943 Nov;54(3):426–82.
OpenUrl CrossRef Web of Science

[102] ↵
Wall DP, Kosmicki J, Deluca TF, Harstad E, Fusaro VA. Use of machine learning to shorten observation-based screening and diagnosis of autism. Transl Psychiatry. 2012;2:e100. PMCID: PMC3337074
OpenUrl

[103] ↵
Walters WA, Hyde ER, Berg-Lyons D, Ackermann GL, Humphrey G, Parada A, et al. Improved Bacterial 16S rRNA Gene (V4 and V4-5) and Fungal Internal Transcribed Spacer Marker Gene Primers for Microbial Community Surveys. American Society for Microbiology mSystem. 2015 Dec 16;1(1):1–10.
OpenUrl

[104] ↵
Wang L, Christophersen CT, Sorich MJ, Gerber JP, Angley MT, Conlon MA. Low Relative Abundances of the Mucolytic Bacterium Akkermansia muciniphila and Bifidobacterium spp. in Feces of Children with Autism. Appl. Environ. Microbiol. American Society for Microbiology; 2011 Sep;77(18):6718–21. PMCID: PMC3187122
OpenUrl

[105] ↵
Wang L, Christophersen CT, Sorich MJ, Gerber JP, Angley MT, Conlon MA. Elevated Fecal Short Chain Fatty Acid and Ammonia Concentrations in Children with Autism Spectrum Disorder. Dig. Dis. Sci. 2012 Aug;57(8):2096–102.
OpenUrl CrossRef PubMed

[106] Wang L, Christophersen CT, Sorich MJ, Gerber JP, Angley MT, Conlon MA. Increased abundance of Sutterella spp. and Ruminococcus torques in feces of children with autism spectrum disorder. Mol Autism. BioMed Central; 2013 Nov 4;4(1):42. PMCID: PMC3828002
OpenUrl

[107] ↵
Wang Q, Garrity GM, Tiedje JM, Cole JR. Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl. Environ. Microbiol. 2007 Aug;73(16):5261–7. PMCID: PMC1950982
OpenUrl Abstract/FREE Full Text

[108] Wang Y, Kasper LH. The role of microbiome in central nervous system disorders. Brain Behav. Immun. 2014 May;38:1–12.
OpenUrl CrossRef PubMed

[109] ↵
Wilcoxon F, Katti SK, Wilcox RA. Critical Values and Probability Levels for the Wilcoxon Rank Sum Test and the Wilcoxon Signed Rank Test. 1963.

[110] ↵
Williams BL, Hornig M, Buie T, Bauman ML, Paik MC, Wick I, et al. Impaired Carbohydrate Digestion and Transport and Mucosal Dysbiosis in the Intestines of Children with Autism and Gastrointestinal Disturbances. PLoS ONE. 2011;6(9). PMCID: PMC3174969

[111] ↵
Yap IKS, Angley M, Veselkov KA, Holmes E, Lindon JC, Nicholson JK. Urinary Metabolic Phenotyping Differentiates Children with Autism from Their Unaffected Siblings and Age-Matched Controls. J. Proteome Res. American Chemical Society; 2010 Jun;9(6):2996–3004.
OpenUrl

[112] ↵
Yip M, Knox WE. Glutamine-Dependent Carbamyl Phosphate Synthetase - Properties and Distribution in Normal and Neoplastic Rat Tissues. J. Biol. Chem. 1970;245(9):2199–&.
OpenUrl Abstract/FREE Full Text

[113] ↵
Yoon S-H, Ha S-M, Kwon S, Lim J, Kim Y, Seo H, et al. Introducing EzBioCloud: a taxonomically united database of 16S rRNA gene sequences and whole-genome assemblies. Int. J. Syst. Evol. Microbiol. 2017 May;67(5):1613–7. PMCID: PMC5563544
OpenUrl CrossRef

[114] Ze X, Duncan SH, Louis P, Flint HJ. Ruminococcus bromii is a keystone species for the degradation of resistant starch in the human colon. ISME J. Nature Publishing Group; 2012 Aug;6(8):1535–43. PMCID: PMC3400402
OpenUrl

Crowdsourced study of children with autism and their typically developing siblings identifies differences in taxonomic and predicted function for stool-associated microbes using exact sequence variant analysis

SHORT ABSTRACT

INTRODUCTION

MATERIALS AND METHODS

Crowdsourcing Recruitment and Data Collection

Sampling Kits

ASD Diagnosis Confirmation

Mobile Autism Risk Assessment

Video Analysis

Lifestyle and Dietary Practices

DNA Extraction, Amplification and Sequencing

Sequence Filtering, Chimera Removal, Taxonomic Assignment and Phylogenic Tree

Statistical Analyses

Analysis of Alpha-Diversity Differences

Identification of Dietary and Lifestyle Habits Influencing the Microbial Community

Permutation Test on Sibling Pair Differentials

Models to Maximize the Likelihood of Detecting Low Abundance Species

Differential Ribosomal Analysis Based on the Negative Binomial Distribution

Zero Inflated Gaussian Analysis

Functional Profile Prediction using Piphillan

Gene Set Enrichment Analysis Using KEGG Orthologs

Power Calculation

RESULTS

Crowd Sourcing Recruitment and Participant Demographics

ASD Diagnosis Confirmation using the Mobile Autism Risk Assessment (MARA) and Video Classifier

Diet Differences between Children with ASD and Neurotypical Siblings

Dietary and Lifestyle Habits Influencing the Microbial Community

Similarity between Sibling Lifestyles

Reported Gastrointestinal Symptoms

Microbial Alpha-Diversity

Permutation Test on Sibling Pairs to determine ESVs that differentiate between ASD and NT

Models to Maximize the Likelihood of Detecting Low Abundance Species

Functional Profile Prediction

DISCUSSION

Crowdsourcing Recruitment

Lifestyle, Dietary Practices and GI symptoms

Microbial Community Diversity

Differential abundance analysis of microbial species

ESVs enriched in the ASD cohort

ESVs depleted in the ASD cohort

Pathway analysis

Butyrate Production Pathway

Power Calculation and Sample Size

Limitations

Future Work

Conclusion

Competing interests

Acknowledgements

References

Citation Manager Formats

Subject Area