De novo damaging coding mutations are strongly associated with obsessive-compulsive disorder and overlap with autism

Carolina Cappi; Melody E. Oliphant; Zsanett Péter; Gwyneth Zai; Catherine A. W. Sullivan; Abha R. Gupta; Ellen J. Hoffman; Manmeet Virdee; A. Jeremy Willsey; Roseli G. Shavitt; Euripedes C. Miguel; James L. Kennedy; Margaret A. Richter; Thomas V. Fernandez

doi:10.1101/127712

ABSTRACT

Obsessive-compulsive disorder (OCD) is a debilitating developmental neuropsychiatric disorder with a genetic risk component, yet identification of high-confidence risk genes has been challenging. We performed whole-exome sequencing in 222 OCD parent-child trios (184 trios after quality control), finding strong evidence that de novo likely gene disrupting and predicted damaging missense variants contribute to OCD risk. Together, these de novo damaging variants are enriched in OCD probands (RR 1.52, p=0.0005). We identified two high-confidence risk genes, each containing two de novo damaging variants in unrelated probands: CHD8 (Chromodomain Helicase DNA Binding Protein 8) and SCUBE1 (Signal Peptide, CUB Domain And EGF Like Domain Containing 1). Based on our data, we estimate that 34% of de novo damaging variants seen in OCD contribute to risk, and that de novo damaging variants in approximately 335 genes contribute to risk in 22% of OCD cases. Furthermore, genes harboring de novo damaging variants in OCD are enriched for those reported in neurodevelopmental disorders, particularly autism spectrum disorders. An exploratory network analysis reveals significant functional connectivity and enrichment in canonical pathways related to immune response.

SIGNIFICANCE STATEMENT Decades of genetic studies in obsessive-compulsive disorder (OCD) have yet to provide reproducible, statistically significant findings. Following an approach that has led to tremendous success in gene discovery for several neuropsychiatric disorders, here we report findings from DNA whole-exome sequencing of patients with OCD and their parents. We find strong evidence for the contribution of spontaneous, or de novo, predicted-damaging genetic variants to OCD risk, identify two high-confidence risk genes, and detect significant overlap with genes previously identified in autism. These results change the status quo of OCD genetics by identifying novel OCD risk genes, clarifying the genetic landscape of OCD with respect to de novo variation, and suggesting underlying biological pathways that will improve our understanding of OCD biology.

INTRODUCTION

Obsessive-compulsive disorder (OCD) is an often-disabling developmental neuropsychiatric disorder with onset typically during adolescence or young adulthood and a lifetime prevalence of 1.5-2.5% (1-5). Obsessions are intrusive thoughts, images, or urges experienced as irrational, excessive, and accompanied by anxiety or discomfort. Compulsions are behaviors undertaken to mitigate obsessions or subjective feelings (i.e., the need to relieve a tactile sensation or achieve a “just right” feeling); they are usually repetitive, stereotyped, and excessive (6, 7). The anxiety or distress associated with obsessions and compulsions and the time spent on them are sources of lifelong morbidity in OCD, having profound negative effects on both patients’ and families’ quality of life. Symptoms can be so disabling that the World Health Organization has ranked OCD among the 10 most debilitating disorders of any kind, in terms of lost earnings and diminished quality of life (8, 9). Furthermore, OCD has been linked to significantly increased mortality, even after controlling for comorbid psychiatric conditions, which can occur in up to 75% of cases (10, 11). Treatment-refractory disease is common, with about 40% of patients resistant to current pharmacological and psychotherapeutic treatments, and untreated OCD generally persists and becomes chronic (12, 13). The causes and underlying pathophysiology of OCD are not well understood, which has limited the development of new treatments and interventions. For these reasons, there is an urgent need for more research to elucidate OCD risk factors and disease mechanisms.

Twin and family studies provide strong evidence for a substantial genetic contribution to OCD risk, with modern estimates of heritability around 40-50% (14-17), yet progress in identifying risk genes has been slow. Decades of linkage, common-variant candidate gene association studies, and more recent genome-wide association studies in OCD (18-20) have yielded few reproducible associations and therefore have provided limited insights into disease biology. Further efforts are clearly needed to identify specific OCD risk variants and to confirm vulnerability pathways by modern genome-wide and comprehensive variant discovery approaches.

In contrast, genetic research into several other neuropsychiatric disorders has seen great progress in recent years, partly attributable to an increasing effort to evaluate the contribution of rare genetic sequence variation, especially de novo variants (arising spontaneously in the parental germ cells or in a zygote shortly after conception). This approach has shown great success for systematic risk gene discovery in other genetically complex neuropsychiatric disorders (21-23), particularly autism spectrum disorders (24-27). While an individual rare variant is unlikely to explain a sizeable fraction of disease risk in the context of a heterogeneous genetic architecture, concurrent investigations of multiple genes implicated by rare sequence and structural variation highlight convergence toward a limited number of important underlying biological mechanisms (28). Therefore, there is a proven avenue for risk gene discovery in complex neuropsychiatric disorders that has yet to be fully leveraged in OCD.

Following these previous studies in other disorders and our pilot study suggesting a role for de novo single nucleotide variants (SNVs) in OCD risk (29), we performed whole-exome sequencing (WES) in 222 OCD parent-child trios to identify de novo SNVs and insertion-deletion variants (indels). In 184 OCD trios passing quality control, we find strong evidence for the contribution of de novo likely gene disrupting (LGD; disruption of a stop codon, canonical splice site, or a frameshift indel) as well as predicted damaging missense (Mis-D) variants to OCD. Furthermore, we identified two high-confidence candidate risk genes based on observing gene-level recurrence of de novo damaging (LGD + Mis-D) variants in unrelated probands: CHD8 (Chromodomain Helicase DNA Binding Protein 8) and SCUBE1 (Signal Peptide, CUB Domain And EGF Like Domain Containing 1). We estimated that 22% of OCD cases will harbor a de novo damaging SNV or indel mediating OCD risk, and that there are approximately 335 genes affected by such variants contributing to the risk. Finally, we detected significant overlap between genes with damaging de novo variants in OCD and autism.

RESULTS

Damaging de novo SNVs and indels are associated with OCD risk

Exome sequencing was performed on 222 OCD parent-child trios. WES data from 855 unaffected trios, already sequenced from the Simons Simplex Collection, were pooled with our OCD trios for joint variant calling. After quality control methods, our sample size for a burden analysis was 184 OCD and 777 unaffected trios (Figure 1, Table 1, Table S1). To compare the de novo mutation rates between cases and controls, we limited our analysis to loci with at least 20x coverage in all members of a trio, as this was our pre-defined threshold for calling a de novo variant (see Methods). Based on our OCD pilot study (29) and work in other neurodevelopmental disorders (22, 24, 26, 30), we expected to find an enrichment of de novo LGD variants (stop codon, frameshift, or canonical splice-site variants) in OCD probands versus controls. We found a statistically significant increased rate of de novo LGD variants in OCD cases, confirming our hypothesis (rate ratio [RR] 1.93, 95% Confidence Interval [CI] 1.19-3.09, p=0.01). Furthermore, de novo missense variants predicted to be damaging by PolyPhen2 (Mis-D; Polyphen2 HDIV score ≥0.957) were also over-represented in OCD probands (RR 1.43, CI 1.13-1.80, p=0.006). Taken together, damaging de novo coding variants (LGD and Mis-D) occur more often in OCD probands versus controls (RR 1.52, CI 1.23-1.86, p=0.0005). We did not detect a difference in mutation rates for de novo synonymous variants (RR 0.99, CI 0.75-1.31, p=0.5). (See Table 1, Figure 2A)

Figure 1 Study summary.

We performed whole exome sequencing on 222 OCD and 855 control parent-child trios. After quality control, 184 OCD and 777 control trios remained for subsequent analyses. A burden analysis compared the rates of de novo single nucleotide (SNVs) and insertion-deletion (indel) variants between cases and controls. Next, we used the TADA-Denovo algorithm to assess the significance of gene-level recurrence of de novo damaging variants in our OCD group, identifying two high-confidence risk genes. Exploratory network, pathway, and cross-disorder analyses were then performed using genes harboring de novo damaging variants in our OCD subjects. Finally, based on the number of de novo damaging variants in OCD versus controls, we estimated the number of genes contributing to OCD risk, and used this estimate to predict future risk gene discovery as additional OCD parent-child trios are studied by exome sequencing.

Figure 2 De novo damaging variants are associated with OCD risk.

(A) Bar chart comparing the rates of de novo mutation types between OCD cases (red) and controls (blue). Comparisons are between per base pair (bp) mutation rates, considering only those “callable” loci in each family and cohort that meet required sequencing depth and quality scores to support high confidence de novo variant calling. Mutation rates were compared using a one-tailed rate ratio test. Statistically significant comparisons (p<0.05) are marked with asterisks. Error bars show 95% confidence intervals. (B) For the enriched classes of de novo variants, we quantified their contribution to OCD risk in two ways. First, we estimated the percentage of observed variants carrying risk by dividing the difference in rates (estimated coding variants per individual, see Table 1 and Methods) by the rate in OCD. Second, we estimated the percentage of cases with a mutation mediating risk by subtracting the proportion of controls carrying a mutation from the proportion in OCD probands carrying a mutation.

View this table:

Table 1 Distribution of de novo variants in OCD cases and controls

Damaging de novo SNVs and indels contribute to OCD risk in 22% of cases

Next, we estimated the fraction of observed de novo mutations that contribute to OCD risk, based on our dataset. By dividing the de novo mutation rate difference between cases and controls by the rate in cases, we estimate that 49.2% (CI 3.4-95.0%) of de novo LGD and 29.5% (CI 6.0-53.0%) of de novo Mis-D mutations contribute to OCD risk. As a group, we estimate that 33.9% (CI 13.3-54.6%) of damaging (LGD + Mis-D) de novo mutations contribute to OCD risk (Figure 2B).

We also used our data to estimate the proportion of cases harboring a de novo mutation contributing to OCD risk. By subtracting the percentage of controls from the percentage of OCD probands with at least one de novo mutation, we estimate that 15.0% (CI 3.1-26.9%) have a de novo Mis-D mutation and 7.3% (CI 0.50-14.0%) have a de novo LGD mutation mediating OCD risk. As a group, we estimate that 22.2% (CI 8.7-35.8%) of cases have a damaging de novo mutation contributing to OCD risk (Figure 2B).

Recurrent damaging de novo variants identify two candidate risk genes

Having established that de novo damaging variants occur more frequently in OCD probands, we next asked whether these variants cluster within specific genes. We identified three genes with multiple (≥2) de novo LGD or Mis-D variants in unrelated probands. Using TADA-Denovo (31) and previously established false discovery rate (FDR) thresholds, two of these genes met criteria for high-confidence risk genes (q<0.1): SCUBE1 (Signal Peptide, CUB Domain And EGF Like Domain Containing 1; q=0.040) and CHD8 (Chromodomain Helicase DNA Binding Protein 8; q=0.043) (Table 2). A third gene, TTN (Titin), did not meet this threshold (q=0.47) (Table 2).

View this table:

Table 2 Risk gene discovery in OCD

Approximately 335 genes contribute to OCD risk

Based on OCD proband vulnerability to de novo damaging variants in our dataset, we used two methods to estimate the number of genes contributing to OCD risk. Using a maximum likelihood estimation (MLE) method (32), we determined the most likely number of genes to be 335 (Figure S1). This agrees with an alternate method based on the “unseen species problem” (33); the estimated number of OCD risk genes using this alternate method is 317 (95% CI 190-454).

Figure S1 Maximum Likelihood Estimate (MLE) of number of OCD risk genes.

Assuming each number of possible risk genes between 1-2,500, 100,000 simulations were conducted to determine the number of risk genes that yielded the closest agreement between our observed and simulated data. In each simulation, we generated 95 variants (the number of de novo damaging variants observed in our OCD sample), then randomly assigned a percentage of variants (determined by the fraction of de novo damaging variants estimated to carry OCD risk) to the risk genes, recording the frequency at which the number of genes with two and three recurrent variants matched the number observed in our study (2 and 1, respectively). This MLE method yields an estimate of 335 OCD risk genes (red vertical line), a number that is in close agreement with that from an alternative “unseen species” method (317 genes, 95% CI: 190-454).

Next, we used the estimated number of OCD risk genes (n=335) to predict the likely future gene discovery yield as additional OCD trios are investigated by WES. Based upon 10,000 simulations at each cohort size, we predict discovery of the following numbers of risk genes as we sequence more OCD parent-child trios: 24 probable risk genes, including 11 high-confidence risk genes (24 / 11 genes) at 500 trios; 77 / 40 genes at 1,000 trios; 202 / 113 genes at 2,000 trios; 323 / 189 genes at 3,000 trios (Figure S2).

Figure S2 Gene discovery by number of trios sequenced.

Using the MLE estimate of 335 risk genes, we estimated the number of probable (q<0.3) and high-confidence (q<0.1) risk genes that will be discovered as more OCD trios are sequenced. We performed 10,000 simulations at each cohort size from 25-3,000 trios, randomly generating variants and assigning to risk genes in agreement with the proportions seen in our data, then applying the TADA-Denovo algorithm.

Overlap with ASD and CHD8 target genes

Using DNENRICH (34), we found significant overlap between genes harboring de novo damaging variants in OCD (n=89, excludes occurrences in controls) and several gene sets from the literature (Table 3, Table S4). Our OCD genes were significantly enriched for genes harboring de novo nonsynonymous (LGD, missense) variants in autism (ASD), genes achieving TADA q<0.1 in ASD, genes with genome-wide significant statistical evidence for association with developmental disorders, and genes that are targets of CHD8 in the developing human brain. There was no significant enrichment for genes harboring de novo variants in intellectual disability (ID) or schizophrenia (SCZ), no enrichment for de novo synonymous variants in any disorder, and no enrichment for any class of de novo variation in unaffected siblings in the SSC (Table 3, Table S4).

View this table:

Table 3 Overlap between OCD de novo damaging mutations and gene sets

Exploratory pathway and network analyses

Using our list of genes harboring de novo damaging variants in OCD (n=89), we performed exploratory analyses to determine shared underlying canonical pathways and functional connectivity. Using the GeNets algorithm, OCD genes mapping onto a meta-network displayed significantly more connectivity than expected by chance (p=0.026), and 68 additional “tier 1” candidate genes were predicted, based on their high connectivity to our input genes (Figure S6, Table S5). Our input gene list is significantly enriched for canonical pathways related to immune response, particularly the complement system, which was a top result from two pathway analysis tools, MetaCore and IPA. Other significant canonical pathways include granulocyte-macrophage colony stimulating factor (GM-CSF) signaling, neurotrophin/tyrosine kinase signaling, B cell receptor signaling, and focal adhesion kinase signaling (Table S6).

DISCUSSION

By whole-exome sequencing of OCD parent-child trios, we have demonstrated a strong association between de novo damaging (LGD and Mis-D) coding variants and OCD cases (Table 1, Figure 2). As seen in studies of other neurodevelopmental disorders, these results can be leveraged to systematically identify OCD risk genes. In the current study, two genes, CHD8 and SCUBE1, have a FDR q<0.1, meeting criteria for high-confidence association with OCD (Table 2).

SCUBE1 has not been extensively studied. While it is expressed in the developing brain and nervous system (35, 36), functional studies to date have focused mostly on its potential role in platelet activation and adhesion (37, 38). A study in mice has shown downregulated SCUBE1 expression in response to inflammatory stimuli (36), but this gene has not yet been implicated in disorders of the brain or nervous system. Interestingly, increased levels of pro-inflammatory markers have been reported in several studies of children and adolescents with OCD (39-42).

On the other hand, there are several recent and ongoing studies of CHD8, a gene that has emerged as having the strongest association with autism spectrum disorder via the identification of multiple de novo LGD variants in unrelated parent-child trios (Figure S5) (33, 43-45). CHD8 is highly expressed in the developing brain (46). It encodes an ATP-dependent chromatin remodeler that binds to tri-methylated histone H3 lysine 4, a post-translational histone modification present at active promoters (47-49). Loss of CHD8 function appears to contribute to autism pathology by disrupting the expression of its target genes, which are themselves enriched for high confidence autism risk genes (46). While OCD subjects with de novo damaging CHD8 variants in our study do not meet any diagnostic criteria for autism, this finding suggests there may be overlapping biological mechanisms between the two disorders, and leads us to hypothesize that genes regulated by CHD8 may similarly be enriched for OCD risk genes. Indeed, we see significant overlap between our OCD genes and ASD genes, as well as CHD8 gene targets mapped in the developing human brain (46) (Table 3).

Figure S3 PCA scree and individual plots.

Scree plots following Principal Components Analysis (PCA), showing (A) the percentage of variance captured by each of the first 32 principal components, and (B) the cumulative percentage of variance captured by these same components in the exome metrics data from cases and controls. The “elbow” of the scree plot is visualized to be around the 5^th principal component. This was confirmed by the Factominer R code function “estim_ncp()”. The first 5 PCs capture almost 80% of the variance, and this number of PCs was used to determine PCA outliers during quality control (see Table S1 and Supplementary Methods). (C) Individual plots for the first two principal components, based on PCA of exome sequencing quality metrics. OCD cases are plotted in red, and controls in blue. The first two PCs together capture 50.1% of the variance. R code to generate this data and figure are in Supplementary Methods, and individual PC factor values are in Table S1. This figure includes PCA outliers (>3 standard deviations from the mean in PCs 1-5), which were removed during quality control, prior to further analysis of case-control data.

Figure S4 Sequencing coverage and parental age distributions.

Distribution boxplots of values for (A) mean target coverage, (B) median target coverage, (C) number of base pairs in all “callable” regions, (D) number of base pairs in coding “callable” regions, (E) paternal age, and (F) maternal age for both OCD and control cohorts. For each cohort, the box extends from the first through third quartiles, and the horizontal line is at the second quartile (median) of the data. Whiskers extend to the largest non-outliers, and outlier data points are plotted individually. For each comparison, a p-value was calculated using a two-sided Wilcoxon rank sum test with continuity correction. Panels A-D show increased opportunity for variant calling in the OCD cohort, necessitating the use of de novo mutation rate comparisons within the callable exome, as explained in the main text and methods. Panels E-F show no significant difference in parental ages between case and control cohorts. Also see Table S1.

Figure S5 CHD8 variants in OCD and ASD.

Two de novo likely gene disrupting (LGD, red) and damaging missense (Mis-D, black) variants identified in CHD8 among OCD probands are indicated. ASD-associated de novo LGD and Mis-D mutations reported in the Simons Foundation Autism Research Initiative (SFARI) database (accessed April 12, 2017) are also shown in muted colors. Only variants with identifiable allele or residue sequence positions in the SFARI database were included in the above protein diagram, and splice site variants across cohorts are indicated by the respective allele change and underlined. Annotated protein domains predicted with confidence by the Simple Modular Architecture Research Tool (SMART) are shown as follows: CHR, chromatin organization modifier domain (blue), DEX, DEAD-like helicases superfamily (yellow), HELC, helicase superfamily c-terminal domain (pink), and BRK, domain of unknown function associated with CHROMO domain helicases (purple).

Based on data from this study, we estimate that 34% of de novo damaging mutations seen in OCD carry risk (Figure 2B) and that 335 genes confer risk in 22% of patients (Figure 2B). Given our OCD sample size, the 95% confidence intervals around these contribution estimates are wide, and need refinement by continued sequencing of OCD trios.

Mindful of the fact that more than half of genes harboring de novo damaging variants in our study may not be true risk genes, we consider our pathway and network analyses as exploratory at this stage. Nevertheless, we see preliminary evidence that genes identified by de novo damaging variants in OCD are functionally connected to a greater degree than expected by chance (Figure S6, Table S5). Furthermore, these genes may be enriched in immunological and cell signaling canonical pathways (Table S6), consistent with our pilot study of exome sequencing in 20 OCD trios (29). These analyses should be repeated as more high-confidence OCD risk genes are identified.

Figure S6 GeNets network analysis without candidates.

Using the GeNets algorithm (https://apps.broadinstitute.org/genets), we mapped all 89 genes harboring de novo damaging mutations in OCD (excluding two genes, TTN and CACNA1E, found to harbor de novo damaging variants in control subjects) onto the GeNets Metanetwork v1.0 to determine whether they are functionally connected. The density of the mapped network (density = number of edges / number of possible edges) was greater than 95% of randomly sampled gene sets, indicating that the network is significantly more connected than random (p=0.026). In the network, node (gene) size is proportional to the number of connections. The color is assigned by community, defined as a gene set that is more connected to one another than to another group of genes. Results from this analysis are available in interactive form here: https://www.broadinstitute.org/genets#/visualize/58d9425ea4e00291af652379

While not rising to the level of a high-confidence risk gene in this study, it is notable that we identified an OCD de novo damaging (Mis-D) variant in DLGAP1 (discs, large homolog- associated protein 1). In a genome-wide association study by the International OCD Foundation Genetic Collaborative (IOCDF-GC), the lowest p-values for their case-control analysis were found for two SNPs located within DLGAP1 (2.49×10⁻⁶, 3.44x10⁻⁶) (20). A subsequent GWAS by the OCD Collaborative Genetics Study (OCGAS) identified a SNP nearby this gene with a prominent signal (p=2.67×10⁻⁴) (18). Furthermore, a rare paternally-inherited duplication in DLGAP1 was recently reported in a child with OCD, Tourette syndrome, and anxiety (50). DLGAP1 is a member of the neuronal postsynaptic density complex and is in the same family as DLGAP3 (SAPAP3), a gene associated with OCD-like behaviors in a knockout mouse model (51). Therefore, evidence is beginning to converge on this gene as one of great interest in OCD genetics.

Successful gene discovery by leveraging gene-level recurrent de novo variation in autism, where over 65 genes have now been identified (24, 26, 27), and the results presented here for OCD, strongly reinforce the value of continuing WES in larger cohorts of OCD parent-child trios. Our models predict that by increasing the sample size of this study to 500 trios, we will gain 9 additional high-confidence risk genes and 22 probable risk genes. Further increasing to 1,000 trios will yield a total of 40 high-confidence and 77 probable risk genes. Discovering risk genes will change the status quo in OCD genetics by allowing new studies in model systems (e.g. animal models, induced pluripotent stem cells) and network analyses. Such studies will provide insights into OCD pathophysiology that are critical prerequisites for the discovery of novel therapeutic targets to alleviate the suffering of those with OCD.

MATERIALS AND METHODS

Subjects

This study was approved by the local institutional review boards of all participating institutions, and appropriate informed consent was obtained from participating subjects. 222 parent-child trios (139 male, 83 female), consisting of offspring meeting criteria for the diagnosis of obsessive-compulsive disorder, as defined by the Diagnostic and Statistical Manual for Mental Disorders (DSM-IV-TR or DSM-5) (52, 53), and their unaffected parents, were recruited for DNA sequencing. Trios were recruited at three sites: the University of São Paulo School of Medicine Obsessive-Compulsive Spectrum Disorders Program (42 trios), Centre for Addiction and Mental Health and the Frederick W. Thompson Anxiety Disorders Centre at the Sunnybrook Health Sciences Centre in Toronto (77 trios), and Yale University School of Medicine (61 trios). Additionally, we included 42 trios with OCD and chronic tics that were recruited for a separate study by TIC Genetics (54, 55). Other diagnostic criteria included: onset of symptoms prior to age 18 years; no previously diagnosed neurological disorder, intellectual disability, psychotic disorder, or OCD occurring exclusively in the context of depression; no known history of OCD in first degree relatives. Final diagnostic status was assigned based on the consensus of an experienced interviewer and a psychiatrist or psychologist after independent review and administration of a structured diagnostic interview. We prioritized the study of simplex OCD trios to increase the likelihood of detecting de novo sequence and structural variants. Available phenotype information, including gender and parental age, is included in Table S1.

Whole-exome sequencing (WES)

Exome capture and sequencing of blood-derived DNA from 222 affected children and their parents (666 samples total) were performed at the Yale Center for Genomic Analysis (YCGA), using the NimbleGen SeqCap EZExomeV2 (109 trios) or MedExome (113 trios) capture libraries (Roche NimbleGen, Madison, WI, USA) and the Illumina HiSeq 2000 platform (74 bp paired-end reads; Illumina, San Diego, CA). We multiplexed six samples during each capture reaction and sequencing lane, pooling parents and probands when possible. WES data from 855 unaffected parent-child trios (2565 samples total) were obtained from the Simons Simplex Collection via the NIH Data Archive (https://ndar.nih.gov/edit_collection.html?id=2042). These children and their parents have no evidence of autism spectrum or other neurodevelopmental disorders (56). Like our OCD samples, control WES was from blood-derived DNA and sequenced on the Illumina HiSeq 2000 sequencing platform after capture with the NimbleGen SeqCap EZExomeV2 library.

Sequence alignment and variant calling

Alignment and variant calling of the sequencing reads followed the latest Genome Analysis Toolkit (GATK) (57) Best Practices guidelines. Reads were aligned using BWA-mem (58) to the b37 human reference sequence with decoy sequences. Picard’s MarkDuplicates tool was used to mark PCR duplicates (https://broadinstitute.github.io/picard/). A target bed file was created by taking the intersection of the EZExomeV2 and MedExome target regions. GATK was used to realign indels, recalibrate quality scores, and generate GVCF files for each sample using the HaplotypeCaller tool. All samples were called jointly using GATK’s GenotypeGVCFs tool, variant score recalibration was applied to the called variants, and all variant call data was written to a VCF file. This pipeline uses GATK’s Best Practices parameters and the default parameters for BWA and Picard. Only passing variants were used in downstream analyses. Variants were annotated against the RefSeq hg19 gene definitions and multiple external databases of variant population frequency, conservation scores, variation intolerance, mutation severity, and predicted functional effects using ANNOVAR (59).

Quality control and de novo variant calling

Relatedness statistics were calculated based on the method of Manichaikul et al. (60), implemented in VCFtools v0.1.14.10 (61). Trios were omitted if expected family relationships were not confirmed or if there were unexpected relationships within or between families. Trios were omitted if > 5 de novo variants were observed. PLINK/SEQ (34) (i-stats; https://psychgen.u.hpc.mssm.edu/plinkseq/stats.shtml), PicardTools, and GATK DepthOfCoverage tools were used to generate quality metrics (Table S1). To identify outliers that might confound our case-control analysis, we performed principal components analysis (PCA) using this data. A scree plot determined the number of principal components accounting for the greatest proportion of variance, and we removed trios with family members falling more than three standard deviations from the mean in any of these principal components. See Figure S3, Table S1, and Supplementary Methods.

We used stringent thresholds for identifying de novo mutations because DNA from control subjects (recruited by the Simons Foundation Autism Research Initiative) was not available for confirmation by Sanger sequencing. De novo variants were called using an in-house script that required the following: child is heterozygous for a variant with alternate allele frequency between 0.3 and 0.7 in the child and < 0.05 in the parents, sequencing depth (DP) ≥ 20 in all family members at the variant position, alternate allele depth (AD) ≥ 5, observed allele frequency (AC) < 0.01 (1%) among all cases and controls, mapping quality (MQ) ≥ 30. False positive calls were removed by in silico visualization. We performed Sanger sequencing on the probands and parents for 383 putative de novo variants from this and another exome sequencing project using the same calling thresholds; 370 were confirmed, resulting in 96.6% specificity.

Mutation rate analysis

Within each cohort, we calculated the rate of de novo mutations per base pair. For accurate rate calculation, we first determined the number of “callable” base pairs per family using the GATK DepthOfCoverage tool. We considered only bases covered at ≥ 20x in all family members, with base quality ≥ 20, and map quality ≥ 30; these thresholds match those required for GATK and de novo variant calling. For each cohort, we summed the “callable” base pairs in every family and used this number as the denominator for de novo rate calculations. The resulting rate was divided by two to give haploid rates. Confidence intervals were calculated using the pois.conf.int (pois.exact) function from the epitools v0.5-9 package in R. We compared de novo mutation rates in cases versus controls (burden analysis) using a one-tailed rate ratio test in R (https://cran.r-project.org/package=rateratio.test), considering only those variants present with a frequency of <0.01 in the ExAC v0.3.1 database (62). See Table 1, Table S1, and Supplementary Methods.

TADA analysis

Prior exome analyses demonstrated that the observation of even a small number of rare de novo mutations in the same gene among unrelated individuals can provide considerable statistical power to establish association (33). We used the Transmitted And De novo Association (TADA-Denovo) test as a statistical method for risk gene discovery based on gene-level recurrence of de novo mutations within the classes of variants that we found enriched in OCD (27, 31). Parameter calculations and a detailed description of the method are given in Supplementary Methods. In summary, TADA-Denovo generates random mutational data based on each gene’s specified mutation rate to determine null distributions, then calculates a p-value and a false discovery rate (FDR) q-value for each gene using a Bayesian “direct posterior approach.” A low q-value represents strong evidence for OCD association. See Table 2, Table S3, and Supplementary Methods.

Estimation of number of risk gene

We first used a maximum likelihood estimation (MLE) method to estimate the number of genes contributing risk to OCD, based on vulnerability to de novo damaging variants (32). For every number of risk genes from 1 to 2,500, we simulated 95 variants (the number of damaging de novo variants observed in probands in our case-control burden analysis). Variant simulations were performed 100,000 times at each number of risk genes. Following each simulation, a percentage of variants was randomly assigned to the risk genes. The percentage of variants assigned to risk genes was determined by the fraction of de novo damaging variants estimated to carry OCD risk, and variant simulations were weighted by gene size and GC content (31). We then counted the number of risk and non-risk genes containing two variants and the number containing three or more variants. The frequency of concordance between our simulated and observed data was calculated. A curve was plotted to show the concordance frequency (y-axis) at each assumed number of risk genes (x-axis), and the peak was taken as the estimate of the most likely number of risk genes (32). See Figure S1 and Supplementary Methods.

Next, we used an alternate method for estimating the number of risk genes, using a statistical method based on the “unseen species” problem (33). This method uses the frequency and number of observed variant types (or species) to infer how many species are present in the population. See Supplementary Methods for details of these calculations.

Estimation of future risk gene discovery

We used the estimated number of OCD risk genes to predict the likely future gene discovery yield as additional OCD trios are investigated by WES. Fixing the gene number at 335 (from MLE estimate above), we varied the cohort size (from 25 to 3000, in increments of 25). At each cohort size, we simulated a number of variants matching the observed mutation rate in OCD probands. Simulated variants were randomly assigned to the risk genes. The percentage of variants assigned to risk genes was determined by the fraction of de novo damaging variants estimated to carry OCD risk, and variant simulations were weighted by gene size and GC content (31). At each cohort size, 10,000 simulations were performed. LGD and Mis-D variants were generated separately. Simulated variants were then combined and given as input to the TADA-Denovo algorithm, using the same parameters described above for the observed data. The number of high confidence (q<0.1) and probable (q<0.3) risk genes were recorded and plotted using polynomial regression fitting; this regression model allows prediction of the number of genes identified at a specified cohort size. See Figure S2 and Supplementary Methods.

Gene set overlap

We used DNENRICH (34) (https://psychgen.u.hpc.mssm.edu/dnenrich/) to test whether OCD genes harboring de novo damaging mutations (89 genes; excluding two genes, TTN and CACNA1E, found to harbor de novo damaging variants in control subjects) were significantly enriched among genes identified in autism (ASD), schizophrenia (SCZ), developmental disorders (DD), Tourette’s disorder (TD), and intellectual disability (ID). Gene lists were obtained from a recent cross-disorder study (63) that included de novo single nucleotide and indel variants from multiple exome sequencing studies in ASD (24, 26, 43, 64, 65), SCZ (34, 66-69), and ID (23, 70-72). Three of these studies also included de novo variants present in unaffected siblings (26, 64, 67). DD (21) and TD (54) genes were obtained from recently published WES studies. DNENRICH simulates random mutations while accounting for gene size, trinucleotide context, and mutational effect. We performed 100,000 permutations, comparing the observed and expected overlap with each gene set. Empirical p-values were generated, based on a one-sided enrichment analysis under a binomial model of greater than expected hits per gene set. We tested for overlap between our OCD genes and those in ASD, SCZ, DD, TD, ID, and unaffected siblings that harbored de novo (LGD, nonsynonymous, synonymous) mutations. We also tested for overlap with ASD genes achieving q<0.01 in a recent meta-analysis (27) and DD genes achieving genome-wide statistical significance (21). Finally, given that we identified CHD8 as an OCD risk gene in our study, we tested for overlap with lists of genes that are targets of CHD8 in human brain (46), See Table 3, Table S4, and Supplementary Methods.

Exploratory pathway and network analyses

To determine whether all genes harboring de novo damaging variants in OCD are enriched for specific biological pathways, we used the same gene list from our gene set overlap analysis (n=89) to identify the most significant canonical pathways suggested by MetaCore (Thomson Reuters, version 6.30, build 68780, https://portal.genego.com/) and Ingenuity Pathway Analysis (IPA, build version 430520M, content version 31813282, release date 2016-12-05; Ingenuity Systems, http://www.ingenuity.com/). The following default settings were used for MetaCore: Analyze Single Experiment Tool, species Homo sapiens, threshold 0, p-value 1, signals both. The following default settings were used for IPA: Reference set: Ingenuity Knowledge Base (Genes Only); direct and indirect relationships; does not include endogenous chemicals; consider only relationships where species = human and confidence = experimentally observed. See Table S6.

Using the GeNets algorithm (https://apps.broadinstitute.org/genets), we mapped all 89 genes harboring de novo damaging mutations in OCD onto the GeNets Metanetwork v1.0 to determine whether they are functionally connected. The GeNets Metanetwork contains integrated protein-protein interactions from InWeb3 (73), ConcensusPathDB (http://consensuspathdb.org) (74), and 5,057 drug-target interactions (75); the total network size is 530,532 interactions. The GeNets algorithm determines the density of the mapped network (density = number of edges / number of possible edges) and compares this to computed densities for randomly sampled gene sets. An empirically determined p-value is generated. The network is determined to be significantly more connected than random if the density is greater than 95% of the randomly sampled gene sets. Additionally, in the process of mapping our genes onto the Metanetwork, additional candidate genes are predicted, based on their connectivity to our input genes. An overall network connectivity p-value is generated, both with and without these additional predicted candidates. Also, as part of the GeNets analysis, gene “communities” were determined, defined as genes that are more connected to one another than they are to other groups of genes. See Figure S6 and Table S5. All GeNets results for this analysis are also available in interactive form here: https://www.broadinstitute.org/genets#/visualize/58d9425ea4e00291af652379

Gene set overlap using DNENRICH

The following Linux commands were used to run the DNENRICH analysis:

Supplemental Tables

Table S1 - Phenotype, exome sequencing metrics, and principal components analysis.

(see “TableS1.xlsx”)

First tab contains individual-level sample information (columns A-K), including family ID, individual ID, phenotype, cohort, collection site, gender, capture platform, size of “callable exome”, and parental age (years) at birth, where available. Column L lists reasons for any sample exclusions by quality control methods; “0” indicates that the sample was not excluded, and was included in subsequent analyses. Columns M-AH list individual sample sequencing metrics generated using PicardTools, and GATK DepthOfCoverage tools. Columns AI-AS list individual sample sequencing metrics generated using PLINK/SEQ (i-stats; https://psychgen.u.hpc.mssm.edu/plinkseq/stats.shtml). Columns B, M-AS were included in Principal Components Analysis (PCA). Third tab contains cohort-level metrics calculated using samples passing quality control. ±95% confidence intervals are given, when applicable. Fourth tab contains coordinates generated for each sample for the top 10 principal components following PCA. The code used to generate this data is included in Supplementary Methods. Using these coordinates, we removed trios with family members falling more than three standard deviations from the mean in any of the first five principal components; this information is contained in the fifth tab.

Table S2 - Annotated de novo variants in OCD and controls.

(see “TableS2.xlsx”)

Detailed information on all high confidence de novo variants in cases and controls. These variants were annotated using Annovar, based on RefSeq hg19 gene definitions. Column descriptions are provided in a separate tab of this file. A third tab provides the number of each de novo variant type per sample.

Table S3 - Gene-level de novo mutation rates, variant counts, and TADA-Denovo results.

(see “TableS3.xlsx”)

First tab contains de novo mutation rates used to perform subsequent maximum likelihood estimation (MLE) and TADA-Denovo analyses. The following mutation rates are listed for each gene: overall, likely gene disrupting (lgd), predicted damaging missense (misD), and all damaging (lgd + misD). These mutation rates were previously published (Ware et al., 2015) from unaffected parent-child trios. The code used to generate the mutation rate table is provided in Supplementary Methods. Second tab contains the input file for the TADA-Denovo algorithm. Gene-level expected mutation rates for LGD (“mut.cls1” column) and Mis-D variants (“mut.cls2” column) are listed, along with their respective observed mutation counts in our OCD data (“dn.cls1” and “dn.cls2”, respectively). Code for running TADA-Denovo is given in Supplementary Methods. Third tab contains the final output results from TADA-Denovo code provided in Supplementary Methods. Genes harboring more than one damaging de novo (LGD or Mis-D) variant in OCD probands are highlighted in yellow (SCUBE1, CHD8, TTN). Two of these genes (SCUBE1 and CHD8) exceeded thresholds for being considered a probable (qval < 0.3) or high confidence (qval < 0.1) risk gene.

Table S4 - DNENRICH gene lists and results.

(see “TableS4.xlsx”)

See Supplementary Methods for details of DNENRICH analysis. First tab contains input for DNENRICH analysis. Each row represents a de novo damaging mutation in an OCD proband. Second tab contains the input gene lists to determine enrichment for our OCD damaging de novo mutations. Third tab contains final results output from DNENRICH. Significantly enriched gene sets are highlighted.

Table S5 - GeNets network connectivity analysis results.

(see “TableS5.xlsx”)

Complete results from GeNets network analysis of de novo damaging variants found in OCD probands. First tab contains summary statistics of the resulting network, considered both with and without nearby predicted candidate genes. Second tab contains the input gene list and the candidate genes predicted by the network analysis. Third tab groups genes (without predicted candidates) into nearby “communities” that are more connected with each other than their neighbors. Fourth tab contains network edges without the predicted candidates. Fifth tab contains gene community groupings, including predicted candidates. Sixth tab contains network edges including predicted candidates. See Methods for further details of this analysis.

TableS6 - MetaCore and Ingenuity Pathway Analysis (IPA) gene enrichment analysis results.

(see “TableS6.xlsx”)

Complete results from Metacore (first tab) and IPA (second tab) gene enrichment analyses, with p-values calculated by each analysis algorithm. See Methods for details of these analyses.

ACKNOWLEDGEMENTS

We wish to thank the families who have participated in and contributed to this study. Control subject data were obtained from the NIH-supported National Database for Autism Research (NDAR). NDAR is a collaborative informatics system created by the National Institutes of Health to provide a national resource to support and accelerate research in autism. Dataset identifier: 2042. This manuscript reflects the views of the authors and may not reflect the opinions or views of the NIH or of the Submitters submitting original data to NDAR. Additionally, we wish to thank the Tourette International Collaborative Genetics Study for contributing published genetic data. This study was supported by grants from the Allison Family Foundation [TVF]; FAPESP (process number: 2014/01585-5) and CNPq (process number: 460928/2014-7) [CC]; the Ontario Mental Health Foundation (OMHF) and private donations from the Frederick W. Thompson family [MAR, JLK]. JLK is a Scientific Advisory Board member of AssureRx. JLK has received speaker honoraria and expenses from Eli Lilly and Novartis, consultant honoraria and expenses from Roche, and expenses from AssureRx. MAR has received research support through grants from Roche and speaker honoraria from Lundbeck. TVF has received research support from Shire, the Simons Foundation, and the National Institute of Mental Health; neither funded this project. The authors declare no potential conflict of interest.

REFERENCES

1.↵
Fontenelle LF, Mendlowicz MV, & Versiani M (2006) The descriptive epidemiology of obsessive-compulsive disorder. Progress in neuro-psychopharmacology & biological psychiatry 30(3):327–337.
OpenUrl CrossRef PubMed
2.↵
Karno M, Golding JM, Sorenson SB, & Burnam MA (1988) The epidemiology of obsessive-compulsive disorder in five US communities. Arch Gen Psychiatry 45(12):1094–1099.
OpenUrl CrossRef PubMed Web of Science
3.↵
Ruscio AM, Stein DJ, Chiu WT, & Kessler RC (2010) The epidemiology of obsessive-compulsive disorder in the National Comorbidity Survey Replication. Mol Psychiatry 15(1):53–63.
OpenUrl CrossRef PubMed Web of Science
4.↵
Weissman MM, et al. (1994) The cross national epidemiology of obsessive compulsive disorder. The Cross National Collaborative Group. J Clin Psychiatry 55 Suppl:5–10.
OpenUrl PubMed Web of Science
5.↵
Torres AR, et al. (2006) Obsessive-compulsive disorder: prevalence, comorbidity, impact, and help-seeking in the British National Psychiatric Morbidity Survey of 2000. Am J Psychiatry 163(11):1978–1985.
OpenUrl CrossRef PubMed Web of Science
6.↵
Miguel EC, et al. (2000) Sensory phenomena in obsessive-compulsive disorder and Tourette's disorder. J Clin Psychiatry 61(2):150-156; quiz 157.
OpenUrl CrossRef PubMed Web of Science
7.↵
Shavitt RG, et al. (2014) Phenomenology of OCD: lessons from a large multicenter study and implications for ICD-11. J Psychiatr Res 57:141-148.
OpenUrl
8.↵
Veale D & Roberts A (2014) Obsessive-compulsive disorder. BMJ (Clinical research ed.) 348:g2183.
OpenUrl FREE Full Text
9.↵
Bobes J, et al. (2001) Quality of life and disability in patients with obsessive-compulsive disorder. Eur Psychiatry 16(4):239–245.
OpenUrl CrossRef PubMed Web of Science
10.↵
Meier SM, et al. (2016) Mortality Among Persons With Obsessive-Compulsive Disorder in Denmark. JAMA psychiatry 73(3):268–274.
OpenUrl
11.↵
Lochner C, et al. (2014) Comorbidity in obsessive-compulsive disorder (OCD): a report from the International College of Obsessive-Compulsive Spectrum Disorders (ICOCS). Comprehensive psychiatry 55(7):1513–1519.
OpenUrl CrossRef PubMed
12.↵
Skoog G & Skoog I (1999) A 40-year follow-up of patients with obsessive-compulsive disorder [see commetns]. Arch Gen Psychiatry 56(2):121–127.
OpenUrl CrossRef PubMed Web of Science
13.↵
Eisen JL, et al. (2013) Five-year course of obsessive-compulsive disorder: predictors of remission and relapse. J Clin Psychiatry 74(3):233–239.
OpenUrl CrossRef PubMed
14.↵
Monzani B, Rijsdijk F, Harris J, & Mataix-Cols D (2014) The structure of genetic and environmental risk factors for dimensional representations of DSM-5 obsessive-compulsive spectrum disorders. JAMA psychiatry 71(2):182–189.
OpenUrl
15.
Pauls DL, Alsobrook JP, Goodman W, Rasmussen S, & Leckman JF (1995) A family study of obsessive-compulsive disorder. Am J Psychiatry 152(1):76–84.
OpenUrl CrossRef PubMed Web of Science
16.
Taylor S (2011) Etiology of obsessions and compulsions: a meta-analysis and narrative review of twin studies. Clinical psychology review 31(8):1361–1372.
OpenUrl CrossRef PubMed
17.↵
van Grootheest DS, Cath DC, Beekman AT, & Boomsma DI (2005) Twin studies on obsessive-compulsive disorder: a review. Twin research and human genetics: the official journal of the International Society for Twin Studies 8(5):450–458.
OpenUrl
18.↵
Mattheisen M, et al. (2014) Genome-wide association study in obsessive-compulsive disorder: results from the OCGAS. Mol Psychiatry.
19.
Costas J, et al. (2016) Exon-focused genome-wide association study of obsessive-compulsive disorder and shared polygenic risk with schizophrenia. Transl Psychiatry 6:e768.
OpenUrl
20.↵
Stewart SE, et al. (2013) Genome-wide association study of obsessive-compulsive disorder. Mol Psychiatry 18(7):788–798.
OpenUrl CrossRef PubMed Web of Science
21.↵
Deciphering Developmental Disorders Study (2017) Prevalence and architecture of de novo mutations in developmental disorders. Nature.
22.↵
Allen AS, et al. (2013) De novo mutations in epileptic encephalopathies. Nature 501(7466):217–221.
OpenUrl CrossRef PubMed Web of Science
23.↵
Rauch A, et al. (2012) Range of genetic mutations associated with severe nonsyndromic sporadic intellectual disability: an exome sequencing study. Lancet 380(9854):1674–1682.
OpenUrl CrossRef PubMed Web of Science
24.↵
De Rubeis S, et al. (2014) Synaptic, transcriptional and chromatin genes disrupted in autism. Nature 515(7526):209–215.
OpenUrl CrossRef PubMed Web of Science
25.
Dong S, et al. (2014) De novo insertions and deletions of predominantly paternal origin are associated with autism spectrum disorder. Cell reports 9(1):16–23.
OpenUrl
26.↵
Iossifov I, et al. (2014) The contribution of de novo coding mutations to autism spectrum disorder. Nature 515(7526):216–221.
OpenUrl CrossRef PubMed Web of Science
27.↵
Sanders SJ, et al. (2015) Insights into Autism Spectrum Disorder Genomic Architecture and Biology from 71 Risk Loci. Neuron 87(6):1215–1233.
OpenUrl CrossRef PubMed
28.↵
Willsey AJ, et al. (2013) Coexpression networks implicate human midfetal deep cortical projection neurons in the pathogenesis of autism. Cell 155(5):997–1007.
OpenUrl CrossRef PubMed Web of Science
29.↵
Cappi C, et al. (2016) Whole-exome sequencing in obsessive-compulsive disorder identifies rare mutations in immunological and neurodevelopmental pathways. Transl Psychiatry 6:e764.
OpenUrl
30.↵
Deciphering Developmental Disorders Study (2015) Large-scale discovery of novel genetic causes of developmental disorders. Nature 519(7542):223–228.
OpenUrl CrossRef PubMed
31.↵
He X, et al. (2013) Integrated model of de novo and inherited genetic variants yields greater power to identify risk genes. PLoS Genet 9(8):e1003671.
OpenUrl CrossRef PubMed
32.↵
Homsy J, et al. (2015) De novo mutations in congenital heart disease with neurodevelopmental and other congenital anomalies. Science 350(6265):1262–1266.
OpenUrl Abstract/FREE Full Text
33.↵
Sanders SJ, et al. (2012) De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature 485(7397):237–241.
OpenUrl CrossRef PubMed Web of Science
34.↵
Fromer M, et al. (2014) De novo mutations in schizophrenia implicate synaptic networks. Nature 506(7487):179–184.
OpenUrl CrossRef PubMed Web of Science
35.↵
Grimmond S, et al. (2000) Cloning, mapping, and expression analysis of a gene encoding a novel mammalian EGF-related protein (SCUBE1). Genomics 70(1):74–81.
OpenUrl CrossRef PubMed Web of Science
36.↵
Yang RB, et al. (2002) Identification of a novel family of cell-surface proteins expressed in human vascular endothelium. J Biol Chem 277(48):46364–46373.
OpenUrl Abstract/FREE Full Text
37.↵
Tu CF, et al. (2006) Localization and characterization of a novel secreted protein SCUBE1 in human platelets. Cardiovascular research 71(3):486–495.
OpenUrl CrossRef PubMed Web of Science
38.↵
Wu MY, et al. (2014) Inhibition of the plasma SCUBE1, a novel platelet adhesive protein, protects mice against thrombosis. Arteriosclerosis, thrombosis, and vascular biology 34(7):1390–1398.
OpenUrl Abstract/FREE Full Text
39.↵
Gabbay V, et al. (2009) A cytokine study in children and adolescents with Tourette's disorder. Progress in neuro-psychopharmacology & biological psychiatry 33(6):967–971.
OpenUrl CrossRef PubMed
40.
Mittleman BB, et al. (1997) Cerebrospinal fluid cytokines in pediatric neuropsychiatric disease. Journal of immunology (Baltimore, Md.: 1950) 159(6):2994–2999.
OpenUrl Abstract
41.
Leckman JF, et al. (2005) Increased serum levels of interleukin-12 and tumor necrosis factor-alpha in Tourette's syndrome. Biol Psychiatry 57(6):667–673.
OpenUrl CrossRef PubMed Web of Science
42.↵
Mitchell RH & Goldstein BI (2014) Inflammation in children and adolescents with neuropsychiatric disorders: a systematic review. J Am Acad Child Adolesc Psychiatry 53(3):274–296.
OpenUrl CrossRef PubMed
43.↵
Neale BM, et al. (2012) Patterns and rates of exonic de novo mutations in autism spectrum disorders. Nature 485(7397):242–245.
OpenUrl CrossRef PubMed Web of Science
44.
Bernier R, et al. (2014) Disruptive CHD8 mutations define a subtype of autism early in development. Cell 158(2):263–276.
OpenUrl CrossRef PubMed Web of Science
45.↵
O'Roak BJ, et al. (2012) Multiplex targeted sequencing identifies recurrently mutated genes in autism spectrum disorders. Science 338(6114):1619–1622.
OpenUrl Abstract/FREE Full Text
46.↵
Cotney J, et al. (2015) The autism-associated chromatin modifier CHD8 regulates other autism risk genes during human neurodevelopment. Nature communications 6:6404.
OpenUrl
47.↵
Thompson BA, Tremblay V, Lin G, & Bochar DA (2008) CHD8 is an ATP-dependent chromatin remodeling factor that regulates beta-catenin target genes. Mol Cell Biol 28(12):3894–3904.
OpenUrl Abstract/FREE Full Text
48.
Yuan CC, et al. (2007) CHD8 associates with human Staf and contributes to efficient U6 RNA polymerase III transcription. Mol Cell Biol 27(24):8729–8738.
OpenUrl Abstract/FREE Full Text
49.↵
Barski A, et al. (2007) High-resolution profiling of histone methylations in the human genome. Cell 129(4):823–837.
OpenUrl CrossRef PubMed Web of Science
50.↵
Gazzellone MJ, et al. (2016) Uncovering obsessive-compulsive disorder risk genes in a pediatric cohort by high-resolution analysis of copy number variation. Journal of neurodevelopmental disorders 8:36.
OpenUrl
51.↵
Welch JM, et al. (2007) Cortico-striatal synaptic defects and OCD-like behaviours in Sapap3-mutant mice. Nature 448(7156):894–900.
OpenUrl CrossRef PubMed Web of Science
52.↵
American Psychiatric Association (2000) Diagnostic and statistical manual of mental disorders: DSM-IV-TR (American Psychiatric Association, Washington, DC).
53.↵
American Psychiatric Association (2013) Diagnostic and statistical manual of mental disorders: DSM-5.
54.↵
Willsey AJ, et al. (2017) De Novo Coding Variants Are Strongly Associated with Tourette Disorder. Neuron 94(3):486–499 e489.
OpenUrl
55.↵
Dietrich A, et al. (2015) The Tourette International Collaborative Genetics (TIC Genetics) study, finding the genes causing Tourette syndrome: objectives and methods. European Child & Adolescent Psychiatry 24(2):141–151.
OpenUrl
56.↵
Fischbach GD & Lord C (2010) The Simons Simplex Collection: a resource for identification of autism genetic risk factors. Neuron 68(2):192–195.
OpenUrl CrossRef PubMed Web of Science
57.↵
McKenna A, et al. (2010) The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20(9):1297–1303.
OpenUrl Abstract/FREE Full Text
58.↵
Li H & Durbin R (2010) Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26(5):589–595.
OpenUrl CrossRef PubMed Web of Science
59.↵
Wang K, Li M, & Hakonarson H (2010) ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res 38(16):e164.
OpenUrl CrossRef PubMed
60.↵
Manichaikul A, et al. (2010) Robust relationship inference in genome-wide association studies. Bioinformatics (Oxford, England) 26(22):2867–2873.
OpenUrl CrossRef PubMed Web of Science
61.↵
Danecek P, et al. (2011) The variant call format and VCFtools. Bioinformatics (Oxford, England) 27(15):2156–2158.
OpenUrl CrossRef PubMed Web of Science
62.↵
Lek M, et al. (2016) Analysis of protein-coding genetic variation in 60,706 humans. Nature 536(7616):285–291.
OpenUrl CrossRef PubMed
63.↵
Shohat S, Ben-David E, & Shifman S (2017) Varying Intolerance of Gene Pathways to Mutational Classes Explain Genetic Convergence across Neuropsychiatric Disorders. Cell reports 18(9):2217–2227.
OpenUrl
64.↵
Iossifov I, et al. (2012) De novo gene disruptions in children on the autistic spectrum. Neuron 74(2):285–299.
OpenUrl CrossRef PubMed Web of Science
65.↵
O'Roak BJ, et al. (2012) Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations. Nature 485(7397):246–250.
OpenUrl CrossRef PubMed Web of Science
66.↵
Girard SL, et al. (2011) Increased exonic de novo mutation rate in individuals with schizophrenia. Nat Genet 43(9):860–863.
OpenUrl CrossRef PubMed
67.↵
Gulsuner S, et al. (2013) Spatial and temporal mapping of de novo mutations in schizophrenia to a fetal prefrontal cortical network. Cell 154(3):518–529.
OpenUrl CrossRef PubMed Web of Science
68.
McCarthy SE, et al. (2014) De novo mutations in schizophrenia implicate chromatin remodeling and support a genetic overlap with autism and intellectual disability. Mol Psychiatry 19(6):652–658.
OpenUrl CrossRef PubMed Web of Science
69.↵
Xu B, et al. (2011) Exome sequencing supports a de novo mutational paradigm for schizophrenia. Nat Genet 43(9):864–868.
OpenUrl CrossRef PubMed
70.↵
de Ligt J, et al. (2012) Diagnostic exome sequencing in persons with severe intellectual disability. The New England journal of medicine 367(20):1921–1929.
OpenUrl CrossRef PubMed Web of Science
71.
Gilissen C, et al. (2014) Genome sequencing identifies major causes of severe intellectual disability. Nature 511(7509):344–347.
OpenUrl CrossRef PubMed Web of Science
72.↵
Hamdan FF, et al. (2014) De novo mutations in moderate or severe intellectual disability. PLoS Genet 10(10):e1004772.
OpenUrl CrossRef PubMed
73.↵
Lage K, et al. (2007) A human phenome-interactome network of protein complexes implicated in genetic disorders. Nat Biotechnol 25(3):309–316.
OpenUrl CrossRef PubMed Web of Science
74.↵
Kamburov A, Stelzl U, Lehrach H, & Herwig R (2013) The ConsensusPathDB interaction database: 2013 update. Nucleic Acids Res 41(Database issue):D793–800.
OpenUrl CrossRef PubMed Web of Science
75.↵
Rask-Andersen M, Masuram S, & Schioth HB (2014) The druggable genome: Evaluation of drug targets in clinical trials suggests major shifts in molecular class and indication. Annual review of pharmacology and toxicology 54:9–26.
OpenUrl CrossRef PubMed

References

1.
Ware JS, Samocha KE, Homsy J, & Daly MJ (2015) Interpreting de novo Variation in Human Disease Using denovolyzeR. Current protocols in human genetics / editorial board, Jonathan L. Haines … [et al.] 87:7.25.21-15.
OpenUrl
2.
He X, et al. (2013) Integrated model of de novo and inherited genetic variants yields greater power to identify risk genes. PLoS Genet 9(8):e1003671.
OpenUrl CrossRef PubMed
3.
Sanders SJ, et al. (2015) Insights into Autism Spectrum Disorder Genomic Architecture and Biology from 71 Risk Loci. Neuron 87(6):1215–1233.
OpenUrl CrossRef PubMed
4.
Sanders SJ, et al. (2012) De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature 485(7397):237–241.
OpenUrl CrossRef PubMed Web of Science

View the discussion thread.

Posted September 21, 2017.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Genetics

Subject Areas

All Articles

Animal Behavior and Cognition (5214)
Biochemistry (11745)
Bioengineering (8751)
Bioinformatics (29195)
Biophysics (14971)
Cancer Biology (12095)
Cell Biology (17411)
Clinical Trials (138)
Developmental Biology (9421)
Ecology (14178)
Epidemiology (2067)
Evolutionary Biology (18306)
Genetics (12245)
Genomics (16801)
Immunology (11867)
Microbiology (28083)
Molecular Biology (11592)
Neuroscience (60965)
Paleontology (451)
Pathology (1870)
Pharmacology and Toxicology (3238)
Physiology (4959)
Plant Biology (10427)
Scientific Communication and Education (1683)
Synthetic Biology (2885)
Systems Biology (7339)
Zoology (1651)

[1] 1.↵
Fontenelle LF, Mendlowicz MV, & Versiani M (2006) The descriptive epidemiology of obsessive-compulsive disorder. Progress in neuro-psychopharmacology & biological psychiatry 30(3):327–337.
OpenUrl CrossRef PubMed

[2] 2.↵
Karno M, Golding JM, Sorenson SB, & Burnam MA (1988) The epidemiology of obsessive-compulsive disorder in five US communities. Arch Gen Psychiatry 45(12):1094–1099.
OpenUrl CrossRef PubMed Web of Science

[3] 3.↵
Ruscio AM, Stein DJ, Chiu WT, & Kessler RC (2010) The epidemiology of obsessive-compulsive disorder in the National Comorbidity Survey Replication. Mol Psychiatry 15(1):53–63.
OpenUrl CrossRef PubMed Web of Science

[4] 4.↵
Weissman MM, et al. (1994) The cross national epidemiology of obsessive compulsive disorder. The Cross National Collaborative Group. J Clin Psychiatry 55 Suppl:5–10.
OpenUrl PubMed Web of Science

[5] 5.↵
Torres AR, et al. (2006) Obsessive-compulsive disorder: prevalence, comorbidity, impact, and help-seeking in the British National Psychiatric Morbidity Survey of 2000. Am J Psychiatry 163(11):1978–1985.
OpenUrl CrossRef PubMed Web of Science

[6] 6.↵
Miguel EC, et al. (2000) Sensory phenomena in obsessive-compulsive disorder and Tourette's disorder. J Clin Psychiatry 61(2):150-156; quiz 157.
OpenUrl CrossRef PubMed Web of Science

[7] 7.↵
Shavitt RG, et al. (2014) Phenomenology of OCD: lessons from a large multicenter study and implications for ICD-11. J Psychiatr Res 57:141-148.
OpenUrl

[8] 8.↵
Veale D & Roberts A (2014) Obsessive-compulsive disorder. BMJ (Clinical research ed.) 348:g2183.
OpenUrl FREE Full Text

[9] 9.↵
Bobes J, et al. (2001) Quality of life and disability in patients with obsessive-compulsive disorder. Eur Psychiatry 16(4):239–245.
OpenUrl CrossRef PubMed Web of Science

[10] 10.↵
Meier SM, et al. (2016) Mortality Among Persons With Obsessive-Compulsive Disorder in Denmark. JAMA psychiatry 73(3):268–274.
OpenUrl

[11] 11.↵
Lochner C, et al. (2014) Comorbidity in obsessive-compulsive disorder (OCD): a report from the International College of Obsessive-Compulsive Spectrum Disorders (ICOCS). Comprehensive psychiatry 55(7):1513–1519.
OpenUrl CrossRef PubMed

[12] 12.↵
Skoog G & Skoog I (1999) A 40-year follow-up of patients with obsessive-compulsive disorder [see commetns]. Arch Gen Psychiatry 56(2):121–127.
OpenUrl CrossRef PubMed Web of Science

[13] 13.↵
Eisen JL, et al. (2013) Five-year course of obsessive-compulsive disorder: predictors of remission and relapse. J Clin Psychiatry 74(3):233–239.
OpenUrl CrossRef PubMed

[14] 14.↵
Monzani B, Rijsdijk F, Harris J, & Mataix-Cols D (2014) The structure of genetic and environmental risk factors for dimensional representations of DSM-5 obsessive-compulsive spectrum disorders. JAMA psychiatry 71(2):182–189.
OpenUrl

[15] 15.
Pauls DL, Alsobrook JP, Goodman W, Rasmussen S, & Leckman JF (1995) A family study of obsessive-compulsive disorder. Am J Psychiatry 152(1):76–84.
OpenUrl CrossRef PubMed Web of Science

[16] 16.
Taylor S (2011) Etiology of obsessions and compulsions: a meta-analysis and narrative review of twin studies. Clinical psychology review 31(8):1361–1372.
OpenUrl CrossRef PubMed

[17] 17.↵
van Grootheest DS, Cath DC, Beekman AT, & Boomsma DI (2005) Twin studies on obsessive-compulsive disorder: a review. Twin research and human genetics: the official journal of the International Society for Twin Studies 8(5):450–458.
OpenUrl

[18] 18.↵
Mattheisen M, et al. (2014) Genome-wide association study in obsessive-compulsive disorder: results from the OCGAS. Mol Psychiatry.

[19] 19.
Costas J, et al. (2016) Exon-focused genome-wide association study of obsessive-compulsive disorder and shared polygenic risk with schizophrenia. Transl Psychiatry 6:e768.
OpenUrl

[20] 20.↵
Stewart SE, et al. (2013) Genome-wide association study of obsessive-compulsive disorder. Mol Psychiatry 18(7):788–798.
OpenUrl CrossRef PubMed Web of Science

[21] 21.↵
Deciphering Developmental Disorders Study (2017) Prevalence and architecture of de novo mutations in developmental disorders. Nature.

[22] 22.↵
Allen AS, et al. (2013) De novo mutations in epileptic encephalopathies. Nature 501(7466):217–221.
OpenUrl CrossRef PubMed Web of Science

[23] 23.↵
Rauch A, et al. (2012) Range of genetic mutations associated with severe nonsyndromic sporadic intellectual disability: an exome sequencing study. Lancet 380(9854):1674–1682.
OpenUrl CrossRef PubMed Web of Science

[24] 24.↵
De Rubeis S, et al. (2014) Synaptic, transcriptional and chromatin genes disrupted in autism. Nature 515(7526):209–215.
OpenUrl CrossRef PubMed Web of Science

[25] 25.
Dong S, et al. (2014) De novo insertions and deletions of predominantly paternal origin are associated with autism spectrum disorder. Cell reports 9(1):16–23.
OpenUrl

[26] 26.↵
Iossifov I, et al. (2014) The contribution of de novo coding mutations to autism spectrum disorder. Nature 515(7526):216–221.
OpenUrl CrossRef PubMed Web of Science

[27] 27.↵
Sanders SJ, et al. (2015) Insights into Autism Spectrum Disorder Genomic Architecture and Biology from 71 Risk Loci. Neuron 87(6):1215–1233.
OpenUrl CrossRef PubMed

[28] 28.↵
Willsey AJ, et al. (2013) Coexpression networks implicate human midfetal deep cortical projection neurons in the pathogenesis of autism. Cell 155(5):997–1007.
OpenUrl CrossRef PubMed Web of Science

[29] 29.↵
Cappi C, et al. (2016) Whole-exome sequencing in obsessive-compulsive disorder identifies rare mutations in immunological and neurodevelopmental pathways. Transl Psychiatry 6:e764.
OpenUrl

[30] 30.↵
Deciphering Developmental Disorders Study (2015) Large-scale discovery of novel genetic causes of developmental disorders. Nature 519(7542):223–228.
OpenUrl CrossRef PubMed

[31] 31.↵
He X, et al. (2013) Integrated model of de novo and inherited genetic variants yields greater power to identify risk genes. PLoS Genet 9(8):e1003671.
OpenUrl CrossRef PubMed

[32] 32.↵
Homsy J, et al. (2015) De novo mutations in congenital heart disease with neurodevelopmental and other congenital anomalies. Science 350(6265):1262–1266.
OpenUrl Abstract/FREE Full Text

[33] 33.↵
Sanders SJ, et al. (2012) De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature 485(7397):237–241.
OpenUrl CrossRef PubMed Web of Science

[34] 34.↵
Fromer M, et al. (2014) De novo mutations in schizophrenia implicate synaptic networks. Nature 506(7487):179–184.
OpenUrl CrossRef PubMed Web of Science

[35] 35.↵
Grimmond S, et al. (2000) Cloning, mapping, and expression analysis of a gene encoding a novel mammalian EGF-related protein (SCUBE1). Genomics 70(1):74–81.
OpenUrl CrossRef PubMed Web of Science

[36] 36.↵
Yang RB, et al. (2002) Identification of a novel family of cell-surface proteins expressed in human vascular endothelium. J Biol Chem 277(48):46364–46373.
OpenUrl Abstract/FREE Full Text

[37] 37.↵
Tu CF, et al. (2006) Localization and characterization of a novel secreted protein SCUBE1 in human platelets. Cardiovascular research 71(3):486–495.
OpenUrl CrossRef PubMed Web of Science

[38] 38.↵
Wu MY, et al. (2014) Inhibition of the plasma SCUBE1, a novel platelet adhesive protein, protects mice against thrombosis. Arteriosclerosis, thrombosis, and vascular biology 34(7):1390–1398.
OpenUrl Abstract/FREE Full Text

[39] 39.↵
Gabbay V, et al. (2009) A cytokine study in children and adolescents with Tourette's disorder. Progress in neuro-psychopharmacology & biological psychiatry 33(6):967–971.
OpenUrl CrossRef PubMed

[40] 40.
Mittleman BB, et al. (1997) Cerebrospinal fluid cytokines in pediatric neuropsychiatric disease. Journal of immunology (Baltimore, Md.: 1950) 159(6):2994–2999.
OpenUrl Abstract

[41] 41.
Leckman JF, et al. (2005) Increased serum levels of interleukin-12 and tumor necrosis factor-alpha in Tourette's syndrome. Biol Psychiatry 57(6):667–673.
OpenUrl CrossRef PubMed Web of Science

[42] 42.↵
Mitchell RH & Goldstein BI (2014) Inflammation in children and adolescents with neuropsychiatric disorders: a systematic review. J Am Acad Child Adolesc Psychiatry 53(3):274–296.
OpenUrl CrossRef PubMed

[43] 43.↵
Neale BM, et al. (2012) Patterns and rates of exonic de novo mutations in autism spectrum disorders. Nature 485(7397):242–245.
OpenUrl CrossRef PubMed Web of Science

[44] 44.
Bernier R, et al. (2014) Disruptive CHD8 mutations define a subtype of autism early in development. Cell 158(2):263–276.
OpenUrl CrossRef PubMed Web of Science

[45] 45.↵
O'Roak BJ, et al. (2012) Multiplex targeted sequencing identifies recurrently mutated genes in autism spectrum disorders. Science 338(6114):1619–1622.
OpenUrl Abstract/FREE Full Text

[46] 46.↵
Cotney J, et al. (2015) The autism-associated chromatin modifier CHD8 regulates other autism risk genes during human neurodevelopment. Nature communications 6:6404.
OpenUrl

[47] 47.↵
Thompson BA, Tremblay V, Lin G, & Bochar DA (2008) CHD8 is an ATP-dependent chromatin remodeling factor that regulates beta-catenin target genes. Mol Cell Biol 28(12):3894–3904.
OpenUrl Abstract/FREE Full Text

[48] 48.
Yuan CC, et al. (2007) CHD8 associates with human Staf and contributes to efficient U6 RNA polymerase III transcription. Mol Cell Biol 27(24):8729–8738.
OpenUrl Abstract/FREE Full Text

[49] 49.↵
Barski A, et al. (2007) High-resolution profiling of histone methylations in the human genome. Cell 129(4):823–837.
OpenUrl CrossRef PubMed Web of Science

[50] 50.↵
Gazzellone MJ, et al. (2016) Uncovering obsessive-compulsive disorder risk genes in a pediatric cohort by high-resolution analysis of copy number variation. Journal of neurodevelopmental disorders 8:36.
OpenUrl

[51] 51.↵
Welch JM, et al. (2007) Cortico-striatal synaptic defects and OCD-like behaviours in Sapap3-mutant mice. Nature 448(7156):894–900.
OpenUrl CrossRef PubMed Web of Science

[52] 52.↵
American Psychiatric Association (2000) Diagnostic and statistical manual of mental disorders: DSM-IV-TR (American Psychiatric Association, Washington, DC).

[53] 53.↵
American Psychiatric Association (2013) Diagnostic and statistical manual of mental disorders: DSM-5.

[54] 54.↵
Willsey AJ, et al. (2017) De Novo Coding Variants Are Strongly Associated with Tourette Disorder. Neuron 94(3):486–499 e489.
OpenUrl

[55] 55.↵
Dietrich A, et al. (2015) The Tourette International Collaborative Genetics (TIC Genetics) study, finding the genes causing Tourette syndrome: objectives and methods. European Child & Adolescent Psychiatry 24(2):141–151.
OpenUrl

[56] 56.↵
Fischbach GD & Lord C (2010) The Simons Simplex Collection: a resource for identification of autism genetic risk factors. Neuron 68(2):192–195.
OpenUrl CrossRef PubMed Web of Science

[57] 57.↵
McKenna A, et al. (2010) The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20(9):1297–1303.
OpenUrl Abstract/FREE Full Text

[58] 58.↵
Li H & Durbin R (2010) Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26(5):589–595.
OpenUrl CrossRef PubMed Web of Science

[59] 59.↵
Wang K, Li M, & Hakonarson H (2010) ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res 38(16):e164.
OpenUrl CrossRef PubMed

[60] 60.↵
Manichaikul A, et al. (2010) Robust relationship inference in genome-wide association studies. Bioinformatics (Oxford, England) 26(22):2867–2873.
OpenUrl CrossRef PubMed Web of Science

[61] 61.↵
Danecek P, et al. (2011) The variant call format and VCFtools. Bioinformatics (Oxford, England) 27(15):2156–2158.
OpenUrl CrossRef PubMed Web of Science

[62] 62.↵
Lek M, et al. (2016) Analysis of protein-coding genetic variation in 60,706 humans. Nature 536(7616):285–291.
OpenUrl CrossRef PubMed

[63] 63.↵
Shohat S, Ben-David E, & Shifman S (2017) Varying Intolerance of Gene Pathways to Mutational Classes Explain Genetic Convergence across Neuropsychiatric Disorders. Cell reports 18(9):2217–2227.
OpenUrl

[64] 64.↵
Iossifov I, et al. (2012) De novo gene disruptions in children on the autistic spectrum. Neuron 74(2):285–299.
OpenUrl CrossRef PubMed Web of Science

[65] 65.↵
O'Roak BJ, et al. (2012) Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations. Nature 485(7397):246–250.
OpenUrl CrossRef PubMed Web of Science

[66] 66.↵
Girard SL, et al. (2011) Increased exonic de novo mutation rate in individuals with schizophrenia. Nat Genet 43(9):860–863.
OpenUrl CrossRef PubMed

[67] 67.↵
Gulsuner S, et al. (2013) Spatial and temporal mapping of de novo mutations in schizophrenia to a fetal prefrontal cortical network. Cell 154(3):518–529.
OpenUrl CrossRef PubMed Web of Science

[68] 68.
McCarthy SE, et al. (2014) De novo mutations in schizophrenia implicate chromatin remodeling and support a genetic overlap with autism and intellectual disability. Mol Psychiatry 19(6):652–658.
OpenUrl CrossRef PubMed Web of Science

[69] 69.↵
Xu B, et al. (2011) Exome sequencing supports a de novo mutational paradigm for schizophrenia. Nat Genet 43(9):864–868.
OpenUrl CrossRef PubMed

[70] 70.↵
de Ligt J, et al. (2012) Diagnostic exome sequencing in persons with severe intellectual disability. The New England journal of medicine 367(20):1921–1929.
OpenUrl CrossRef PubMed Web of Science

[71] 71.
Gilissen C, et al. (2014) Genome sequencing identifies major causes of severe intellectual disability. Nature 511(7509):344–347.
OpenUrl CrossRef PubMed Web of Science

[72] 72.↵
Hamdan FF, et al. (2014) De novo mutations in moderate or severe intellectual disability. PLoS Genet 10(10):e1004772.
OpenUrl CrossRef PubMed

[73] 73.↵
Lage K, et al. (2007) A human phenome-interactome network of protein complexes implicated in genetic disorders. Nat Biotechnol 25(3):309–316.
OpenUrl CrossRef PubMed Web of Science

[74] 74.↵
Kamburov A, Stelzl U, Lehrach H, & Herwig R (2013) The ConsensusPathDB interaction database: 2013 update. Nucleic Acids Res 41(Database issue):D793–800.
OpenUrl CrossRef PubMed Web of Science

[75] 75.↵
Rask-Andersen M, Masuram S, & Schioth HB (2014) The druggable genome: Evaluation of drug targets in clinical trials suggests major shifts in molecular class and indication. Annual review of pharmacology and toxicology 54:9–26.
OpenUrl CrossRef PubMed

De novo damaging coding mutations are strongly associated with obsessive-compulsive disorder and overlap with autism

ABSTRACT

INTRODUCTION

RESULTS

Damaging de novo SNVs and indels are associated with OCD risk

Damaging de novo SNVs and indels contribute to OCD risk in 22% of cases

Recurrent damaging de novo variants identify two candidate risk genes

Approximately 335 genes contribute to OCD risk

Overlap with ASD and CHD8 target genes

Exploratory pathway and network analyses

DISCUSSION

MATERIALS AND METHODS

Subjects

Whole-exome sequencing (WES)

Sequence alignment and variant calling

Quality control and de novo variant calling

Mutation rate analysis

TADA analysis

Estimation of number of risk gene

Estimation of future risk gene discovery

Gene set overlap

Exploratory pathway and network analyses

Gene set overlap using DNENRICH

Supplemental Tables

ACKNOWLEDGEMENTS

REFERENCES

References

Citation Manager Formats

Subject Area