Evaluation of Chromatin Accessibility in Prefrontal Cortex of Schizophrenia Cases and Controls

Julien Bryois; Melanie E Garrett; Lingyun Song; Alexias Safi; Paola Giusti-Rodriguez; Graham D Johnson; Alfonso Buil Demur; John F Fullard; Panos Roussos; Pamela Sklar; Schahram Akbarian; Vahram Haroutunian; Craig A Stockmeier; Gregory A Wray; Kevin P White; Chunyu Liu; Timothy E Reddy; Allison Ashley-Koch; Patrick F Sullivan; Gregory E Crawford

doi:10.1101/141986

Abstract

Schizophrenia genome-wide association (GWA) studies have identified over 150 regions of the genome that are associated with disease risk, yet there is little evidence that coding mutations contribute to this disorder. To explore the mechanism of non-coding regulatory elements in schizophrenia, we performed ATAC-seq on adult prefrontal cortex brain samples from 135 individuals with schizophrenia and 137 controls, and identified 118,152 ATAC-seq peaks. These accessible chromatin regions in brain are highly enriched for SNP-heritability for schizophrenia (10.6 fold enrichment, P=2.4×10⁻⁴, second only to genomic regions conserved in Eutherian mammals) and replicated in an independent dataset (9.0 fold enrichment, P=2.7×10⁻⁴). This degree of enrichment of schizophrenia heritability was higher than in open chromatin found in 138 different cell and tissue types. Brain open chromatin regions that overlapped highly conserved regions exhibited an even higher degree of heritability enrichment, indicating that conservation can identify functional subsets within regulatory elements active in brain. However, we did not identify chromatin accessibility differences between schizophrenia cases and controls, nor did we find an interaction of chromatin QTLs with case-control status. This indicates that although causal variants map within regulatory elements, mechanisms other than differential chromatin may govern the contribution of regulatory element variation to schizophrenia risk. Our results strongly implicate gene regulatory processes involving open chromatin in the pathogenesis of schizophrenia, and suggest a strategy to understand the hundreds of common variants emerging from large genomic studies of complex brain diseases.

Introduction

Schizophrenia genomics is progressing rapidly, and our mechanistic understanding of this common and often devastating neuropsychiatric disorder is markedly better than five years ago.¹ Evidence for a non-specific genetic component for schizophrenia has been known for decades (e.g., via sibling recurrence risk of 8.6 and phenotype heritability estimates of at least 60%).^2–6 The bulk of the genetic basis of schizophrenia is due to common variation.⁷ A 2014 paper identified 108 genetic regions and a subsequent report has added over 40 new regions, but the implicated regions are broad and usually do not implicate specific genes.^7–9 It was hypothesized that schizophrenia risk would include many exonic variants of strong effect,^{10, 11} but subsequent large whole exome sequencing studies provide minimal support for this hypothesis.^12–14 Identifying “actionable” genes has proven complex,¹⁵ with a few exceptions such as rare exon variants in SETD1A¹⁶ and copy number variation in single genes like NRXN1 and C4.^17–19

These studies provide strong evidence that genetic risk for schizophrenia results from the concerted effects of many genes. Schizophrenia may be a disorder of subtly altered amounts of protein isoforms rather than changes in individual amino acids. Non-coding regulatory variation is a major contributor to risk for schizophrenia,^20–26 and genomic regions associated with schizophrenia are enriched for gene expression quantitative trait loci (eQTLs) identified in human brains.^{27, 28} Combining functional genomic data with GWA results may be crucial to deciphering connections to specific genes and disease mechanisms in schizophrenia.^28–30 ENCODE³¹ and the Roadmap Epigenomics Mapping Consortium³² provided considerable human functional genomic data and insights into genomic function. However, these studies provided limited insight into psychiatric disorders as most samples were non-neuronal and none were from affected individuals. This study is part of the PsychENCODE consortium which is intended to provide functional genomic data from the brains of individuals with and without severe neuropsychiatric disorders.³³

Epigenetic changes in brain are widely hypothesized to partly mediate risk for schizophrenia.³⁴ Multiple epigenetic changes have been assessed in schizophrenia (reviewed in³⁵), and methylation differences in peripheral blood^36–38 and brain³⁹ have been associated with schizophrenia. Of the many epigenetic changes, chromatin accessibility is particularly important and is a conserved eukaryotic feature characteristic of active regulatory elements, including promoters, enhancers, silencers, insulators, transcription factor binding sites and active histone modifications.^{40–43,43–45} Chromatin accessibility has not been systematically evaluated in human brain for schizophrenia, except for a small study.⁴⁶ Preferentially accessible regions of chromatin can readily be identified by high throughput sequencing following the transposition of sequencing adaptors into the DNA backbone via the Tn5 transposase using the assay for transposase-accessible chromatin sequencing (ATAC-seq).⁴⁷ Unlike other nuclease-sensitivity assays, this approach is amenable to limited amounts of postmortem tissue. As chromatin accessibility can differ by >30% between tissues and cell types,⁴³ it is important to study the brains of schizophrenia cases and controls.

Our overall goal is to comprehensively identify active gene regulatory elements in a brain region relevant to schizophrenia, and to quantify how genetic variation alters function such as SNPs that alter chromatin accessibility (i.e., chromatin quantitative trait loci, cQTL). We use these data to parse genetic risk for schizophrenia from large GWA. These analyses provide insight into the molecular mechanisms governing schizophrenia risk. To our knowledge, this is the largest study of chromatin accessibility in schizophrenia and among the largest for any human disease.

Results

In the Online Methods, we provide the rationale for our choices of ATAC-seq,⁴⁷ brain region, and study design. Key features of our approach include the use of the same samples subjected to mRNA-seq and genotyping analysis by the CommonMind Consortium,²⁸ as well as careful experimentation including randomization, blinding, comprehensive quality control, empirical covariate selection, and verification of subject identity.

We performed ATAC-seq on 314 brain samples (142 schizophrenia, 143 control, 23 mood disorders, and 6 other). After quality control, the analysis dataset consisted of ATAC-seq on postmortem Brodmann area 9 (dorso-lateral prefrontal cortex, DLPFC) tissue from 135 cases with schizophrenia and 137 controls (Figure 1A). Sixteen individuals with mood disorders were included for open chromatin peak calling and cQTL analyses but otherwise excluded. Table 1 summarizes the demographic and clinical features of the subjects. Cases and controls were comparable for sex, ethnicity, age at death, and postmortem brain pH. Cases had greater postmortem intervals and lower RNA integrity number (RIN) scores relative to controls. These differences appeared to have a lesser impact on DNA-based ATAC-seq assays as cases were comparable to controls for unique aligned reads and normalized open chromatin peak calls. However, these differences motivated comprehensive and careful selection of covariates (Online Methods).

Figure 1: ATAC-seq on frozen DLPFC samples

(A) Sequencing statistics from all libraries show largely similar total number of reads (pink), uniquely aligning reads (green), and reads that map to ATAC-seq peaks (blue). (B) Individual brain ATAC-seq data in a representative genomic region showing largely congruent identification of regions of open chromatin. Some samples have lower signal-to-noise (correlated with covariates like postmortem interval and RNA integrity number). Region in red is a chromatin QTL, note lower signal for indivdiuals with AA genotype vs. GG genotype. (C) Open chromatin in relation to transcription start sites (TSS). The TSS is at zero, right is inward in the direction of transcription, and left is outward from the gene. Density curves for all known GENCODE v25 transcripts (one principal transcript per protein-coding gene). Colored curves show different expression levels in DLPFC: Q0=no expression, Q1-Q4=lowest to highest expression quartiles. (D) Number of DLPFC ATAC-seq peaks that overlap with putative regulatory elements identified from 111 different cell types analyzed by the Epigenome Roadmap Project. Approximately 15% (n=~17,000) ATAC-seq peaks are unique to DLPFC. (E) Location of DLPFC-only ATAC-seq peaks relative to the TSS indicates the majority are non-promoter regions.

View this table:

Table 1:

Sample description. All samples were from Brodmann area 9 of left hemisphere. ^† Additional ethnicities in cases were African-American (17, 12.6%), Hispanic (5, 3.7%), and Asian (1, 0.74%), and in controls African-American (20,14.6%), Hispanic (16, 11.7%), and Asian (3, 2.2%). ‡ Full details are in reference.²⁸ The proportions of oligodendrocytes (0.038), endothelial cells (0.004), microglia (0.0007) were similar in cases and controls. The RNA-based measures are pertinent for the DNA-based ATAC-seq assay as the samples were from the same aliquots.

ATAC-seq evaluation

ATAC-seq data were aligned and peaks were called using MACS2 (FDR < 0.01; Figure 1B). We identified 118,152 open chromatin peaks totaling 35.5 Mb. As a crude comparison, the human exome has around 131K exons totaling 47 Mb. We compared these regions of open chromatin to those from smaller experiments (Table S1). Allowing for small sample sizes, overlap of our open chromatin results with that identified in these other studies was greatest for the most similar studies (DLPFC in adults), somewhat lower in adult cortical samples of sorted neurons, and lower in fetal cortex. Overlap with the diverse ENCODE samples was low, but higher for CNS-relevant samples. These results show congruence of our larger ATAC-seq data with prior experiments, and underscore the need to study brain.

View this table:

Table S1.

Overlap of ATAC-seq data from this study with external samples. CNS=central nervous system. DHS=DNase I hypersensitivity. NeuN is a marker for post-mitotic neurons. This table compares our ATAC-seq dataset with regions of open chromatic identified in other experiments including ENCODE,³¹ Roadmap Project,³² a pilot study from Roussos and colleagues,⁴⁶ and a separate pilot study from our team (unpublished). Given the widely varying sample sizes and different methods used, we queried the proportion of the open chromatin overlapping with our study. Overlap of open chromatin identified in these other studies was greatest for the most similar studies (DLPFC in adults), somewhat lower in adult samples of sorted neurons, and considerably lower in fetal cortex. Overlap of ENCODE data was low for the full set of highly diverse tissues and cell lines but somewhat higher for the set of CNS-relevant tissues and cell lines. This pattern of results suggests the congruence of our data with those from these other studies.

About a quarter of open chromatin regions map near a transcription start site (TSS): 23% were ±5 kb, and 53.0% were >25 kb from any TSS. This distribution was similar to previous studies using DNase-seq.^{48, 49} There is a stronger enrichment for ATAC-seq reads at the TSS for genes that are highly expressed compared to genes that were not expressed (Figure 1C). We compared the 118,152 peaks to putative regulatory elements from 111 cell types from reg2map as part of the Epigenome Roadmap project (URLs). We found that 85% of our brain ATAC-seq peaks overlap a promoter or enhancer in one or more of these cell types (Figure 1D). This indicates 15% ATAC-seq peak calls are unique to the brain frontal cortex and highlights the need to identify putative regulatory elements from disease relevant tissues. Most of these DLPFC-unique ATAC-seq peaks did not map to promoter regions (Figure 1E).

ATAC-seq peaks are enriched for schizophrenia heritability

We evaluated the relevance of these DLFPC open chromatin regions to schizophrenia using partitioned LD score regression.⁵⁰ This method evaluates whether the SNP-heritability of schizophrenia is enriched in pre-defined genomic features. This approach accounts for multiple technical issues, and conducts a head-to-head comparative evaluation of dozens of genomic features. An earlier analysis found that the SNP-heritability of schizophrenia was strongly enriched in evolutionary conserved genomic regions, but not in open chromatin regions from non-brain cell lines or tissues.⁵⁰ We tested heritability enrichment using the same genomic features⁵⁰ including our DLPFC ATAC-seq peaks and an additional annotation created by expanding the ATAC-seq peaks by ±250 bp to prevent upward bias in heritability estimates.^{50, 51}

We found that regions of open chromatin in adult DLPFC were very strongly enriched for genetic variation relevant for schizophrenia (Figure 2A): the 1.1% of SNPs located in ATAC-seq peaks explained 11.7% of the SNP-heritability of schizophrenia (10.6 fold enrichment, P=2.4×10⁻⁴). This level of enrichment was almost as large as conserved regions.⁵⁰ We replicated this result in an independent ATAC-seq dataset from the Chicago psychENCODE group (UIC and U of C) based on 295 adult DLPFC samples from an independent cohort processed with the same bioinformatics pipeline (Figure 2B and Figure S1). We obtained strikingly similar results with 1.5% of the SNPs located in this new set of peaks explaining 13.8% of the SNP-heritability of schizophrenia (9.0 fold enrichment, P=2.7×10⁻⁴). Thus, common genetic variation that mediates risk for schizophrenia is not randomly distributed in the genome but is concentrated in definable genomic features, particularly regions of open chromatin in brain cortex.

Figure 2: Schizophrenia heritability is enriched for brain-specific accessible chromatin

A) Schizophrenia heritability enrichment (standard error) and significance level (−log10(P)) of different functional genomic and genomic annotations. Enrichment of ATAC-seq regions in DLPFC is second only to genomic regions conserved across 29 Eutherian mammals. B) Enrichment across a subset of 125 DNase-seq and ATAC-seq datasets (see Figure S9 for complete comparison). The black bar represents the Bonferroni significance threshold (=0.01/125). Top enrichments are in DLPFC ATAC-seq peaks generated from Duke and U Chicago groups, followed by other mostly brain specific tissues and cell lines. ATAC-seq tissue samples (cerebellum, liver, muscle, heart, kidney, and lung) represent an independent study to control for batch and method effects C) Schizophrenia heritability enrichment and standard error for peaks of different width. D) Heritability enrichment of ATAC-seq peaks from brain display significant enrichment (* = Bonferroni corrected P < 0.01) for GWA variants associated with schizophrenia, but not for educational attainment (Edu), height, and total cholesterol (TC). E) Heritability enrichment of evolutionary conserved regions are significantly enriched for schizophrenia, educational attainment, and height, but not TC. F) Heritability is 4x more enriched for regions that overlap between ATAC-seq and conservation than for regions that are either conserved or in ATAC-seq peaks.

To evaluate the specificity of this result, we compared open chromatin from DLPFC to that in 124 cell types and tissues generated by the ENCODE Consortium using DNase-seq. The DLPFC ATAC-seq data displayed the greatest association with schizophrenia compared to any other cell or tissue type tested (Figure 2B and Figure S2). These data again indicate the importance of studying relevant tissues from cohort samples. DNase-seq from frontal cortex, cerebellum, a neuroblastoma cell line, and a medulloblastoma cell line also showed more association relative to samples not from brain (Figure S2). We generated ATAC-seq from fetal brain samples, and showed slightly lower but still significant enrichment, indicating that relevant regulatory regions that confer risk are accessible earlier in development (Figure 2B). Since DNase-seq was used for most of the 124 cell types, we generated ATAC-seq from six diverse human tissues (cerebellum, liver, skeletal muscle, heart, kidney, and lung) and found significant association with schizophrenia only in cerebellum (Figure 2B).

The regions flanking ATAC-seq peaks were markedly enriched for schizophrenia heritability. This suggested that causal variants may extend beyond the center of ATAC-seq peaks (Figure 2A). This enrichment could be due to our stringent quality control limiting peak size or number, or because open chromatin boundaries vary between individuals.⁵⁰ To test whether SNPs nearer to the peak centers were more likely to be causal, we determined heritability enrichment of ATAC-seq peaks of varying widths (100 bp, 300 bp, 1 kb, 2 kb, 5 kb, and 10 kb). Schizophrenia SNP-heritability enrichment decreased as peak width increased (Figure 2C), indicating that SNPs nearer to the peak center explained more SNP-heritability than SNPs farther away.

The SNP-heritability enrichment of DLPFC ATAC-seq peaks was specific to schizophrenia, and was not significantly enriched for GWA variants for educational attainment, height, or total cholesterol (Figure 2D). This is in contrast to evolutionarily conserved regions, which are enriched for GWA variants for schizophrenia, educational attainment, and height (Figure 2E). To investigate whether sub-regions of the ATAC-seq peaks were particularly enriched for schizophrenia SNP-heritability, we intersected the ATAC-seq peaks with evolutionarily conserved regions,⁵² and found that conserved regions located in DLPFC ATAC-seq peaks were more highly enriched in SNP-heritability (38-fold enrichment; Figure 2F). Conserved regions that map within ATAC-seq peaks cover 6 Mb, much smaller than the 35 Mb covered by ATAC-seq peaks. The intersection of open chromatin and conserved regions tended to be near protein coding transcription start sites (TSS) with 41% located ±10kb from the TSS of the closest protein coding gene whereas only 30.9% of the ATAC-seq peaks and 25.3% of the conserved regions were located ±10kb of a TSS.

Conserved regions in ATAC-seq peaks are significantly enriched for CTCF binding sites (Homer P-value < 1 × 10⁻¹⁰⁰, MEME-chip evalue < 1 × 10⁻¹⁰⁰), which are involved in the formation of topological associated domains,⁵³ and other transcription factors involved in neuron differentiation (Table S2).⁵⁴ This indicates that common genetic variation implicated in schizophrenia could impact higherorder DNA conformation and/or affect neuron differentiation. Our results support that DLPFC ATAC-seq peaks are enriched for causal variants in schizophrenia, and that restricting these regions to those that are evolutionarily conserved reduces the search space for putative casual variants.

View this table:

Table S2.

Enrichment of transcription factor binding motifs in DLPFC ATAC-seq peaks that overlap evolutionarily conserved sequences. Shown are top enriched motifs using two different motif enrichment methods, Homer and MEME-chip.

Identification of differential open chromatin

We next explored possible mechanisms of how these non-coding variants contribute to schizophrenia risk. From an exhaustive list of potential technical and clinical covariates, we identified 13 empirical covariates associated with chromatin peak quantification (Online Methods, Figure S3). Controlling for these covariates, we evaluated differential chromatin for age at death and postmortem interval to show that we can detect subtle differences in chromatin accessibility. For age at death, we detected 2,310 sites showing significant differences (FDR < 0.05) in chromatin accessibility (Figures 3A-B). These differences may indicate changes in cell heterogeneity (e.g., fewer neurons) or possibly differential chromatin within one or more cell types within the brain that occurs during the aging process. For postmortem interval, we detected 466 sites showing significant differences in chromatin accessibility (Figures 3C-D), suggesting that specific regions of the genome are more sensitive to chromatin degradation after death. However, when comparing schizophrenia cases to controls, we detected only three regions with differential open chromatin (Figures 3E-F). These three regions did not overlap schizophrenia GWA loci and are not nearby genes that are differentially expressed between cases and controls.²⁸ To rule out the possibility of samples with lower signal-to-noise interfering with ability to identify differential chromatin, we performed the same differential chromatin assessment with 62 cases and 75 controls with the highest signal-to-noise, which also resulted in few significant differences in accessibility between schizophrenia and control samples.

Figure 3: Differential chromatin accessibility differences detected in 288 brain samples

A) Differential chromatin detected as a function of age at death. Dots in red indicate significance (FDR < 0.05). B) Distribution of P-values for age. C) Differential chromatin detected as a function of postmortem interval (PMI). D) Distribution of P-values for PMI. E) Differential chromatin detected as a function of case-control status. F) Distribution of P-values for case-control status. Peaks on chrX were meta-analyzed (inverse variance weighted) and peaks on chrY were only tested in males

A more focused analysis on 1,119 ATAC-seq peaks that map within 96 schizophrenia GWA regions⁸ did not identify any differentially accessible peaks between schizophrenia cases and controls (5% FDR), as well as no enrichment in lower P-values. Furthermore, a focused analysis on ATAC-seq peaks located near genes (30 kb upstream to 10 kb downstream) differentially expressed between cases and controls in the same samples²⁸ (693 genes, 5% FDR, 4,614 ATAC-seq peaks) did not result in significant differences (5% FDR) and with no enrichment in lower P-values. Finally, an analysis of multiple pathways previously associated with schizophrenia, namely genes with RBFOX binding sites, genes whose mRNA bind FMRP, or genes that are the target of antipsychotics^{9, 55}, did not result in significant differences between cases and controls (data not shown).

Identification of chromatin QTL (cQTL)

Previous studies showed that schizophrenia risk alleles are enriched for brain eQTLs.²⁷ To determine if genetic variants associated with schizophrenia can also impact chromatin accessibility, we performed a cQTLs analysis using our DLPFC ATAC-seq data (Figure 4, Online Methods). In total, we identified 6,200 SNPs that were significantly associated with differences in chromatin accessibility (5% FDR, Figure 4A). There was no skewing of rare or common allele frequency in this subset of SNPs (Figure 4B). The influence of gene variation on chromatin accessibility appeared to be localized to nearby regions as 10% of cQTLs (n = 622) were adjacent to (±2 kb) or within the center of the peak with which they were associated (52.0%; Figure 4B).

Figure 4: Identification of chromatin QTLs (cQTLs)

A) QQ plot of cQTLs using window size of 5kb (green) or 50kb (black). Both show marked deviations from the expected. B) Distance of cQTLs relative to the center of ATAC-seq peaks as a function of minor allele frequency. C) Most significant cQTL, rs1549428 (chr12:9436157–9436457); individuals homozygous for the reference allele (0) display more chromatin accessibility than individuals that are heterozygous (1) and homozygous for the alternate allele (2). D) Most significant cQTL that is also a schizophrenia GWA loci, rs11615998 (chr12:2364960–2365260). Due to a low MAF (0.059) only 1 individual was homozygous for the alternate allele. The results were comparable with or without this homozygous individual.

The most significant cQTL was rs1549428 associated with an open chromatin peak at chr12:9436157-9436457 (q=3.9×10⁻⁸², Figure 1B, Figure 4C). Only 176 of the 6,200 cQTLs (2.8%) were located in a schizophrenia GWA locus,⁸ the most significant of which was rs11615998 in CACNA1C (q=2.7×10⁻⁴⁴, Figure 4D). As an independent confirmation, we show that individuals who are heterozygous for cQTL alleles display allele skewing (Figure 5A-B) in the direction that is expected (Figure 5C). Similar to eQTLs,²⁸ we detect no significant enrichment of cQTLs in schizophrenia GWA regions.

Figure 5: Heterozygous individuals for cQTLs display allele bias

A) Allele counts across cQTLs for individuals homozygous for reference alleles (green), homozygous for alternative alleles (red), or heterozygous for cQTL variants (blue). B) Same as A, but for non-significant cQTLs. C) For individuals heterozygous for cQTLs, we determined the % of each cQTL that correlated to the alternate allele (y-axis) and compared those values to the cQTL effect size (x-axis).

cQTLs overlap with eQTLs

We compared cQTLs to previously identified eQTLs in the same dataset.²⁸ The estimated true proportion of significant cQTLs that are also eQTLs is 23.3% (Online Methods), which agrees with similar estimates from lymphoblastoid cell lines.⁵⁶ For SNPs that were both cQTLs and eQTLs, 63.6% were concordant (p < 1 × 10⁻⁴), meaning the allelic association indicated both more open chromatin and higher gene expression (Figure 6A). Despite this correlation of direction of the effect, there was no correlation in the sizes of the effect (Figure 6B). Finally, for SNPs in high LD, the direction of effect for cQTL and eQTL was not always the same (Figure 6C-6D). Looking at the direction for all QTLs that have pairwise SNPs with r² > 0.8, approximately 31% go in the opposite direction. This is roughly the same percentage going in the opposite direction as it is among all QTLs, regardless of LD (Figure 6A). This may indicate independent mechanisms by which these SNPs impact gene expression.

Figure 6: SNPs that are both eQTLs and cQTLs

A) Direction of effects of cQTL vs. eQTL. Concordant effects are when cQTL allele associated with greater open chromatin is also associated with higher eQTL expression. B) Even though the direction of cQTL and eQTL is largely concordant, the effect sizes of these differences are only modestly correlated (r=0.21). C) Three SNPs in high LD on chromosome 1. The tested allele is associated with lower expression and more open chromatin for all three SNPs. Distance between SNP1-2 is 10 kb, and distance between SNP2-3 is 6.3 kb. D) Three SNPs in high LD on chromosome 2. The tested allele is associated with lower expression in all three instances, but with more open chromatin for two SNPs and more closed chromatin for the third SNP. Distance between SNP1-2 is 2kb and distance between SNP2-3 is 20kb.

cQTL-schizophrenia interaction

Finally, we attempted to identify cQTL SNPs that interacted with schizophrenia status to predict open chromatin regions, but found that none were significant at an FDR below 5%. The number of nominally significant cQTLs in a schizophrenia-associated GWA locus also did not deviate from chance expectations. These results are in line with the eQTL study performed by the CommonMind Consortium that did not detect many eQTLs that interacted with schizophrenia status.²⁸

Discussion

Groups such as the Psychiatric Genomics Consortium have generated some of the largest GWA data for schizophrenia to date.¹ These studies have largely pointed to non-coding regions of the genome, suggesting that variants in gene regulatory elements contribute to schizophrenia risk. This is supported by whole exome sequencing efforts that have identified few pathogenic coding variants, 14 as well as expression data showing that brain eQTLs are enriched for schizophrenia association statistics.²⁸ Thus, we believe that the next step in schizophrenia research is to provide more targeted evidence that schizophrenia risk alleles indeed fall within putative gene regulatory elements, and to characterize the mechanism of those non-coding variants and how they alter gene expression levels. As part of the psychENCODE project,³³ we have made progress in both areas, and summarize our results in Table 2.

View this table:

Table 2:

Summary of results pertaining to schizophrenia genomic findings.

Our study shows that, of diverse genomic and epigenomic datasets that span many different cell types and tissues, ATAC-seq data from DLPFC displays is the most associated with schizophrenia common variant GWA results. Furthermore, genomic regions conserved across 29 Eutherian mammals within adult DLPFC ATAC-seq peaks display an even higher degree of heritability enrichment. This strongly suggests that multiple orthologous data types will be needed to identify causal variants that contribute to schizophrenia risk. This important discovery narrows the search space for identifying causal alleles.

Our initial hypothesis was that these non-coding risk alleles worked through a mechanism that altered chromatin accessibility. This is based on brain eQTLs being enriched for schizophrenia risk alleles,²⁷ nearly 600 genes displaying differential expression in brain samples from cases and controls,²⁸ and that cQTLs explain a large number of eQTLs in lymphoblastoid cell lines.⁵⁶ Our hypothesis was not confirmed as we did not detect differential chromatin accessibility between schizophrenia cases and controls. This is despite the fact that we were able to detect subtle differences in chromatin accessibility across different age groups and different post mortem intervals, as well as detecting thousands of cQTLs that largely overlap with eQTLs identified in the same samples. Similar to the CommonMind eQTL paper that did not detect many eQTLs that interacted with case/control status,²⁸ we did not detect a significant interaction of cQTLs with case/control status. Together, these data indicate that differential gene expression between schizophrenia cases and controls may not be manifested as differential chromatin accessibility.

While it appears that differential chromatin accessibility is not a biological “readout” of non-coding genetic risk for schizophrenia, we cannot rule out that differential chromatin accessibility and/or cQTLs may still be involved. The sample size of our study may have been underpowered to identify differences in chromatin state between cases and controls. We also cannot rule out that cell heterogeneity may mask rare but important cell-type specific signals. In other work we implicated cortical pyramidal neurons as an important cell type for schizophrenia,⁵⁷ and as pyramidal neurons comprise ~40% of the DLPFC, this may be less likely. Furthermore, in silico mixing experiments demonstrated that differential chromatin accessibility can accurately be attributed to specific cell types comprising a heterogeneous sample (Figure S4). Another possibility is that differential chromatin accessibility between cases and controls is manifested only within specific developmental windows. Indeed, we have identified chromatin accessibility differences that occur during development in post-mitotic mouse neurons.⁵⁸ While careful age-matched comparisons are needed in human to explore this possibility, identifying differences between cases and controls will be challenging as psychotic symptoms typically do not appear until early adulthood.

While these scenarios above are feasible, it is possible that functional non-coding SNPs (including eQTLs) contribute to schizophrenia in a mechanism independent of changes in chromatin accessibility. These functional non-coding variants may still impact specific transcription factor binding sites but overall chromatin accessibility might be maintained and stabilized by other transcription factors and complexes that bind at these regions. Indeed, conserved sequences that are in brain open chromatin regions are highly enriched in CTCF and other transcription factors involved in neuron differentiation. Altering CTCF binding might also impact three dimensional genome structure to allow distal enhancers to abnormally influence gene expression.⁵⁹ Strategies to explore these possibilities include ChIP-seq for specific transcription factors, and Hi-C on brains representing different genotypes. Additional independent strategies include functional detection of variants on regulatory element activity using high-throughput reporter assays like POP-STARR-seq⁶⁰ or regulatory element CRISPR screens.⁶¹

Future experiments tackling these issues will be needed if we are to further understand the mechanism behind schizophrenia risk as well as disease risk for many other common disorders. These results can help narrow the search space for those studies.

URLs

BedTools, http://bedtools.readthedocs.io/en/latest/index.html

bowtie2, http://bowtie-bio.sourceforge.net/bowtie2

cutadapt, http://cutadapt.readthedocs.io

MACS2, https://github.com/taoliu/MACS

Mt Sinai NIH Brain and Tissue Repository, http://icahn.mssm.edu/research/labs/neuropathology-and-brain-banking

Open chromatin “blacklists”, https://sites.google.com/site/anshulkundaje/projects/blacklists

Picard tools, https://broadinstitute.github.io/picard

psychENCODE Knowledge Portal, https://www.synapse.org/#!Synapse:syn4921369/wiki/235539

reg2map, https://personal.broadinstitute.org/meuleman/reg2map/HoneyBadger_release

samtools, https://github.com/samtools

UCSC tools, http://hgdownload.cse.ucsc.edu

MEME-chip, http://meme-suite.org/tools/meme-chip

Conflicts of Interest

PFS is a scientific advisor for Pfizer, Inc. GEC and TER are co-founders of Element Genomics, Inc.

Online Methods

Study design

We conducted a case-control comparison of brain samples to investigate the role of chromatin accessibility in the etiology of schizophrenia. To our knowledge, this is one of the largest studies to date of open chromatin in brain for schizophrenia. Key features were: use of ATAC-seq (a newer method of identifying regions of open chromatin);⁴⁷ use of the same brain tissue samples studied with RNA-seq by the CommonMind Consortium;²⁸ and careful experimental design (including randomization, blinding, comprehensive quality control, empirical selection of salient covariates, and verification of subject identity). We attempted ATAC-seq on 314 brain samples (142 schizophrenia, 143 control, 16 bipolar disorder, 7 affective disorder, and 6 other). After the quality control procedures described below, the analysis dataset consisted of ATAC-seq on postmortem brain samples from 135 cases with schizophrenia and 137 controls. For some analyses (e.g., identification of brain cQTLs, covariate selection, differential chromatin analysis), we included 16 individuals with mood disorders. The purpose of including these individuals was to increase power for the cQTLs detection, to increase power to detect covariates affecting ATAC-seq peaks quantification and to better estimate parameters that are not dependent on the case/control status in the differential chromatin analysis.

Rationale: choice of intact brain versus cell populations

The brain is a complex mixture of cell types.⁶² At present, there is no ideal approach to comprehensively deal with cell heterogeneity. (1) Nuclei sorting from frozen brain provides enrichment for specific cell types.^63–67 However, sorting is never perfect, relatively few cell types can be sorted in humans, and cells may change state during the 5+ hour process of thawing, dissociation, ultra-centrifugation, and sorting. Sorting neuronal populations from mice with a cell-type specific fluorescent tag is possible, but there is only a ~50% overlap in regulatory regions between mice and human⁶⁸ and Crawford, unpublished). (2) Laser capture microdissection on frozen tissue provides spatial resolution, but yields limited quantity and quality of chromatin, and artifacts from thawing and excess heat are concerns. (3) Single-cell analysis of open chromatin is being developed but is technically difficult, resolution is currently limited, and is not yet available for frozen brain samples.⁶⁹ (4) Ex vivo stem cell cultures can yield a realistic cell type but all current ex vivo microenvironments do not recapitulate the normal development of the human brain. A further challenge is that the neural cell types that contribute to schizophrenia are not well characterized.⁷⁰

We performed in silico mixing experiments and demonstrated that we can detect cell type specific gene regulatory elements at various cell concentrations (Figure S4A). Our analysis of the pros and cons of using intact tissues versus cell sorting is shown in Figure S4B. As the cell types causally related to schizophrenia are unknown, we believe that generating chromatin maps in intact brain–i.e., the union of all cell types present in a brain region associated with schizophrenia–provides one strategy of identifying schizophrenia-relevant regulatory elements. Using intact tissue will allow us to analyze more samples, providing better power to identify chromatin QTLs.

Rationale: choice of brain region

Dorsolateral prefrontal cortex (DLPFC) samples corresponding to, Brodmann areas 9 and 46 were studied. These regions were chosen due to their relevance to schizophrenia based on brain anatomy, imaging, and gene expression.^71–94 DLPFC was also the focus of a recent paper from the CommonMind Consortium.²⁸ In fact, RNA-seq from that study and ATAC-seq data from this study have been generated on tissue aliquots isolated from the same DLPFC dissections.

Subjects and brain samples

We used ATAC-seq to characterize human DLPFC cortical samples from the Mt Sinai NIH Brain and Tissue Repository (see URLs, Dr. Vahram Haroutunian). All cases met DSM-IV criteria for schizophrenia via standard diagnostic procedures.²⁸ Controls had never met criteria for schizophrenia or a psychotic disorder. Subjects were excluded if they had neuropathology related to Alzheimer’s or Parkinson’s disease, acute perimortem neurological insults, or were on a mechanical ventilator near the time of death. No case or control had a large pathogenic copy number variant, and cases had inherited a significantly greater number of schizophrenia risk alleles.⁹⁵

All schizophrenia cases and all controls were dissected from the left hemisphere of fresh frozen coronal slabs cut at autopsy from the DLPFC corresponding to Brodmann area 9). All bipolar disorder samples were from Brodmann area 46. Immediately after dissection, samples were cooled to −190 °C and dry homogenized to a fine powder using a liquid nitrogen-cooled mortar and pestle. Aliquots from each sample were prepared, and used for multiple purposes including ATAC-seq (reported here), RNA-seq,²⁸ and SNP genotyping (Illumina OmniExpressExome array). Tissue aliquots were shipped as a dry powder on dry ice to the Crawford lab at Duke University.

Additional adult dorsolateral prefrontal cortex samples (Brodmann area 9, see Table S1) were dissected from postmortem samples from nine adult schizophrenia cases and nine adult control brains, and were obtained from Dr Craig Stockmeier (University of Mississippi Medical Center). Cases and controls were sex- and age-matched. All adult samples were of European ancestry. Controls had no history of psychiatric disorders or substance abuse.

Frontal cortex from nine fetal brains (Table S1), gestation age 17-19 weeks, were obtained from the NIH NeuroBiobank (https://neurobiobank.nih.gov). All fetal samples were of African American ancestry. Samples were genotyped on the Illumina Human Omni Express chip in order to confirm sample integrity. Samples were dry homogenized to a fine powder using a liquid nitrogen-cooled mortar and pestle. Aliquots from each sample were prepared, and used for multiple purposes including ATAC-seq, RNA-seq, and DNA microarray. Sample processing was conducted blind to case-control status.

Confirmation of sample identity

We confirmed subject identity by comparing Illumina SNP genotypes from the Illumina OmniExpressExome array to those recoverable from the ATAC-seq reads (described below). ATAC-seq reads were aligned to hg19 using bowtie2 and variants were called using the multi-sample HaplotypeCaller according to best practices in the Genome Analysis Toolkit.⁹⁶ To achieve a high level of confidence in the variant calls from ATAC-seq, we only kept variants in peaks with a mean read depth ?10 and with minor allele frequency > 0.05. This stringent filtering yielded 10,939 SNPs present in both the ATAC-seq and Illumina data. Identity-by-decent was then estimated for each pairwise combination of ATAC-seq and GWA samples using PLINK⁹⁷ For two subjects, genotypes from ATAC-seq and Illumina genotyping did not match and were excluded in all further analyses.

ATAC-seq library preparation and sequencing

Samples were processed in batches of eight. Samples were randomly assigned to batches that were balanced with respect to case/control status and sex. Sample processing was conducted blind to case-control status. Frozen pulverized brain samples were received from the Mt. Sinai Brain Repository. Approximately 20 mg of pulverized material was used for ATAC-seq. Frozen samples were thawed in 1 ml of nuclear isolation buffer (20 mM Tris-HCL, 50 mM EDTA, 5mM Spermidine, 0.15 mM Spermine, 0.1% mercaptoethanol, 40% glycerol, pH 7.5), inverted for 5 minutes to mix, and samples were filtered through Miracloth to remove larger pieces of tissue. Samples were centrifuged at 1100 × g for 10 min at 4°C. The resulting pellet was washed with 50 μl RSB buffer, centrifuged again, and supernatant was removed. The final crude nuclear pellet was re-suspended in transposition reaction mix and libraries prepared for sequencing as described in Buenrostro et al.⁴⁷ All samples were barcoded, and combined into pools. Each pool contained 8 randomly selected samples (selection balanced by case/control status and sex). Each pool was sequenced on two lanes of an Illumina 2500 or 4000 sequencer (San Diego, CA, USA) at the Duke Sequencing and Genomic Technologies shared resource.

ATAC-seq initial processing

The raw fastq files were processed through cutadapt (version 1.2.0, URLs)⁹⁸ to remove adaptors and low-quality reads. cutadapt-filtered reads were aligned to hg19 using bowtie2 (version 2.1.0, URLs)⁹⁹ using default parameters. In alignment, all reads are treated as single-read sequences, regardless of whether ATAC-seq libraries were sequenced as single-end or paired-end. The aligned bam files were sorted using samtools (version 0.1.18, URLs),¹⁰⁰ duplicates removed using Picard MarkDuplicates, and then converted to bed format using BedTools (version: v2.17.0, URLs).¹⁰¹ ENCODE “blacklist” regions were removed (i.e., empirically identified genomic regions that produce artefactual high signal in functional genomic experiments, URLs). Narrow open chromatin peaks were called from the final bed files using MACS2, with parameter --nomodel --shift −100 --ext 200. For visualization, bigwig files were generated using wigToBigWig (version 4, URLs)¹⁰² and bedgraph files were output by MACS2. All data has been submitted and made publicly available on Synapse.

Identification of sample outliers

We conducted an empirical analysis to identify outliers. An initial analysis identified 8 samples that had only had single end sequencing (unlike the paired end used for all other samples). These samples were excluded.

Performance of samples

A total of 314 libraries were sequenced across 86 lanes of either Illumina 2500 or 4000, and generated 53,556,161,474 sequences, which total 7,839,829,094,924 bp of data. After filtering with cutadapt, 51,639,643,049 (96.4% of total) sequences were aligned by bowtie2 and generated 27,809,813,130 (53.9% of reads entering aligner) uniquely-aligned reads, and 21,887,340,983 (42.4% of reads entering aligner) multi-aligned reads. Only 3.76% of reads were not aligned. Within the aligned reads, 20,09,897,1743 reads (38.9% of total aligned) were aligned to the mitochondrial genome. In average, MACS2 generated 20,434 ± 12,322 peak calls for each replicate at FDR < 0.01, 28,399 ± 16,917 peak calls at FDR < 0.05 and 35,607 ± 20,750 peak calls at FDR < 0.10. Non-redundant fraction (NRF) of each replicates is 0.881 ± 0.032, PCR Bottleneck Coefficient 1(PBC1) is 0.933±0.030, and PCR Bottleneck Coefficient 2(PBC2) is 19.244 ± 8.295.

Performance of replicates

We prepared 9 ATAC-seq replicate samples from an independent brain sample aliquots. We observed that the normalized peak values of replicates were significantly more correlated within pairs of replicates (mean spearman correlation = 0.65) then between unrelated samples (mean spearman correlation = 0.42) (P=0.0001, Figure S5).

Quantification of open chromatin peaks

Peaks called at FDR of 1% in each sample were merged, quantified, and normalized using the diffBind R package.¹⁰³ Only peaks with overlapping coordinates observed in ≥2 samples were quantified. All reads were extended to 300 bp prior to the quantification process. For replicate samples, we retained the replicate with the highest fraction of total reads overlapping with peaks (and for ties, highest number of peaks detected). Peaks were then merged and quantified again as described without the lower quality replicates and forcing the peak width to be 300bp using the summit option in the dba.count function of the diffBind R package. Samples were normalized using the trimmed mean of M values method (TMM).¹⁰⁴

Ancestry estimation & population stratification

We wished to capture empirical ancestry using Illumina OmniExpressExome SNP data. These were then available as potential covariates for analyses of differential chromatin accessibility and cQTL. We performed principal component analysis (PCA) of LD-pruned SNP array data using EIGENSOFT.¹⁰⁵ From the eigenvalue scree plot, we determined that 5 PCs were sufficient to control for effects of population stratification in the GWA data. As such, 5 PCs were included as covariates in the cQTL analysis.

Evaluation of variables affecting chromatin peaks

For each sample, we recorded a comprehensive set of 206 metadata features that could conceivably capture some aspect of sample quality. These features were collected from the ATAC-seq processing pipeline, RNA-seq processing of aliquots from the same brain regions,²⁸ and genome-wide SNP genotyping.²⁸ For example, the metadata included transposase batch, date processed, date submitted, PCR cycles, mean GC percentage of sequenced reads, numbers of lanes sequenced, mean mapped read length, subject age at death, sex, diagnosis, postmortem interval, antipsychotic use, history of seizures, and RNA quality. We included 10 ancestry-informative PCs from the genome-wide SNP data. We excluded 19 features with high missingness (e.g., time of death, date of death), 24 features that were invariant in all samples (e.g., ATAC-seq library technician, ATAC-seq data processor, brain region, hemisphere), and 40 features with >5% missing values (e.g., hypertension, body mass index, number of weeks without antipsychotics, tobacco use). To prevent potential over-fitting in downstream analysis, we excluded 18 features with >30 levels (e.g., sequencing batch). We then used the R package mice¹⁰⁶ to impute missing values using the classification and regression trees methodology. One feature (RNA-seq expression profiling efficiency) could not be confidently imputed and was excluded. This resulted in a total of 104 metadata variables (65 numeric, 39 categorical) for each of the samples. Five variables were deconvolution results estimating the proportions of major cell types in the brain from the RNA-seq data (neuron, astrocyte, oligodendrocyte, microglia, endothelial).²⁸

Covariate selection

In order to detect covariate for our differential chromatin analysis, we performed linear regression of all meta data variables against the first 20 principal components (PCs) of the TMM normalized peak quantification. In an iterative process, we selected one variable (preferentially a variable directly related to the ATAC-seq experiment, explaining one of the largest proportion of variance and with few parameters), regressed its effect on the peak quantifications and performed a new principal component analysis independent of the selected variable(s). We repeated this procedure until we could not further remove the effect of Bonferroni significant variables. Two meta data variables (sequencing lane and whether the sample was excluded from the RNA-seq isolation step) remained associated with PC4, PC9, PC13 and PC15 and could not be corrected because of collinearity with the variable “date submitted”. We believe that not correcting for these two variables is unlikely to result in false positives as they were among the least different meta data variables between cases and controls (96th and 89th among 100 meta variables tested). In total, we selected the following variables for our differential chromatin analysis: GC (%), date submitted, ChrM aligned (%), NRF, RNA-seq intronic aligned (%), RNA-seq astrocyte (%), RNA-seq oligodendrocyte (%), PC1 genotype, seizures, age of death, post-mortem interval, sex, and diagnosis.

Association between diagnosis and meta data variables

As shown in Figure S6, we observed that many numerical variables were partially correlated within a large cluster of sequencing-related variables (PCR cycles, number of trimmed reads, number of uniquely aligned reads, etc.). To reduce the dimensionality of the data, we applied principal components analysis, and observed that the first principal component largely captured the sequencing-related variables. We also observed that PCs 1, 4, and 9 were significantly associated with diagnosis (Figure S7), indicating that the structure of the metadata was different in several dimensions between cases and controls. To identify the metadata variables associated with case-control status, we evaluated the associations of all metadata variables (104) with case-control status (135 cases, 137 controls). We performed linear regression for numerical variables (65) and χ² tests for categorical variables (38, excluding diagnosis) using R.

As shown in Figure S3, we observed that 5 numerical and 4 categorical metadata variables were associated at a Bonferroni significance level with case-control status. These variables were: postmortem interval, clinical dementia rating, atypical antipsychotic use, sample storage box, RNA-seq quality, perimortem antipsychotic use, RNA-seq intronic rate, RNA-seq exonic rate, and RNA-seq 28S/16S). The four RNA-seq related variables were highly correlated (Figure S8) but not correlated with postmortem interval. The variable most strongly associated with case-control status was postmortem interval (Figure S9.

Association of metadata with ATAC-seq peaks detected

In the analyses above, we did not observe any significant case-control differences in the number of ATAC-seq peaks detected nor in the estimated cell type proportions (from RNA-seq). To identify the variables with an effect on ATAC-seq quality, we performed linear regression for all 100 imputed metadata variables (excluding number of significant ATAC-seq peak calls). We found that 24 variables were associated at a Bonferroni significance level with the number of peaks detected (Figure S10). The numerical variables significantly associated with the number of peaks calls formed highly correlated clusters (Figure S11), indicating that they captured similar technical variability on the number of peak calls.

Although we did not observe any differences between cases and controls in terms of estimated cell type proportions (from RNA-seq data), we observed that the estimated astrocyte and neuron proportions significantly predicted the number of peak calls. Interestingly, the proportion of neurons and astrocytes were highly correlated with RNA quality (RIN) and slightly less with the mean percentage of GC sequenced. This analysis provided a set of important technical variables affecting the number of peak calls. Nevertheless, this analysis is not sufficient for selecting covariate for the differential chromatin analysis as a variable might not have an effect on the number of peaks detected but an effect on the quantification of the peaks. Hence, we selected covariates for our differential chromatin analysis by looking at the effect of imputed variables on the principal components of the matrix of TMM normalized peak quantification (see above).

Chromatin QTL analysis (cQTL)

Association analyses for GWA SNPs and open chromatin peaks were evaluated using fastQTL,¹⁰⁷ which utilizes a ß approximation of permutations to determine significance. Imputed genotype probabilities²⁸ were converted to dosages for input into fastQTL. Peaks were normalized and regressed on SNP dosage in a 5kb window, controlling for 10 PCs from PCA of peaks and 5 ancestry PCs from PCA of SNP array data. Only the most significant SNP for each peak was retained. To control for testing multiple peaks, we applied FDR correction¹⁰⁸ to the ß approximated permutation P-values.

Heterozygous individuals display allele bias in direction of cQTL association

To provide additional characterization of cQTLs, we analyzed ATAC-seq reads from individuals heterozygous for each of the 600 cQTLs that mapped within an ATAC-seq peak. Allele counts from these individuals show a high degree of allele bias that skews from expected 50:50 split (Figure 5A). This is significantly different (Pearson’s Chi-squared test, p-value 2.2e-16) to SNPs that were not identified as cQTLs, which show a higher degree of 50% allele counts (Figure 5B). We also find that the direction of the bias in ATAC-seq reads from heterozygous individuals is largely in the same direction as the effect size of each cQTL (Figure 5C). In other words, the allele that has more read counts in heterozygous individuals is also the same allele that displays the most accessible chromatin in individuals that are homozygous for that allele. This provides further evidence that cQTL variants are likely contributing directly to chromatin accessibility.

cQTL interaction with schizophrenia

272 individuals (135 schizophrenia cases and 137 controls) were utilized for the cQTL interaction analysis. This analysis was performed similarly to the cQTL analysis described above, with the addition of a term for the main effect of diagnosis and a term where the SNP genotype interacted with SCZ diagnosis. 10,000 permutations were performed to estimate significance, and the Storey and Tibshirani correction was applied to the exact P-values.¹⁰⁹

Overlap of eQTLs with cQTLs

To determine the true proportion of cQTLs that were also eQTLs, we performed a separate FDR calculation using only the SNPs that were present in both analyses, with the qvalue package in R. Because there were multiple eQTLs in a gene, we randomly kept one, matched it to the corresponding cQTL, and performed the FDR correction using that subset of cQTL p-values. We also repeated this procedure twice keeping only the most (round 1) or least (round 2) significant eQTL per gene to obtain a range of estimates for the true proportion cQTLs that were also eQTLs (data not shown). Choosing one random eQTL per gene gave an estimate in between these two extreme estimates, as expected. Conversely, we attempted to estimate the true proportion of eQTLs that were also cQTLs. However, all of the nominal eQTL P-values corresponding to a significant cQTL were < 0.10. Because the distribution was truncated, we could not accurately estimate the true proportion.

Differential chromatin analysis

We used DESeq2¹¹⁰ to detect differential chromatin. The primary analysis was for case-control status, but we also evaluated sex, age at death, and postmortem interval. The following variables were included in the model: GC (%), date submitted, ChrM aligned (%), NRF, RNA-seq Intronic aligned (%), RNA-seq astrocyte (%), RNA-seq oligodendrocyte (%), PC1 genotype, seizures, Age of death, post-mortem interval, sex and diagnosis. All 288 samples were used for this analysis as this increases power to correctly estimate the parameters of the model. The case-control difference was performed between control individuals (N=135) and individuals with schizophrenia (N=137). Although we corrected for sex in all our analysis, we observed that peaks located on the sex chromosomes were more often called significant than peaks located on autosomes. We performed sex stratified analysis for case/control difference in open chromatin and did not observe that peaks on sex chromosomes were enriched in low P-values compared to other chromosomes. Therefore, we believe that the initial enrichment in significant hits for all analysis (case/control, sex, PMI and age) on the X and Y chromosomes were likely false positives. In order to prevent potential bias due to the sex chromosomes, we meta-analyzed using an inverse variance weighted approach¹¹¹ for our sex-stratified differential chromatin analysis on chrX and only used P-value obtained in male for chrY. For the remaining chromosomes, the full model was used.

Heritability enrichment analysis

We used LD score regression to estimate heritability enrichment in our ATAC-seq peak calls.⁵⁰ We added SNPs tested in the PGC schizophrenia meta-analysis⁸ and located in our ATAC-seq peak to the “baseline” model which consists of 53 categories representing different genomic annotations (TSS, promoter, enhancer, CTCF binding sites, etc.). In addition, we added SNPs located in a 500bp window (250 bp upstream and 250 downstream) on both side of our ATAC-seq peak calls as an extra annotation to prevent upward bias in the heritability enrichment. Furthermore, we added all SNPs that were present both in the ATAC-seq peak calls and in the conserved regions as an extra annotation. To compare the association of schizophrenia across different tissues and samples, we only added SNPs falling into the respective open chromatin region to the “baseline” model and used the coefficient z-score as a measure of association of the annotation and schizophrenia, as recommended.⁵⁰

Amount of coverage from and overlap between all DNase-seq/ATAC-seq data

For each DNase-seq and ATAC-seq dataset from 125 tissues, we calculated the total number of bases covered (Figure S12) and the average Jaccard index (an indicator of overall similarity of datasets, Figure S13. Jaccard index indicates that ATAC-seq data is most similar to ATAC-seq data from sorted neuronal (NeuN+ sorted) and glial (NeuN-) cells (Figure S14).

Motif enrichment with MEME-chip

We intersected conserved regions⁵² with our ATAC-seq peaks. As MEME-chip requires all input sequences to have the same length, we set the width of each intersected region to 32bp (16bp upstream and downstream of the center of the intersected region, corresponding to the mean size of the intersected regions). We obtained hg19 sequence corresponding to these regions using bedtools. We then used MEME-chip¹¹² (URL) to look for motif enrichment using default settings;; and used HOMER¹¹³ (v4.9) with intersected conserved regions with our ATACseq peaks as input and randomly chosen 1000,000 conserved regions as background to do motif search.

Data sharing

These ATAC-seq data are available from Sage Bionetworks-Synapse website via the psychENCODE Knowledge Portal (URLS).

Genome build

All genome coordinates are GRCh37 / hg19.

Acknowledgements

This project was funded by the US NIMH (R01 MH105472, PI Crawford, co-PI Sullivan) and NIGMS (P30GM103328, PI Stockmeier). PFS was supported by the Swedish Research Council (Vetenskapsrådet, award D0886501). JB was supported by the Swiss National Science Foundation. We are deeply indebted to Vahram Haroutunian, Pamela Sklar, and the Common Mind Consortium for providing brain samples. We also thank the NICHD tissue bank (Baltimore, MD) for additional postmortem human tissues, and the NIH NeuroBioBank for fetal brain samples. This work is part of the PsychENCODE Consortium, which was funded by US NIH grants: U01MH103339, U01MH103365, U01MH103392, U01MH103340, U01MH103346, R01MH105472, R01MH094714, R01MH105898, R21MH102791, R21MH105881, R21MH103877, and P50MH106934 awarded to: Schahram Akbarian (Icahn School of Medicine at Mount Sinai), Gregory Crawford (Duke), Stella Dracheva (Icahn School of Medicine at Mount Sinai), Peggy Farnham (USC), Mark Gerstein (Yale), Daniel Geschwind (UCLA), Thomas M. Hyde (LIBD), Andrew Jaffe (LIBD), James A. Knowles (USC), Chunyu Liu (UIC), Dalila Pinto (Icahn School of Medicine at Mount Sinai), Nenad Sestan (Yale), Pamela Sklar (Icahn School of Medicine at Mount Sinai), Matthew State (UCSF), Patrick Sullivan (UNC), Flora Vaccarino (Yale), Sherman Weissman (Yale), Kevin White (U Chicago) and Peter Zandi (JHU). We thank the Duke Sequencing and Genomic Technologies Core facility for sequencing the ATAC-seq libraries.

References

1.↵
Sullivan, P.F. et al. Psychiatric Genomics: An Update and an Agenda. (Submitted).
2.↵
Lichtenstein, P. et al. Recurrence risks for schizophrenia in a Swedish national cohort. Psychological Medicine 36, 1417–1426 (2006).
OpenUrl CrossRef PubMed Web of Science
3.
Lichtenstein, P. et al. Common genetic determinants of schizophrenia and bipolar disorder in Swedish families: a population-based study. Lancet 373, 234–239 (2009).
OpenUrl CrossRef PubMed Web of Science
4.
Sullivan, P.F., Kendler, K.S. & Neale, M.C. Schizophrenia as a complex trait: evidence from a meta-analysis of twin studies. Archives of General Psychiatry 60, 1187–1192 (2003).
OpenUrl CrossRef PubMed Web of Science
5.
Sullivan, P.F. Puzzling over schizophrenia: schizophrenia as a pathway disease. Nature Medicine 18, 210–211 (2012).
OpenUrl CrossRef PubMed
6.↵
Wray, N.R. & Gottesman, II Using summary data from the Danish national registers to estimate heritabilities for schizophrenia, bipolar disorder, and major depressive disorder. Front Genet 3, 118 (2012).
OpenUrl PubMed
7.↵
Purcell, S.M. et al. A polygenic burden of rare disruptive mutations in schizophrenia. Nature 506, 185–190 (2014).
OpenUrl CrossRef PubMed Web of Science
8.↵
Schizophrenia Working Group of the Psychiatric Genomics Consortium Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427 (2014).
OpenUrl CrossRef PubMed Web of Science
9.↵
Pardiñas, A.F. et al. Common schizophrenia alleles are enriched in mutation-intolerant genes and maintained by background selection. http://biorxiv.org/content/early/2016/08/09/068593.abstract (Submitted).
10.↵
Goldstein, D.B. Common genetic variation and human traits. N Engl J Med 360, 1696–1698 (2009).
OpenUrl CrossRef PubMed Web of Science
11.↵
McClellan, J. & King, M.C. Genetic heterogeneity in human disease. Cell 141, 210–217 (2010).
OpenUrl CrossRef PubMed Web of Science
12.↵
Need, A.C. et al. Exome sequencing followed by large--scale genotyping suggests a limited role for moderately rare risk factors of strong effect in schizophrenia. American Journal of Human Genetics (2012).
13.
Fromer, M. et al. De novo mutations in schizophrenia implicate synaptic networks. Nature 506, 179–184 (2014).
OpenUrl CrossRef PubMed Web of Science
14.↵
Genovese, G. et al. Increased burden of ultra-rare protein-altering variants among 4,877 individuals with schizophrenia. Nature Neuroscience (2016).
15.↵
Zuk, O. et al. Searching for missing heritability: Designing rare variant association studies. Proceedings of the National Academy of Sciences of the United States of America (2014).
16.↵
Singh, T. et al. Rare loss-of-function variants in SETD1A are associated with schizophrenia and developmental disorders. Nat Neurosci 19, 571–577 (2016).
OpenUrl CrossRef PubMed
17.↵
Rujescu, D. et al. Disruption of the neurexin 1 gene is associated with schizophrenia. Hum Mol Genet 18, 988–996 (2009).
OpenUrl CrossRef PubMed Web of Science
18.
Sekar, A. et al. Schizophrenia risk from complex variation of complement component 4. Nature 530, 177–183 (2016).
OpenUrl CrossRef PubMed
19.↵
CNV Working Group of the Psychiatric Genomics Consortium & Schizophrenia Working Groups of the Psychiatric Genomics Consortium Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects. Nat Genet 49, 27–35 (2016).
OpenUrl
20.↵
Ernst, J. et al. Mapping and analysis of chromatin state dynamics in nine human cell types. Nature 473, 43–49 (2011).
OpenUrl CrossRef PubMed Web of Science
21.
Hindorff, L.A. et al. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proceedings of the National Academy of Sciences of the United States of America 106, 9362–9367 (2009).
OpenUrl Abstract/FREE Full Text
22.
Maurano, M.T. et al. Systematic localization of common disease-associated variation in regulatory DNA. Science (New York, N.Y.) 337, 1190–1195 (2012).
OpenUrl Abstract/FREE Full Text
23.
Reddy, T.E. et al. Effects of sequence variation on differential allelic transcription factor occupancy and gene expression. Genome Research 22, 860–869 (2012).
OpenUrl Abstract/FREE Full Text
24.
Sethupathy, P. & Collins, F.S. MicroRNA target site polymorphisms and human disease. Trends in Genetics 24, 489–497 (2008).
OpenUrl CrossRef PubMed Web of Science
25.
Fromer, M. et al. De novo mutations in schizophrenia implicate synaptic networks. Nature (2014).
26.↵
Purcell, S.M. et al. A polygenic burden of rare disruptive mutations in schizophrenia. Nature (2014).
27.↵
Richards, A.L. et al. Schizophrenia susceptibility alleles are enriched for alleles that affect gene expression in adult human brain. Mol Psychiatry 17, 193–201 (2012).
OpenUrl CrossRef PubMed Web of Science
28.↵
Fromer, M. et al. Gene expression elucidates functional impact of polygenic risk for schizophrenia. Nature Neuroscience 19, 1442–1453 (2016).
OpenUrl CrossRef PubMed
29.
Won, H. et al. Chromosome conformation elucidates regulatory relationships in developing human brain. Nature 538, 523–527 (2016).
OpenUrl CrossRef PubMed
30.↵
Roussos, P. et al. A role for noncoding variation in schizophrenia. Cell Rep 9, 1417–1429 (2014).
OpenUrl CrossRef PubMed Web of Science
31.↵
Encode Project Consortium A user’s guide to the encyclopedia of DNA elements (ENCODE). PLoS biology 9, e1001046 (2011).
OpenUrl CrossRef PubMed
32.↵
Roadmap Epigenomics Consortium et al. Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330 (2015).
OpenUrl CrossRef PubMed
33.↵
Akbarian, S. et al. The PsychENCODE project. Nat Neurosci 18, 1707–1712 (2015).
OpenUrl CrossRef PubMed
34.↵
Petronis, A. Epigenetics as a unifying principle in the aetiology of complex traits and diseases. Nature 465, 721–727 (2010).
OpenUrl CrossRef PubMed Web of Science
35.↵
Gavin, D.P. & Floreani, C. Epigenetics of schizophrenia: an open and shut case. Int Rev Neurobiol 115, 155–201 (2014).
OpenUrl CrossRef
36.↵
Åberg, K.A. et al. Methylome-wide association study of schizophrenia: identifying blood biomarker signatures of environmental insults. JAMA psychiatry 71, 255–264 (2014).
OpenUrl
37.
Hannon, E. et al. An integrated genetic-epigenetic analysis of schizophrenia: evidence for co-localization of genetic associations and differential DNA methylation. Genome Biol 17, 176 (2016).
OpenUrl CrossRef
38.↵
Montano, C. et al. Association of DNA Methylation Differences With Schizophrenia in an Epigenome-Wide Association Study. JAMA psychiatry 73, 506–514 (2016).
OpenUrl
39.↵
Jaffe, A.E. et al. Mapping DNA methylation across development, genotype and schizophrenia in the human frontal cortex. Nat Neurosci 19, 40–47 (2016).
OpenUrl CrossRef PubMed
40.↵
Berchowitz, L.E., Hanlon, S.E., Lieb, J.D. & Copenhaver, G.P. A positive but complex association between meiotic double-strand break hotspots and open chromatin in Saccharomyces cerevisiae. Genome Research 19, 2245–2257 (2009).
OpenUrl Abstract/FREE Full Text
41.
Cockerill, P.N. Structure and function of active chromatin and DNase I hypersensitive sites. Febs Journal 278, 2182–2210 (2011).
OpenUrl
42.
Gross, D.S. & Garrard, W.T. Nuclease hypersensitive sites in chromatin. Annu Rev Biochem 57, 159–197 (1988).
OpenUrl CrossRef PubMed Web of Science
43.↵
Song, L. et al. Open chromatin defined by DNaseI and FAIRE identifies regulatory elements that shape cell-type identity. Genome Research 21, 1757–1767 (2011).
OpenUrl Abstract/FREE Full Text
44.
McDaniell, R. et al. Heritable individual-specific and allele-specific chromatin signatures in humans. Science (New York, N.Y.) 328, 235–239 (2010).
OpenUrl Abstract/FREE Full Text
45.↵
Thurman, R.E. et al. The accessible chromatin landscape of the human genome. Nature 489, 75–82 (2012).
OpenUrl CrossRef PubMed Web of Science
46.↵
Fullard, J.F. et al. Cell-type specific open chromatin profiling in human postmortem brain infers functional roles for non-coding schizophrenia loci. (Submitted).
47.↵
Buenrostro, J.D., Giresi, P.G., Zaba, L.C., Chang, H.Y. & Greenleaf, W.J. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat Methods 10, 1213–1218 (2013).
OpenUrl CrossRef PubMed Web of Science
48.↵
Song, L. et al. Open chromatin defined by DNaseI and FAIRE identifies regulatory elements that shape cell-type identity. Genome Res 21, 1757–1767 (2011).
OpenUrl Abstract/FREE Full Text
49.↵
Boyle, A.P. et al. High-resolution mapping and characterization of open chromatin across the genome. Cell 132, 311–322 (2008).
OpenUrl CrossRef PubMed Web of Science
50.↵
Finucane, H.K. et al. Partitioning heritability by functional category using GWAS summary statistics. Nature Genetics 47, 1228–1235 (2015).
OpenUrl CrossRef PubMed
51.↵
Gusev, A. et al. Partitioning heritability of regulatory and cel-type-specific variants across 11 common diseases. American Journal of Human Genetics 95, 535–552 (2014).
OpenUrl CrossRef PubMed
52.↵
Lindblad-Toh, K. et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature 478, 476–482 (2011).
OpenUrl CrossRef PubMed Web of Science
53.↵
Rao, S.S. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680 (2014).
OpenUrl CrossRef PubMed Web of Science
54.↵
Duggan, A. et al. Transient expression of the conserved zinc finger gene INSM1 in progenitors and nascent neurons throughout embryonic and adult neurogenesis. J Comp Neurol 507, 1497–1520 (2008).
OpenUrl CrossRef PubMed Web of Science
55.↵
Gaspar, H.A. & Breen, G. Pathways analyses of schizophrenia GWAS focusing on known and novel drug targets. (Submitted).
56.↵
Degner, J.F. et al. DNase I sensitivity QTLs are a major determinant of human expression variation. Nature 482, 390–394 (2012).
OpenUrl CrossRef PubMed Web of Science
57.↵
Skene, N.G. et al. Brain cell types and the genetic basis of schizophrenia. (Submitted).
58.↵
Frank, C.L. et al. Regulation of chromatin accessibility and Zic binding at enhancers in the developing cerebellum. Nature neuroscience 18, 647–656 (2015).
OpenUrl CrossRef PubMed
59.↵
Lupianez, D.G. et al. Disruptions of topological chromatin domains cause pathogenic rewiring of gene-enhancer interactions. Cell 161, 1012–1025 (2015).
OpenUrl CrossRef PubMed
60.↵
Vockley, C.M. et al. Direct GR Binding Sites Potentiate Clusters of TF Binding across the Human Genome. Cell 166, 1269–1281 e1219 (2016).
OpenUrl CrossRef PubMed
61.↵
Klann, T.S. et al. CRISPR-Cas9 epigenome editing enables high-throughput screening for functional regulatory elements in the human genome. Nature biotechnology (2017).
62.↵
Purves, D. et al. (eds.) Neuroscience, Edn. 2nd. (Sinauer Associates, Sunderland, MA; 2001).
63.↵
Cheung, I. et al. Developmental regulation and individual differences of neuronal H3K4me3 epigenomes in the prefrontal cortex. Proceedings of the National Academy of Sciences of the United States of America 107, 8824–8829 (2010).
OpenUrl Abstract/FREE Full Text
64.
Dammer, E.B. et al. Neuron enriched nuclear proteome isolated from human brain. Journal of proteome research 12, 3193–3206 (2013).
OpenUrl CrossRef PubMed
65.
Jiang, Y., Matevossian, A., Huang, H.S., Straubhaar, J. & Akbarian, S. Isolation of neuronal chromatin from brain tissue. BMC neuroscience 9, 42 (2008).
OpenUrl CrossRef PubMed
66.
Lister, R. et al. Global epigenomic reconfiguration during mammalian brain development. Science (New York, N.Y.) 341, 1237905 (2013).
OpenUrl Abstract/FREE Full Text
67.↵
Matevossian, A. & Akbarian, S. Neuronal nuclei isolation from human postmortem brain tissue. Journal of visualized experiments : JoVE (2008).
68.↵
Mouse Encode Consortium et al. An encyclopedia of mouse DNA elements (Mouse ENCODE). Genome Biol 13, 418 (2012).
OpenUrl CrossRef PubMed
69.↵
Buenrostro, J.D. et al. Single-cell chromatin accessibility reveals principles of regulatory variation. Nature 523, 486–490 (2015).
OpenUrl CrossRef PubMed
70.↵
Harrison, P.J. The neuropathology of schizophrenia. A critical review of the data and their interpretation. Brain 122 (Pt 4), 593–624 (1999).
OpenUrl CrossRef PubMed Web of Science
71.↵
Barbalat, G., Chambon, V., Franck, N., Koechlin, E. & Farrer, C. Organization of cognitive control within the lateral prefrontal cortex in schizophrenia. Archives of general psychiatry 66, 377–386 (2009).
OpenUrl CrossRef PubMed Web of Science
72.
Benedetti, F. et al. Functional and structural brain correlates of theory of mind and empathy deficits in schizophrenia. Schizophrenia research 114, 154–160 (2009).
OpenUrl CrossRef PubMed Web of Science
73.
Blot, K., Bai, J. & Otani, S. The effect of non-competitive NMDA receptor antagonist MK-801 on neuronal activity in rodent prefrontal cortex: an animal model for cognitive symptoms of schizophrenia. Journal of physiology, Paris (2013).
74.
Blumberg, H.P. et al. Amygdala and hippocampal volumes in adolescents and adults with bipolar disorder. Archives of general psychiatry 60, 1201–1208 (2003).
OpenUrl CrossRef PubMed Web of Science
75.
Brassen, S., Kalisch, R., Weber-Fahr, W., Braus, D.F. & Buchel, C. Ventromedial prefrontal cortex processing during emotional evaluation in late-life depression: a longitudinal functional magnetic resonance imaging study. Biological psychiatry 64, 349–355 (2008).
OpenUrl CrossRef PubMed Web of Science
76.
Chen, J. et al. Comparative study of regional homogeneity in schizophrenia and major depressive disorder. American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of 162, 36–43 (2013).
OpenUrl
77.
Cheung, C. et al. White matter fractional anisotrophy differences and correlates of diagnostic symptoms in autism. Journal of child psychology and psychiatry, and allied disciplines 50, 1102–1112 (2009).
OpenUrl CrossRef PubMed Web of Science
78.
Douaud, G. et al. Anatomically related grey and white matter abnormalities in adolescent-onset schizophrenia. Brain 130, 2375–2386 (2007).
OpenUrl CrossRef PubMed Web of Science
79.
Doyle-Thomas, K.A. et al. The effect of diagnosis, age, and symptom severity on cortical surface area in the cingulate cortex and insula in autism spectrum disorders. Journal of child neurology 28, 729–736 (2013).
OpenUrl
80.
Ecker, C. et al. Brain anatomy and its relationship to behavior in adults with autism spectrum disorder: a multicenter magnetic resonance imaging study. Archives of general psychiatry 69, 195–209 (2012).
OpenUrl CrossRef PubMed Web of Science
81.
Ellison-Wright, I. & Bullmore, E. Anatomy of bipolar disorder and schizophrenia: a meta-analysis. Schizophrenia research 117, 1–12 (2010).
OpenUrl CrossRef PubMed Web of Science
82.
Greimel, E. et al. Changes in grey matter development in autism spectrum disorder. Brain structure & function (2012).
83.
Honea, R., Crow, T.J., Passingham, D. & Mackay, C.E. Regional deficits in brain volume in schizophrenia: a meta-analysis of voxel-based morphometry studies. The American journal of psychiatry 162, 2233–2245 (2005).
OpenUrl CrossRef PubMed Web of Science
84.
McAlonan, G.M. et al. Mapping the brain in autism. A voxel-based MRI study of volumetric differences and intercorrelations in autism. Brain 128, 268–276 (2005).
OpenUrl CrossRef PubMed Web of Science
85.
McDonald, C. et al. Association of genetic risks for schizophrenia and bipolar disorder with specific and generic brain structural endophenotypes. Archives of general psychiatry 61, 974–984 (2004).
OpenUrl CrossRef PubMed Web of Science
86.
Myers-Schulz, B. & Koenigs, M. Functional anatomy of ventromedial prefrontal cortex: implications for mood and anxiety disorders. Mol Psychiatry 17, 132–141 (2012).
OpenUrl CrossRef PubMed Web of Science
87.
Neuhaus, E., Beauchaine, T.P. & Bernier, R. Neurobiological correlates of social functioning in autism. Clinical psychology review 30, 733–748 (2010).
OpenUrl CrossRef PubMed
88.
Ong, D. et al. Size and shape of the caudate nucleus in individuals with bipolar affective disorder. The Australian and New Zealand journal of psychiatry 46, 340–351 (2012).
OpenUrl CrossRef PubMed
89.
Rogers, T.D. et al. Is autism a disease of the cerebellum? An integration of clinical and pre-clinical research. Frontiers in systems neuroscience 7, 15 (2013).
OpenUrl
90.
Rossi, R. et al. Structural brain features of borderline personality and bipolar disorders. Psychiatry research (2012).
91.
Savitz, J. et al. Inflammation and neurological disease-related genes are differentially expressed in depressed patients with mood disorders and correlate with morphometric and functional imaging abnormalities. Brain, behavior, and immunity 31, 161–171 (2013).
OpenUrl CrossRef PubMed
92.
Schultz, R.T. Developmental deficits in social perception in autism: the role of the amygdala and fusiform face area. International journal of developmental neuroscience : the official journal of the International Society for Developmental Neuroscience 23, 125–141 (2005).
OpenUrl CrossRef PubMed Web of Science
93.
Sepede, G. et al. Impaired sustained attention in euthymic bipolar disorder patients and non-affected relatives: an fMRI study. Bipolar disorders 14, 764–779 (2012).
OpenUrl PubMed
94.↵
Victor, T.A., Furey, M.L., Fromm, S.J., Ohman, A. & Drevets, W.C. Relationship between amygdala responses to masked faces and mood state and treatment in major depressive disorder. Archives of general psychiatry 67, 1128–1138 (2010).
OpenUrl CrossRef PubMed Web of Science
95.↵
American Psychiatric Association Diagnostic and Statistical Manual of Mental Disorders, Edn. Fourth Edition. (American Psychiatric Association, Washington, DC; 1994).
96.↵
Van der Auwera, G.A. et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr Protoc Bioinformatics 11, 11 10 11–11 10 33 (2013).
OpenUrl
97.↵
Chang, C.C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
OpenUrl CrossRef PubMed
98.↵
Martin, M. Cutadapt Removes Adapter Sequences From High-Throughput Sequencing Reads. EMBnet.journal 17.
99.↵
Langmead, B. & Salzberg, S.L. Fast gapped-read alignment with Bowtie 2. Nat Methods 9, 357–359 (2012).
OpenUrl CrossRef PubMed Web of Science
100.↵
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
OpenUrl CrossRef PubMed Web of Science
101.↵
Quinlan, A.R. BEDTools: The Swiss-Army Tool for Genome Feature Analysis. Curr Protoc Bioinformatics 47, 11 12 11–34 (2014).
OpenUrl CrossRef PubMed
102.↵
Speir, M.L. et al. The UCSC Genome Browser database: 2016 update. Nucleic Acids Res 44, D717–725 (2016).
OpenUrl CrossRef PubMed
103.↵
Ross-Innes, C.S. et al. Differential oestrogen receptor binding is associated with clinical outcome in breast cancer. Nature 481, 389–393 (2012).
OpenUrl CrossRef PubMed Web of Science
104.↵
Robinson, M.D. & Oshlack, A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol 11, R25 (2010).
OpenUrl CrossRef PubMed
105.↵
Patterson, N., Price, A.L. & Reich, D. Population structure and eigenanalysis. PLoS Genet 22, e190 (2006).
OpenUrl
106.↵
van Buuren, S. & Groothuis-Oudshoom, K. mice: Multivariate Imputation by Chained Equations in R. Journal of Statistical Software 45, 1–67 (2011).
OpenUrl
107.↵
Ongen, H., Buil, A., Brown, A.A., Dermitzakis, E.T. & Delaneau, O. Fast and efficient QTL mapper for thousands of molecular phenotypes. Bioinformatics 32, 1479–1485 (2016).
OpenUrl CrossRef PubMed
108.↵
Storey, J.D. & Tibshirani, R. Estimating false discovery rates under dependence, with applications to DNA microarrays (Technical Report 2001-28). (Department of Statistics, Stanford University, Palo Alto, CA; 2001).
109.↵
Storey, J.D. & Tibshirani, R. Statistical methods for identifying differentially expressed genes in DNA microarrays. Methods Mol Biol 224, 149–157 (2003).
OpenUrl CrossRef PubMed
110.↵
Love, M.I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15, 550 (2014).
OpenUrl CrossRef PubMed
111.↵
de Bakker, P.I. et al. Practical aspects of imputation-driven meta-analysis of genome-wide association studies. Hum Mol Genet 17, R122–128 (2008).
OpenUrl CrossRef PubMed Web of Science
112.↵
Ma, W., Noble, W.S. & Bailey, T.L. Motif-based analysis of large nucleotide data sets using MEME-ChIP. Nat Protoc 9, 1428–1450 (2014).
OpenUrl CrossRef PubMed
113.↵
Heinz, S. et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell 38, 576–589 (2010).
OpenUrl CrossRef PubMed Web of Science

View the discussion thread.

Posted May 25, 2017.

Download PDF

Citation Tools

Subject Area

Genomics

Subject Areas

All Articles

Animal Behavior and Cognition (5200)
Biochemistry (11703)
Bioengineering (8718)
Bioinformatics (29127)
Biophysics (14930)
Cancer Biology (12048)
Cell Biology (17353)
Clinical Trials (138)
Developmental Biology (9406)
Ecology (14143)
Epidemiology (2067)
Evolutionary Biology (18266)
Genetics (12219)
Genomics (16765)
Immunology (11841)
Microbiology (28003)
Molecular Biology (11551)
Neuroscience (60804)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3229)
Physiology (4939)
Plant Biology (10383)
Scientific Communication and Education (1679)
Synthetic Biology (2877)
Systems Biology (7333)
Zoology (1642)

[1] 1.↵
Sullivan, P.F. et al. Psychiatric Genomics: An Update and an Agenda. (Submitted).

[2] 2.↵
Lichtenstein, P. et al. Recurrence risks for schizophrenia in a Swedish national cohort. Psychological Medicine 36, 1417–1426 (2006).
OpenUrl CrossRef PubMed Web of Science

[3] 3.
Lichtenstein, P. et al. Common genetic determinants of schizophrenia and bipolar disorder in Swedish families: a population-based study. Lancet 373, 234–239 (2009).
OpenUrl CrossRef PubMed Web of Science

[4] 4.
Sullivan, P.F., Kendler, K.S. & Neale, M.C. Schizophrenia as a complex trait: evidence from a meta-analysis of twin studies. Archives of General Psychiatry 60, 1187–1192 (2003).
OpenUrl CrossRef PubMed Web of Science

[5] 5.
Sullivan, P.F. Puzzling over schizophrenia: schizophrenia as a pathway disease. Nature Medicine 18, 210–211 (2012).
OpenUrl CrossRef PubMed

[6] 6.↵
Wray, N.R. & Gottesman, II Using summary data from the Danish national registers to estimate heritabilities for schizophrenia, bipolar disorder, and major depressive disorder. Front Genet 3, 118 (2012).
OpenUrl PubMed

[7] 7.↵
Purcell, S.M. et al. A polygenic burden of rare disruptive mutations in schizophrenia. Nature 506, 185–190 (2014).
OpenUrl CrossRef PubMed Web of Science

[8] 8.↵
Schizophrenia Working Group of the Psychiatric Genomics Consortium Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427 (2014).
OpenUrl CrossRef PubMed Web of Science

[9] 9.↵
Pardiñas, A.F. et al. Common schizophrenia alleles are enriched in mutation-intolerant genes and maintained by background selection. http://biorxiv.org/content/early/2016/08/09/068593.abstract (Submitted).

[10] 10.↵
Goldstein, D.B. Common genetic variation and human traits. N Engl J Med 360, 1696–1698 (2009).
OpenUrl CrossRef PubMed Web of Science

[11] 11.↵
McClellan, J. & King, M.C. Genetic heterogeneity in human disease. Cell 141, 210–217 (2010).
OpenUrl CrossRef PubMed Web of Science

[12] 12.↵
Need, A.C. et al. Exome sequencing followed by large--scale genotyping suggests a limited role for moderately rare risk factors of strong effect in schizophrenia. American Journal of Human Genetics (2012).

[13] 13.
Fromer, M. et al. De novo mutations in schizophrenia implicate synaptic networks. Nature 506, 179–184 (2014).
OpenUrl CrossRef PubMed Web of Science

[14] 14.↵
Genovese, G. et al. Increased burden of ultra-rare protein-altering variants among 4,877 individuals with schizophrenia. Nature Neuroscience (2016).

[15] 15.↵
Zuk, O. et al. Searching for missing heritability: Designing rare variant association studies. Proceedings of the National Academy of Sciences of the United States of America (2014).

[16] 16.↵
Singh, T. et al. Rare loss-of-function variants in SETD1A are associated with schizophrenia and developmental disorders. Nat Neurosci 19, 571–577 (2016).
OpenUrl CrossRef PubMed

[17] 17.↵
Rujescu, D. et al. Disruption of the neurexin 1 gene is associated with schizophrenia. Hum Mol Genet 18, 988–996 (2009).
OpenUrl CrossRef PubMed Web of Science

[18] 18.
Sekar, A. et al. Schizophrenia risk from complex variation of complement component 4. Nature 530, 177–183 (2016).
OpenUrl CrossRef PubMed

[19] 19.↵
CNV Working Group of the Psychiatric Genomics Consortium & Schizophrenia Working Groups of the Psychiatric Genomics Consortium Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects. Nat Genet 49, 27–35 (2016).
OpenUrl

[20] 20.↵
Ernst, J. et al. Mapping and analysis of chromatin state dynamics in nine human cell types. Nature 473, 43–49 (2011).
OpenUrl CrossRef PubMed Web of Science

[21] 21.
Hindorff, L.A. et al. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proceedings of the National Academy of Sciences of the United States of America 106, 9362–9367 (2009).
OpenUrl Abstract/FREE Full Text

[22] 22.
Maurano, M.T. et al. Systematic localization of common disease-associated variation in regulatory DNA. Science (New York, N.Y.) 337, 1190–1195 (2012).
OpenUrl Abstract/FREE Full Text

[23] 23.
Reddy, T.E. et al. Effects of sequence variation on differential allelic transcription factor occupancy and gene expression. Genome Research 22, 860–869 (2012).
OpenUrl Abstract/FREE Full Text

[24] 24.
Sethupathy, P. & Collins, F.S. MicroRNA target site polymorphisms and human disease. Trends in Genetics 24, 489–497 (2008).
OpenUrl CrossRef PubMed Web of Science

[25] 25.
Fromer, M. et al. De novo mutations in schizophrenia implicate synaptic networks. Nature (2014).

[26] 26.↵
Purcell, S.M. et al. A polygenic burden of rare disruptive mutations in schizophrenia. Nature (2014).

[27] 27.↵
Richards, A.L. et al. Schizophrenia susceptibility alleles are enriched for alleles that affect gene expression in adult human brain. Mol Psychiatry 17, 193–201 (2012).
OpenUrl CrossRef PubMed Web of Science

[28] 28.↵
Fromer, M. et al. Gene expression elucidates functional impact of polygenic risk for schizophrenia. Nature Neuroscience 19, 1442–1453 (2016).
OpenUrl CrossRef PubMed

[29] 29.
Won, H. et al. Chromosome conformation elucidates regulatory relationships in developing human brain. Nature 538, 523–527 (2016).
OpenUrl CrossRef PubMed

[30] 30.↵
Roussos, P. et al. A role for noncoding variation in schizophrenia. Cell Rep 9, 1417–1429 (2014).
OpenUrl CrossRef PubMed Web of Science

[31] 31.↵
Encode Project Consortium A user’s guide to the encyclopedia of DNA elements (ENCODE). PLoS biology 9, e1001046 (2011).
OpenUrl CrossRef PubMed

[32] 32.↵
Roadmap Epigenomics Consortium et al. Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330 (2015).
OpenUrl CrossRef PubMed

[33] 33.↵
Akbarian, S. et al. The PsychENCODE project. Nat Neurosci 18, 1707–1712 (2015).
OpenUrl CrossRef PubMed

[34] 34.↵
Petronis, A. Epigenetics as a unifying principle in the aetiology of complex traits and diseases. Nature 465, 721–727 (2010).
OpenUrl CrossRef PubMed Web of Science

[35] 35.↵
Gavin, D.P. & Floreani, C. Epigenetics of schizophrenia: an open and shut case. Int Rev Neurobiol 115, 155–201 (2014).
OpenUrl CrossRef

[36] 36.↵
Åberg, K.A. et al. Methylome-wide association study of schizophrenia: identifying blood biomarker signatures of environmental insults. JAMA psychiatry 71, 255–264 (2014).
OpenUrl

[37] 37.
Hannon, E. et al. An integrated genetic-epigenetic analysis of schizophrenia: evidence for co-localization of genetic associations and differential DNA methylation. Genome Biol 17, 176 (2016).
OpenUrl CrossRef

[38] 38.↵
Montano, C. et al. Association of DNA Methylation Differences With Schizophrenia in an Epigenome-Wide Association Study. JAMA psychiatry 73, 506–514 (2016).
OpenUrl

[39] 39.↵
Jaffe, A.E. et al. Mapping DNA methylation across development, genotype and schizophrenia in the human frontal cortex. Nat Neurosci 19, 40–47 (2016).
OpenUrl CrossRef PubMed

[40] 40.↵
Berchowitz, L.E., Hanlon, S.E., Lieb, J.D. & Copenhaver, G.P. A positive but complex association between meiotic double-strand break hotspots and open chromatin in Saccharomyces cerevisiae. Genome Research 19, 2245–2257 (2009).
OpenUrl Abstract/FREE Full Text

[41] 41.
Cockerill, P.N. Structure and function of active chromatin and DNase I hypersensitive sites. Febs Journal 278, 2182–2210 (2011).
OpenUrl

[42] 42.
Gross, D.S. & Garrard, W.T. Nuclease hypersensitive sites in chromatin. Annu Rev Biochem 57, 159–197 (1988).
OpenUrl CrossRef PubMed Web of Science

[43] 43.↵
Song, L. et al. Open chromatin defined by DNaseI and FAIRE identifies regulatory elements that shape cell-type identity. Genome Research 21, 1757–1767 (2011).
OpenUrl Abstract/FREE Full Text

[44] 44.
McDaniell, R. et al. Heritable individual-specific and allele-specific chromatin signatures in humans. Science (New York, N.Y.) 328, 235–239 (2010).
OpenUrl Abstract/FREE Full Text

[45] 45.↵
Thurman, R.E. et al. The accessible chromatin landscape of the human genome. Nature 489, 75–82 (2012).
OpenUrl CrossRef PubMed Web of Science

[46] 46.↵
Fullard, J.F. et al. Cell-type specific open chromatin profiling in human postmortem brain infers functional roles for non-coding schizophrenia loci. (Submitted).

[47] 47.↵
Buenrostro, J.D., Giresi, P.G., Zaba, L.C., Chang, H.Y. & Greenleaf, W.J. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat Methods 10, 1213–1218 (2013).
OpenUrl CrossRef PubMed Web of Science

[48] 48.↵
Song, L. et al. Open chromatin defined by DNaseI and FAIRE identifies regulatory elements that shape cell-type identity. Genome Res 21, 1757–1767 (2011).
OpenUrl Abstract/FREE Full Text

[49] 49.↵
Boyle, A.P. et al. High-resolution mapping and characterization of open chromatin across the genome. Cell 132, 311–322 (2008).
OpenUrl CrossRef PubMed Web of Science

[50] 50.↵
Finucane, H.K. et al. Partitioning heritability by functional category using GWAS summary statistics. Nature Genetics 47, 1228–1235 (2015).
OpenUrl CrossRef PubMed

[51] 51.↵
Gusev, A. et al. Partitioning heritability of regulatory and cel-type-specific variants across 11 common diseases. American Journal of Human Genetics 95, 535–552 (2014).
OpenUrl CrossRef PubMed

[52] 52.↵
Lindblad-Toh, K. et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature 478, 476–482 (2011).
OpenUrl CrossRef PubMed Web of Science

[53] 53.↵
Rao, S.S. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680 (2014).
OpenUrl CrossRef PubMed Web of Science

[54] 54.↵
Duggan, A. et al. Transient expression of the conserved zinc finger gene INSM1 in progenitors and nascent neurons throughout embryonic and adult neurogenesis. J Comp Neurol 507, 1497–1520 (2008).
OpenUrl CrossRef PubMed Web of Science

[55] 55.↵
Gaspar, H.A. & Breen, G. Pathways analyses of schizophrenia GWAS focusing on known and novel drug targets. (Submitted).

[56] 56.↵
Degner, J.F. et al. DNase I sensitivity QTLs are a major determinant of human expression variation. Nature 482, 390–394 (2012).
OpenUrl CrossRef PubMed Web of Science

[57] 57.↵
Skene, N.G. et al. Brain cell types and the genetic basis of schizophrenia. (Submitted).

[58] 58.↵
Frank, C.L. et al. Regulation of chromatin accessibility and Zic binding at enhancers in the developing cerebellum. Nature neuroscience 18, 647–656 (2015).
OpenUrl CrossRef PubMed

[59] 59.↵
Lupianez, D.G. et al. Disruptions of topological chromatin domains cause pathogenic rewiring of gene-enhancer interactions. Cell 161, 1012–1025 (2015).
OpenUrl CrossRef PubMed

[60] 60.↵
Vockley, C.M. et al. Direct GR Binding Sites Potentiate Clusters of TF Binding across the Human Genome. Cell 166, 1269–1281 e1219 (2016).
OpenUrl CrossRef PubMed

[61] 61.↵
Klann, T.S. et al. CRISPR-Cas9 epigenome editing enables high-throughput screening for functional regulatory elements in the human genome. Nature biotechnology (2017).

[62] 62.↵
Purves, D. et al. (eds.) Neuroscience, Edn. 2nd. (Sinauer Associates, Sunderland, MA; 2001).

[63] 63.↵
Cheung, I. et al. Developmental regulation and individual differences of neuronal H3K4me3 epigenomes in the prefrontal cortex. Proceedings of the National Academy of Sciences of the United States of America 107, 8824–8829 (2010).
OpenUrl Abstract/FREE Full Text

[64] 64.
Dammer, E.B. et al. Neuron enriched nuclear proteome isolated from human brain. Journal of proteome research 12, 3193–3206 (2013).
OpenUrl CrossRef PubMed

[65] 65.
Jiang, Y., Matevossian, A., Huang, H.S., Straubhaar, J. & Akbarian, S. Isolation of neuronal chromatin from brain tissue. BMC neuroscience 9, 42 (2008).
OpenUrl CrossRef PubMed

[66] 66.
Lister, R. et al. Global epigenomic reconfiguration during mammalian brain development. Science (New York, N.Y.) 341, 1237905 (2013).
OpenUrl Abstract/FREE Full Text

[67] 67.↵
Matevossian, A. & Akbarian, S. Neuronal nuclei isolation from human postmortem brain tissue. Journal of visualized experiments : JoVE (2008).

[68] 68.↵
Mouse Encode Consortium et al. An encyclopedia of mouse DNA elements (Mouse ENCODE). Genome Biol 13, 418 (2012).
OpenUrl CrossRef PubMed

[69] 69.↵
Buenrostro, J.D. et al. Single-cell chromatin accessibility reveals principles of regulatory variation. Nature 523, 486–490 (2015).
OpenUrl CrossRef PubMed

[70] 70.↵
Harrison, P.J. The neuropathology of schizophrenia. A critical review of the data and their interpretation. Brain 122 (Pt 4), 593–624 (1999).
OpenUrl CrossRef PubMed Web of Science

[71] 71.↵
Barbalat, G., Chambon, V., Franck, N., Koechlin, E. & Farrer, C. Organization of cognitive control within the lateral prefrontal cortex in schizophrenia. Archives of general psychiatry 66, 377–386 (2009).
OpenUrl CrossRef PubMed Web of Science

[72] 72.
Benedetti, F. et al. Functional and structural brain correlates of theory of mind and empathy deficits in schizophrenia. Schizophrenia research 114, 154–160 (2009).
OpenUrl CrossRef PubMed Web of Science

[73] 73.
Blot, K., Bai, J. & Otani, S. The effect of non-competitive NMDA receptor antagonist MK-801 on neuronal activity in rodent prefrontal cortex: an animal model for cognitive symptoms of schizophrenia. Journal of physiology, Paris (2013).

[74] 74.
Blumberg, H.P. et al. Amygdala and hippocampal volumes in adolescents and adults with bipolar disorder. Archives of general psychiatry 60, 1201–1208 (2003).
OpenUrl CrossRef PubMed Web of Science

[75] 75.
Brassen, S., Kalisch, R., Weber-Fahr, W., Braus, D.F. & Buchel, C. Ventromedial prefrontal cortex processing during emotional evaluation in late-life depression: a longitudinal functional magnetic resonance imaging study. Biological psychiatry 64, 349–355 (2008).
OpenUrl CrossRef PubMed Web of Science

[76] 76.
Chen, J. et al. Comparative study of regional homogeneity in schizophrenia and major depressive disorder. American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of 162, 36–43 (2013).
OpenUrl

[77] 77.
Cheung, C. et al. White matter fractional anisotrophy differences and correlates of diagnostic symptoms in autism. Journal of child psychology and psychiatry, and allied disciplines 50, 1102–1112 (2009).
OpenUrl CrossRef PubMed Web of Science

[78] 78.
Douaud, G. et al. Anatomically related grey and white matter abnormalities in adolescent-onset schizophrenia. Brain 130, 2375–2386 (2007).
OpenUrl CrossRef PubMed Web of Science

[79] 79.
Doyle-Thomas, K.A. et al. The effect of diagnosis, age, and symptom severity on cortical surface area in the cingulate cortex and insula in autism spectrum disorders. Journal of child neurology 28, 729–736 (2013).
OpenUrl

[80] 80.
Ecker, C. et al. Brain anatomy and its relationship to behavior in adults with autism spectrum disorder: a multicenter magnetic resonance imaging study. Archives of general psychiatry 69, 195–209 (2012).
OpenUrl CrossRef PubMed Web of Science

[81] 81.
Ellison-Wright, I. & Bullmore, E. Anatomy of bipolar disorder and schizophrenia: a meta-analysis. Schizophrenia research 117, 1–12 (2010).
OpenUrl CrossRef PubMed Web of Science

[82] 82.
Greimel, E. et al. Changes in grey matter development in autism spectrum disorder. Brain structure & function (2012).

[83] 83.
Honea, R., Crow, T.J., Passingham, D. & Mackay, C.E. Regional deficits in brain volume in schizophrenia: a meta-analysis of voxel-based morphometry studies. The American journal of psychiatry 162, 2233–2245 (2005).
OpenUrl CrossRef PubMed Web of Science

[84] 84.
McAlonan, G.M. et al. Mapping the brain in autism. A voxel-based MRI study of volumetric differences and intercorrelations in autism. Brain 128, 268–276 (2005).
OpenUrl CrossRef PubMed Web of Science

[85] 85.
McDonald, C. et al. Association of genetic risks for schizophrenia and bipolar disorder with specific and generic brain structural endophenotypes. Archives of general psychiatry 61, 974–984 (2004).
OpenUrl CrossRef PubMed Web of Science

[86] 86.
Myers-Schulz, B. & Koenigs, M. Functional anatomy of ventromedial prefrontal cortex: implications for mood and anxiety disorders. Mol Psychiatry 17, 132–141 (2012).
OpenUrl CrossRef PubMed Web of Science

[87] 87.
Neuhaus, E., Beauchaine, T.P. & Bernier, R. Neurobiological correlates of social functioning in autism. Clinical psychology review 30, 733–748 (2010).
OpenUrl CrossRef PubMed

[88] 88.
Ong, D. et al. Size and shape of the caudate nucleus in individuals with bipolar affective disorder. The Australian and New Zealand journal of psychiatry 46, 340–351 (2012).
OpenUrl CrossRef PubMed

[89] 89.
Rogers, T.D. et al. Is autism a disease of the cerebellum? An integration of clinical and pre-clinical research. Frontiers in systems neuroscience 7, 15 (2013).
OpenUrl

[90] 90.
Rossi, R. et al. Structural brain features of borderline personality and bipolar disorders. Psychiatry research (2012).

[91] 91.
Savitz, J. et al. Inflammation and neurological disease-related genes are differentially expressed in depressed patients with mood disorders and correlate with morphometric and functional imaging abnormalities. Brain, behavior, and immunity 31, 161–171 (2013).
OpenUrl CrossRef PubMed

[92] 92.
Schultz, R.T. Developmental deficits in social perception in autism: the role of the amygdala and fusiform face area. International journal of developmental neuroscience : the official journal of the International Society for Developmental Neuroscience 23, 125–141 (2005).
OpenUrl CrossRef PubMed Web of Science

[93] 93.
Sepede, G. et al. Impaired sustained attention in euthymic bipolar disorder patients and non-affected relatives: an fMRI study. Bipolar disorders 14, 764–779 (2012).
OpenUrl PubMed

[94] 94.↵
Victor, T.A., Furey, M.L., Fromm, S.J., Ohman, A. & Drevets, W.C. Relationship between amygdala responses to masked faces and mood state and treatment in major depressive disorder. Archives of general psychiatry 67, 1128–1138 (2010).
OpenUrl CrossRef PubMed Web of Science

[95] 95.↵
American Psychiatric Association Diagnostic and Statistical Manual of Mental Disorders, Edn. Fourth Edition. (American Psychiatric Association, Washington, DC; 1994).

[96] 96.↵
Van der Auwera, G.A. et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr Protoc Bioinformatics 11, 11 10 11–11 10 33 (2013).
OpenUrl

[97] 97.↵
Chang, C.C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
OpenUrl CrossRef PubMed

[98] 98.↵
Martin, M. Cutadapt Removes Adapter Sequences From High-Throughput Sequencing Reads. EMBnet.journal 17.

[99] 99.↵
Langmead, B. & Salzberg, S.L. Fast gapped-read alignment with Bowtie 2. Nat Methods 9, 357–359 (2012).
OpenUrl CrossRef PubMed Web of Science

[100] 100.↵
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
OpenUrl CrossRef PubMed Web of Science

[101] 101.↵
Quinlan, A.R. BEDTools: The Swiss-Army Tool for Genome Feature Analysis. Curr Protoc Bioinformatics 47, 11 12 11–34 (2014).
OpenUrl CrossRef PubMed

[102] 102.↵
Speir, M.L. et al. The UCSC Genome Browser database: 2016 update. Nucleic Acids Res 44, D717–725 (2016).
OpenUrl CrossRef PubMed

[103] 103.↵
Ross-Innes, C.S. et al. Differential oestrogen receptor binding is associated with clinical outcome in breast cancer. Nature 481, 389–393 (2012).
OpenUrl CrossRef PubMed Web of Science

[104] 104.↵
Robinson, M.D. & Oshlack, A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol 11, R25 (2010).
OpenUrl CrossRef PubMed

[105] 105.↵
Patterson, N., Price, A.L. & Reich, D. Population structure and eigenanalysis. PLoS Genet 22, e190 (2006).
OpenUrl

[106] 106.↵
van Buuren, S. & Groothuis-Oudshoom, K. mice: Multivariate Imputation by Chained Equations in R. Journal of Statistical Software 45, 1–67 (2011).
OpenUrl

[107] 107.↵
Ongen, H., Buil, A., Brown, A.A., Dermitzakis, E.T. & Delaneau, O. Fast and efficient QTL mapper for thousands of molecular phenotypes. Bioinformatics 32, 1479–1485 (2016).
OpenUrl CrossRef PubMed

[108] 108.↵
Storey, J.D. & Tibshirani, R. Estimating false discovery rates under dependence, with applications to DNA microarrays (Technical Report 2001-28). (Department of Statistics, Stanford University, Palo Alto, CA; 2001).

[109] 109.↵
Storey, J.D. & Tibshirani, R. Statistical methods for identifying differentially expressed genes in DNA microarrays. Methods Mol Biol 224, 149–157 (2003).
OpenUrl CrossRef PubMed

[110] 110.↵
Love, M.I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15, 550 (2014).
OpenUrl CrossRef PubMed

[111] 111.↵
de Bakker, P.I. et al. Practical aspects of imputation-driven meta-analysis of genome-wide association studies. Hum Mol Genet 17, R122–128 (2008).
OpenUrl CrossRef PubMed Web of Science

[112] 112.↵
Ma, W., Noble, W.S. & Bailey, T.L. Motif-based analysis of large nucleotide data sets using MEME-ChIP. Nat Protoc 9, 1428–1450 (2014).
OpenUrl CrossRef PubMed

[113] 113.↵
Heinz, S. et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell 38, 576–589 (2010).
OpenUrl CrossRef PubMed Web of Science

Evaluation of Chromatin Accessibility in Prefrontal Cortex of Schizophrenia Cases and Controls

Abstract

Introduction

Results

ATAC-seq evaluation

ATAC-seq peaks are enriched for schizophrenia heritability

Identification of differential open chromatin

Identification of chromatin QTL (cQTL)

cQTLs overlap with eQTLs

cQTL-schizophrenia interaction

Discussion

URLs

Conflicts of Interest

Online Methods

Study design

Rationale: choice of intact brain versus cell populations

Rationale: choice of brain region

Subjects and brain samples

Confirmation of sample identity

ATAC-seq library preparation and sequencing

ATAC-seq initial processing

Identification of sample outliers

Performance of samples

Performance of replicates

Quantification of open chromatin peaks

Ancestry estimation & population stratification

Evaluation of variables affecting chromatin peaks

Covariate selection

Association between diagnosis and meta data variables

Association of metadata with ATAC-seq peaks detected

Chromatin QTL analysis (cQTL)

Heterozygous individuals display allele bias in direction of cQTL association

cQTL interaction with schizophrenia

Overlap of eQTLs with cQTLs

Differential chromatin analysis

Heritability enrichment analysis

Amount of coverage from and overlap between all DNase-seq/ATAC-seq data

Motif enrichment with MEME-chip

Data sharing

Genome build

Acknowledgements

References

Citation Manager Formats

Subject Area