Chromosome-wide simulations uncover folding pathway and 3D organization of interphase chromosomes
=================================================================================================

* Davide Michieletto
* Davide Marenduzzo
* Ajazul H. Wani

## Abstract

Three-dimensional interphase organization of metazoan genomes has been linked to cellular identity. However, the principles governing 3D interphase genome architecture and its faithful transmission through disruptive events of cell-cycle, like mitosis, are not fully understood. By using Brownian dynamics simulations of *Drosophila* chromosome 3R up to time-scales of minutes, we show that chromatin binding profile of Polycomb-repressive-complex-1 robustly predicts a sub-set of topologically associated domains (TADs), and inclusion of other factors recapitulates the profile of all TADs, as observed experimentally. Our simulations show that chromosome 3R attains interphase organization from mitotic state by a two-step process in which formation of local TADs is followed by long-range interactions. Our model also explains statistical features and tracks the assembly kinetics of polycomb subnuclear clusters. In conclusion, our approach can be used to predict structural and kinetic features of 3D chromosome folding and its associated proteins in biological relevant genomic and time scales.

## INTRODUCTION

Eukaryotic genomes have a sophisticated hierarchical threedimensional organization, ranging from nucleosomes to loops, topologically-associating domains (TADs) [1, 3, 4, 8] and chromosome territories [5]. Understanding this multi-scale organization is a fundamental problem in biology as it has been linked to development, differentiation and diseases [6–8]. Application of chromosome conformation capture (3C) and microscopy-based methods has provided unprecedented insight into this problem. 3C-based methods have shown that genomes of metazoans are organized into TADs separated by boundaries [1, 3, 4, 8]. While the mechanism leading to the formation of TADs and the biological nature of boundaries are still not fully understood, good correlations between enrichment of specific proteins and histone modifications in different TADs have been observed [3, 8].

Chromatin-binding proteins, usually present in the form of multiprotein complexes, interact through a combination of protein-protein and DNA-protein interactions and can provide the enthalpy to compensate for the entropic costs associated with the formation of TADs or other compacted or looped forms of chromatin. The multiprotein complex, Polycomb repressive complex 1 (PRC1), can compact chromatin *in vitro*, and it is known that *in vivo* it bridges chromatin regions separated by tens of kbs to Mbs [3]. The ability of PRC1 to mediate intra-domain as well as long-range chromatin interactions stems from its self-association property mediated by SAM-domain polymerization of its PH subunit [3]. Heterochromatin protein 1 (HP1) also self interacts and stabilizes heterochromatic regions. Although CTCF and other insulator proteins mark boundary regions, they are also found within TADs and can mediate long-range loops either by homo-typic or heterotypic protein-protein interactions [10].

Association of chromatin-bound proteins with each other also regulates their subnuclear distribution. Microscopy studies have uncovered subnuclear organization of some proteins into clusters, or *foci*. Polycomb group (PcG) proteins are organized into hundreds of nanoscale clusters and their subnuclear distribution is coupled to chromatin topology [3]. In addition, “transcription factories” underlying active gene clusters, Cajal bodies and nuclear speckles are other features of subnuclear protein distribution linked to gene expression [11, 12]. Super-resolution microscopy coupled to FISH, in contrast to Hi-C, a population averaged technique, can provide single cell information and physical scales of various chromatin features [13–15].

Many of the chromatin-associated proteins regulating interphase organization dissociate from chromatin when cells enter into mitosis, where chromatin is reconfigured [16–22]. As a consequence, how interphase chromatin features are preserved through chromatin reconfiguration stages of cell division is not well understood. Faithful inheritance and imprinting of epigenetic traits regulating interphase chromatin organization from parent cell to daughter cells ensures maintenance of proper gene expression which, in turn, determines cellular identity [23]. Monitoring real time relaxation of chromosomes from the mitotic to the interphase state would shed much light into this complex issue, but such experiment has not been realised yet, as it appears very challenging and demands extensive high resolution observations.

In parallel with this rapidly growing body of experimental data, there have been a number of polymer physics models aiming at elucidating different aspects of genome organization within eukaryotic nuclei. Some of the models [24–26] start from the Hi-C data and have the goal of finding an appropriate force field for the intra-polymer interactions which can recreate these data. A complementary approach, so called “bottom-up”, starts from basic biological assumptions and on the sequencing, *e.g*., chromatin immuneprecipitation coupled to sequencing (ChIP-Seq), datasets characterizing the one-dimensional (1D) genomic features. From this information, but rigorously avoiding any prior knowledge of three-dimensional (3D) contacts or genomic architecture, these models explore via large-scale computer simulations the resulting 3D spatial organization of the genome [7, 27, 28, 30–32]. In this case, the predicted 3D structure can then be directly compared with Hi-C and microscopy data. The two approaches are also sometimes denoted as “inverse” and “direct” modelling respectively. Our current work is an example of the “direct” modelling avenue, and its key strength is in its much enhanced predicted power. As our direct modelling approach does not involve computationally costly fitting procedures, we can simulate very large regions, in this case entire chromosomes, and follow their dynamics for time-scales which are biologically relevant, up to several minutes. These two aspects of our approach enable us to make prediction of structural and kinetic features of chromosome relaxation in biologically relevant genomic and time scales.

In this work we use Brownian dynamics simulations to study the folding of the entire *Drosophila* chromosome 3R (chr3R). In our model, this chromosome is viewed as a semi-flexible polymer made up of coarse grained beads (each representing about 3 kbp of the fly’s genome, see SI, Methods), which interact with soluble proteins, or protein complexes. Our minimal model is based on the simple biological assumption that protein complexes are mainly responsible for driving the observed 3D organization of the chromatin [27, 28, 33–35]. Since these proteins can bind to chromatin at many places, they can bridge non-contiguous chromatin regions to which they are bound. Loops mediated by CTCF, chromatin contacts mediated by polymerization of PcG protein PH [36], and heterochromatin-associated complexes such as dimers of HP1 [37] are some examples of chromatin contacts orchestrated by a combination of DNA-protein and protein-protein interactions.

For simplicity, we only directly model protein bridges, and disregard monovalent proteins. The affinity of chromatin beads for a protein complex is either directly set according to the chromatin immunoprecipitation (ChIP-seq) data, or as per the local chromatin states, which have previously been inferred from various ChIP-seq data [4, 5].

In order to test the predictions of our model, we compare our simulations with both, experimentally obtained chromatin contact maps and subnuclear size distribution of protein clusters. In all cases, we find good overall agreement between simulations and experimental observations, which is remarkable given the simplicity of the assumptions in our model. In addition, we also track the folding dynamics of the chromosome up to experimentally relevant time-scales of several minutes. From this study we draw an important conclusion: the folding dynamics of an interphase chromosome proceeds in a two-step fashion. First, local contacts within a TAD form fast and reproducibly; concomitantly, nuclear protein clusters self-assemble within minutes. Later on, non-local contacts between genomic regions in different TADs are established. This process is slower and it is accompanied by an equally slow coarsening of the nuclear protein clusters. Importantly, the first step (local folding into TADs and nuclear protein cluster assembly) appears solely dependent on the protein binding landscape along the chromosome and strongly suggests that this minimal 1D information can lead to the faithful 3D organization of the genome. On the other hand, there is a notable dependence on the chosen initial chromosomal state in the higher order folding; the formation of non-local, inter-TAD interactions (which is accompanied by coarsening of nuclear protein clusters) has also a significant stochastic component, as different contacts are set up in different simulations – similarly to what observed in single cell Hi-C experiments. In what follows, we will refer to both dependencies on initial conditions and on stochasticity by saying that the second folding step depends on the conformational “history” of the system.

## RESULTS

### PRC1 binding pattern robustly predicts the formation of a sub-set of the experimentally observed TADs

We begin by modelling the chromatin fibre of *Drosophila* chromosome 3R (chr3R) during interphase as a semi-flexible polymer (see SI, Methods) interacting with soluble proteins, representing the polycomb repressive complex 1 (PRC1). This complex binds to chromatin at many places, and a subunit of it, polyhomeotic PH, self-polymerizes, making PRC1 able to effectively bridge between different regions of the chromatin fiber [3]. The affinity between PRC1 and each chromatin bead is informed by ChIP-seq data (see SI, Methods) and can vary between two possible values: high affinity binding (interaction strength *ϵ*1) and weaker affinity (interaction strength *ϵ*2 < *ϵ*1). The relative affinities of these two types of binding sites are set accordingly to the relative binding intensities obtained from ChIP-seq data [3] (see SI Table T1 and Fig. S1).

![Figure S1:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F8.medium.gif)

[Figure S1:](http://biorxiv.org/content/early/2016/04/12/048116/F8)

Figure S1: 
(**left**) Density of binding sites in *Drosophila* genome as a function of their intensities. One can notice two distinct types of binding sites for a PcG protein, PH. These strong (red) and weak (black) binding sites were identified previously [3] and sequencing data was deposited in NCBIs Gene Expression Omnibus and accessible under accession number GSE60686. (**right**) ChIP-seq (binding) profile for strongly (red) and weakly (black) binding sites on *Drosophila* chr3R. The y-axis is arbitrary in that the strong (red) binding sites have higher values than the black ones only for visualisation purposes. The affinity of the binding sites is tuned to study a range of cases (see text), while the position of the binding sites reported here is carefully mapped to the coarse-grained polymer model.

View this table:
[Table 1.](http://biorxiv.org/content/early/2016/04/12/048116/T1)

Table 1. 
Table of the parameters *ϵ**ab* used to model the attractive interaction between DNA and specific binding proteins. The polycomb binding sites are taken from Ref. [3]. The eu-and hetero-chromatin states are obtained from the works Ref. [4, 5] (see main text). The strong and weak affinities of the PRC1 binding sites are varied to explore a range of cases. The strongly and weakly binding euchromatin represent strong enhancer-promoter interactions and weaker interactions in the other cases (such as generally active euchromatin), respectively. These can be bound to generic transcription factors (TF). Heterchromatin is instead bound by generic heterochromatin factors (HF).

We initialized the simulations from three possible conformations of the polymer thereby simulating different initial chromosomal states: (i) random walk (RW), (ii) stretched random walk (S) and (iii) fractal globule (FG) (see right-bottom corner of Figs. 1(a),(b) and (c), respectively, for examples of such configurations). While the random walk represents the simplest polymer physics choice (see Suppl. Movie M1), the stretched random walk may better emulate a post-mitotic elongated fiber (Suppl. Movie M2). Finally, the fractal globule state is motivated by the homonymous model, according to which interphase chromosomes are folded so that they can crumple and segregate whilst remaining entanglement-free. As a consequence of this peculiar global folding, contacts in the fractal globule are established almost exclusively among genomic regions which are also contiguous in 1D sequence space [40] (see Suppl. Movie M3). Although all initial configurations are prepared in such a way that they possess a small knotting probability, we do not check for the presence of knots in either initial or final configurations. Nonetheless, the chain is allowed to cross through itself with an exponentially small probability (see SI) in order to quickly move away from highly entangled, topologically frustrated, configurations.

![Figure 1:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F1.medium.gif)

[Figure 1:](http://biorxiv.org/content/early/2016/04/12/048116/F1)

Figure 1: PcG protein driven topology of chr3R in simulations: Local domains form robustly and similarly for all initial chromosomal states,whereas non-local contact patterns reflects the system history.
Here we show the contact maps and the contact probability for different initial configurations and concentration of binding proteins. The top three rows show contact maps: the left picture shows the contact map which is rotated by 45 degrees and truncated to show more clearly local domains along the diagonal in the region 10-15 Mbp; the right picture shows the full contact map. The three rows correspond to, respectively: starting configuration (top); final configuration with 500 PRC1 proteins (middle); final configuration with 1500 PRC1 proteins (bottom). The final configurations are obtained after 2 105 simulation time units, or 2-20 minutes (see SI for the mapping between simulation and physical units). The fourth row shows the decay of the contact probability as a function of genomic distance in the initial (grey line) and final configurations (coloured lines), and a snapshot of the initial configuration (right picture). In the contact probability plot we also show power law fits with different effective exponents in the intradomain (<Mbp) and interdomain (>Mbp) range. (**a**) (**b**) and (**c**) correspond to Random walk (RW), stretched (S) and fractal globule (FG) initial chromosomal states. See text and SI for further details on the preparation. By comparing initial and final rows one can notice that local domains form robustly and similarly for all initial conditions, whereas non-local contact patterns depend much more strongly on initial conditions. This is also supported by the two exponents found for the decay of contact probability: close to ‐1 for sub-megabase contacts and for all cases, while case-subjective for the inter-TAD (>Mbp) regime. See also Suppl. Movies M1-M3.

Starting from these initial configurations, we followed the evolution of contacts of chr3R for about 2-20 minutes of real time (calculated assuming a nucleoplasmic viscosity of 10-100 cP, respectively, ten or a hundred times that of water, see SI, Methods). Remarkably, we find a very similar local domain structure for all three initial conditions (as shown in the zoom of the region 10-15 Mbp in the second and third rows of Fig. 1(a)-(c)), which suggests that the formation of some topological domains is largely driven by PRC1 binding and self-association.

Interestingly, in the case of the fractal globule, while at the beginning the domains of enriched contacts are visible at multiple scales (see top row in Fig. 1(c)), as expected for a fractal structure, later on these domains rearrange to settle into their final locations, which is determined by the ChIP-seq binding sites of PRC1, and bear little resemblance with the initial state (see Fig. 1(c) and compare second and third rows with first row, see also SI). Remarkably, the final domain locations predicted from all three different starting conditions are in good agreement with TADs as found in Hi-C experiments [8] and classified as PcG TADs (see SI Fig. S2).

![Figure S2:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F9.medium.gif)

[Figure S2:](http://biorxiv.org/content/early/2016/04/12/048116/F9)

Figure S2: 
These plots show the contact maps found experimentally (**Exp**), and by simulations (**FG-RW-S1**) for the region 12-13 Mbp. The same colorbar applies to all plots and is shown at the top. The simulated contact maps correspond to different initial conditions: fractal globule (FG), random walk (**RW**), stretched configuration (**S**). As these results are reported form the “polycomb-only” model, it is important to notice that the PcG TADs are the ones highlighted by the blue squares, and therefore our model correctly captures those, and *not* the domain in the middle. The local folding is robust and essentially independent of the initial condition and also shows additional contacts between the PcG domains in agreement with the experiments.

Differently to the local domain folding, the non-local pattern of contacts, visible as the off-diagonal structure in the full contact map (reported in Fig. 1 for the various cases), is more strongly affected by the starting configuration – the contact maps also show variability in different runs, so that several runs need to be averaged when computing the contact map. For the stretched random walk, local interactions (<1 Mb away along the genomic sequence) account for well over 90% of the overall contacts. This number falls to about 80% for the RW initial condition, and is similar for the fractal globule conformation. An analysis of the contact probability versus genomic distance (*P*(*s*), Fig. 1 bottom row) confirms that there is a “local” scale where the information regarding the initial chromosomal state is lost and a “non-local” scale which carries the signature of system history. This can be seen from the power law decay exponent of *P*(*s*). Interestingly, we find that restricting the datasets to contacts between regions closer than 1 Mbp in sequence, *i.e*., “local” interactions, the effective exponent measuring the decay of contact probability with distance (within chr3R) is close to ‐1 whatever the initial chromosomal state. This values becomes closer to the experimentally observed one [8] (~ –0.75) when a more detailed model is considered (see later sections and SI Fig. S9). Nonetheless, this history-independent emerging exponent reflects the fact that the local organization is solely determined by chromatin-associated proteins and their binding profiles, rather than the initial chromosomal architecture, which, as we showed, is lost in steady state (Fig. 1). The exponent characterizing non-local interactions is instead observed to vary significantly with initial condition, which once again suggests that the inter-TAD organization is dependent on the system history. The fact that there can exist two effective exponents: one for local, intradomain interactions, and another one for non-local, or interdomain, interactions is also supported by the most recent HiC data for contacts within human chromosomes [41]. These findings also complement the ones in Ref. [27], which demonstrated that the effective exponent is dependent on interaction strength and protein concentration. In our simulations we in fact observe that the effective exponent for long-range contacts depends on the history of the system, which translates into a sensitivity on timing of the cell cycle or on sample preparation.

### Including binding patterns of other proteins predicts location of remaining TADs

The results discussed above pertain to the case in which only PRC1 interacted with the chromatin fiber. We refer to this case as the “polycomb-only” model from now on. There are other proteins which act as architectural players in shaping the 3D organization of chromatin. Heterochromatin associated protein HP1, transcription factors and insulator proteins associated with enhancer-promoter loops are common examples. We therefore tackled the problem of understanding how these additional factors change the chromosome folding, which has been thus far driven by the PcG proteins alone. To this end, we introduced two other sets of binding and bridging factors, one representing a heterochromatin-binding factor, and another one representing a euchromatin-binding factor. We chose these generic binders for these epigenetic states because mechanistic aspects driving folding of these chromatin states are not as clearly understood as in the case of PcG proteins, which have been clearly shown to the compact and regulate the topology of chromatin [42].

We marked chromatin beads according to their affinity for the three binders in our simulations: (i) PRC1, (ii) heterochromatin-binding factor and (iii) euchromatin-binding factor. While the affinity between chromatin and PRC1 was determined through ChIP-seq data as previously discussed, we used chromatin state data from Refs. [4, 5] to determine the interactions with the other two factors, as follows. First, we used chromatin states from Ref. [4], and stipulated that heterochromatin factors only bind (with weak affinity, *ϵ*3 =2 *k**B* *T*) to “black” chromatin state, which labels generic heterochromatin (accounting for almost half of the *Drosophila* genome). The heterochromatin factors may therefore model proteins like histone protein H1, AT-Hook protein D1, SUUR, and Su(Hw), all proteins found in black chromatin and associated with architectural roles or the establishment of higher order chromatin structure [4]. Second, we used the chromatin state model from Ref. [5] to identify enhancers (state 1 in Ref. [5]) and promoters (state 3), as chromatin regions where euchromatin factors bind strongly (with high affinity, *ϵ*4 = 9 *k**B**T*). Such euchromatin factors may represent clusters of transcription factors, bridging factors like CTCF and polymerases which can loop the chromatin of active genes. We also allowed binding of euchromatin factors to transcribed regions (states 2, 4 in Ref. [5]) with weak affinity, *ϵ*5 =3 *k**B**T*. We refer to the model with three sets of binders as the “full-model”. For simplicity, we present in this case mainly results obtained with a mitotic-like initial chromosomal state (see preparation details in SI and Suppl. Movies M4-M6). This structure is qualitatively similar to the stretched state used for the polycomb-only model, but has additionally an internal looped structure matching that found in some models for mitotic chromosome organization [6] (see SI).

We find that incorporation of additional bridges in the full model leads to formation of further TADs, while PcG driven TADs remain largely unaltered. This suggests that, to a first approximation, the bridges act mostly independently of one another. Fig. 2 shows a comparison between the predicted and experimental contact maps across different regions of chr3R. At 1.5–2.35 Mbp a big TAD flanked by smaller TADs appear in the full model. The polycomb-only simulations (from any initial condition) show instead a weak domain encompassing these, but with no well defined boundary. As shown by the comparison, these TADs clearly correlate with TADs from Hi-C data, and fall in the “null” class [8]. Next at 12–13.4 Mbp, additional TADs flanking the central PcG TAD (highlighted by the light blue box) come up in the full model; these are absent in the polycomb-only model, and again they correlate well with contacts map from Hi-C. Interestingly, on the left hand side, one can further see that TAD splits into two subdomains, as in experiments. Towards the end of chr3R there is a region, located at 23–24.4 Mbp, which remains completely structureless in the polycomb-only simulations (top row), but instead folds into regular TADs in the full model, yet again matching well with the Hi-C data.

![Figure 2:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F2.medium.gif)

[Figure 2:](http://biorxiv.org/content/early/2016/04/12/048116/F2)

Figure 2: Comparison between predicted and experimentally observed chromatin topology using “polycomb-only” or “full” model simulations.
In each of these plots we rotate the contact maps in order to better compare the TADs obtained from simulations (bottom halves) to the ones from Hi-C experiments [8] (top halves) for three chosen segments of ch3R (left:1.5-2.35Mbp; middle:12-13.4Mbp; right:23-24.4Mbp). The top row shows the comparison with the “polycomb-only” model while the bottom row with the “full” model. It is evident that the latter better captures the position and the fine structure of the experimentally observed TADs, comprising heterochromatic and active domains, while the former only captures the presence of PcG TADs [8] (shown in the middle column).

To further validate the predicted contact map, 4C simulations using a bait at 12.77 Mbp on chr3R were performed (SI Fig. S3). An asymmetric distribution of contacts was observed around the bait, which is expected given that the bait lies close to the boundary of the TAD (see Fig. 2 and SI Fig. S3). This trend is well reproduced by both polycomb-only and full model simulations. Comparing with experimental 4C contact map, detailed analysis of the results shows another intriguing trend. The polycomb-only model shows four large peaks to the left of the bait which correlate with the 4C data, but are much larger than in experiments. Similarly, also the forward contacts predicted by this model are significantly more than those observed by 4C-seq. On the other hand, the number of contacts made by the bait decline in the case of the full model and the agreement with the experimentally observed 4C curve improves (SI Fig. S3). Therefore, although at first approximation we can say that each of the bridges act separately to form separate TADs, the non-PcG bridges can screen PcG-mediated interaction, *e.g*., by sequestering weak binding sites for PRC1. Overall, both models give a good prediction for the position of troughs and peaks in the 4C experiment.

![Figure S3:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F10.medium.gif)

[Figure S3:](http://biorxiv.org/content/early/2016/04/12/048116/F10)

Figure S3: Comparison between simulations and 4C-seq data.
The simulated 4C experiment is in qualitative agreement with the experimental profile (dark grey, from Ref. [3]). Both models, “polycomb-only” (hues of blue) and “full” (hues of red), qualitatively capture the asymmetric profile while the latter model also provides some clues on how the competition between binding proteins might play a role in determining the chromatin folding (see discussion in main text).

From the above-presented simulations, it appears that 1D information obtained from ChIP-seq data can successfully be used to predict the chromatin contacts. Although this procedure can lead to a static picture of interphase chromatin organization, it is also crucial to understand the *dynamics* of formation of contacts and folding. In particular, how the interphase organization is attained by a chromosome upon exiting mitosis is still not clear. Understanding the relaxation of chromosomes from mitotic to interphase states may yield insight into how chromatin organization is inherited across cell generations and enabling cells to remember their identity. Our large scale simulations allow us to probe experimentally relevant time-scales and in light of this we proceed to analyse the chromosomal folding dynamics and study its features.

### Chromatin folding is a two-step process

In order to understand how chromosomes attain locus-specific interphase organization from a locus-independent mitotic state, we studied the relaxation dynamics of chr3R *in silico*. This process would be very difficult to probe experimentally as it would entail, for instance, an extensive Hi-C study of temporally synchronised cells. In light of this, simulations provide an ideal tool to gain some insight into this complex process.

Following the evolution of contacts from three distinct initial configurations using the polycomb-only model, different kinetics is observed for the formation of local and non-local contacts. For all three initial conditions, Fig. 3 shows that local contacts are established faster than non-local ones, and the growth rates also take similar values across different initial conditions. This suggests that the starting configurations chosen to model the initial state of chr3R do not affect its folding kinetics. Contacts are classified as “local” when occurring between beads < 1 Mbp apart along the genome, mainly due to intra-TAD interactions, while nonlocal contacts correspond to > 1 Mbp interactions, mainly due to inter-TAD contacts. From these data it therefore appears that the folding of chr3R takes place in at least two stages. First, local TADs form, and then consolidation of the overall shape takes place by establishment of inter-TAD contacts. A further inspection of the time evolution of the contact maps (Fig. 4) also suggests that local TADs are formed earlier than the establishment of non-local contacts. This can be readily seen in Fig. 4, where we show the evolution of a selected region of chr3R simulated using the “full model” and starting from a random walk configuration. This case displays a quick local folding (within ~ 5 seconds) followed by slower non-local arrangement (up to ~ 12 seconds) which does not show further folding and suggests the establishment of a long-lived configuration (up to 2 minutes). Similar contact maps for chromosomes starting from mitotic-like configurations are reported in the SI (Figs. S6-S7 and Suppl. Movies M4-M5).

![Figure 3:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F3.medium.gif)

[Figure 3:](http://biorxiv.org/content/early/2016/04/12/048116/F3)

Figure 3: Two-step folding of chr3R.
This figure shows the dynamic formation of "local" (< 1 Mbp) and "non-local" (> 1 Mbp) contacts in simulations of "polycomb-only" model. The top row shows the initial and final distribution of the contacts for the random walk (red, left), stretched (green, middle) and fractal globule (purple, right) initial configurations. The bottom row shows the time evolution of the number of local and non-local contacts from the initial state to the final one for the same initial chromosomal state. In all cases, there are two different time-scales, one associated with the fast formation of local contacts and another one associated with the slow formation of non-local contacts. This is strong evidence for a two-step mechanism for the folding of chromosomes exiting any arbitrary initial state. Here, 105 *τ**Br* can be mapped to 1-10 minutes of real time (see SI).

![Figure 4:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F4.medium.gif)

[Figure 4:](http://biorxiv.org/content/early/2016/04/12/048116/F4)

Figure 4: The folding dynamics is arrested and displays long-lived TADs matching the experimental ones.
The time evolution of domains between 23 and 24.5 Mbp is shown. The contact maps are made by averaging over 6 independent simulations starting from a RW configuration and simulated using the “full” model. The snapshots represent configurations taken from one of the simulations. This figure clearly shows that local interactions and intra-TAD arrangements occur on faster time-scales than inter-TAD ones. It also suggests that the folding structure achieved at short time (~ 12 seconds) is very long lived (virtually unchanged from 12 seconds on to 120 seconds) and well captures the epigenetic states along the polymer (see bottom left figure).

Using the full-model, we simulated folding of chr3R from a mitotic like conformation, which is one of the major chromatin reorganization events *in vivo*. Fig. 5 shows that also with this more complete model, and more realistic initial conditions, the conclusions on the kinetics of chromosome folding reached on the basis of the simpler polycomb only model remain unaltered. The chromosome appears to fold in a two-step fashion, with local contacts being established significantly earlier than non-local ones. Time evolution of contact map from mitotic-like state shows that nonlocal contacts continue to form after the establishment of local TADs (SI Fig. S6-S7, Suppl. Movies M4-M6). The fraction of local contacts decreases while fraction of non-local contacts increases upon relaxation from the mitotic-like state. This appears to be in agreement with experimental observations [47], where application of Hi-C showed that mitotic chromosomes are depleted of long-range contacts, TADs, but enriched in very short-range (local) contacts. Here also the nature and genomic coordinates of contacts appears different between initial and final conformations. On the other hand, the dynamics of folding is again very robust against different initial conditions. In Fig. 5 we also show the folding dynamics of chr3R starting from a random walk initial configuration (see Suppl. Movie M7). This is both qualitatively and quantitatively strikingly similar to the dynamics of and final states reached by the mitotic-like starting chromosome (compare final snapshots in insets of Fig. 5 and the time-scales in the two cases). These results strongly suggest a broad robustness of the folding dynamics which might give a clue on the reproducibility and stability of TADs *in vivo*.

![Figure 5:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F5.medium.gif)

[Figure 5:](http://biorxiv.org/content/early/2016/04/12/048116/F5)

Figure 5: The two-step folding process is also observed in the full model.
(**left**) Kinetic of the formation of local and non-local contacts (defined and normalised as in Fig. 3) for the full model, where chr3R is initialised in a mitotic cylinder configuration (top row) (see SI, Ref. [6] and snapshot in inset) and as random walk (bottom row). Time is measured in simulation units (or Brownian times *τ**Br*, see SI for mapping to physical units) and 105 *τ**Br* can be mapped to 1-10 minutes of real time. (**right**) Histograms showing the non-local and local fraction of contacts in the simulated chromosome conformation at the start and end of the simulations shown for the two cases. The qualitative and quantitative similarity of the final configurations, distribution of contacts and folding dynamics is remarkable given so different initial configurations (compare snapshots in inset) and strongly suggest a very robust mechanism of genome organization.

Even though the kinetics are qualitatively similar for different initial configurations and the two models considered (compare Figs. 3 and 5), the quantitative balance struck between local and non-local contacts over time varies for the three different initial conditions. For the RW case, an increase in local and a decrease in non-local contacts is observed, while for the stretched configuration a moderate decrease in local contacts is seen. Finally, for the FG case, the fraction of local contacts increases, while that of non-local ones decreases over time. This dynamic quantitative evolution is accompanied by an even more significant qualitative dynamic change in the nature and genomic coordinates of contacts, which are very different in the initial and final states across different starting conditions (compare top and bottom rows in Fig. 1).

Given that the extent of non-local contacts varies across initial conditions, one might expect that upon waiting a very long time all initial conditions would eventually reach the same steady state properties for the non-local contacts. Even if this were the case, it is likely that, in practice, this time-scale may not be easily reachable in reality: for instance, the equilibration (Rouse) time of a polymer the size of chr3R in isolation (and under infinite dilution) is 108 time units, or 100-1000 minutes, and nuclear crowding is likely to further slow down the dynamics dramatically [6]. It is therefore possible that the system may still evolve, but on much larger time-scales; alternatively, it may be effectively arrested in a metastable state dependent on the initial condition. We favour the latter interpretation on the basis of our simulations, as no detectable change in the pattern is observed even in selected, longer simulations of 106 time units. Furthermore, the radius of gyration and average cluster size appear to have reached a steady state within our simulation time, implying there might not be further major conformational changes (SI Fig. S4-S5). This is also strongly supported by Figs. S6-S7, where we show the time evolution of selected contact maps.

![Figure S4:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F11.medium.gif)

[Figure S4:](http://biorxiv.org/content/early/2016/04/12/048116/F11)

Figure S4: 
Evolution of the radius of gyration *R**g* of the chromosome ch3R in the case of the “polycomb-only” (left) and and “full” (right) model. The different initial configurations are indicated in the figure legends. The “full” model drives a stronger compaction, possibly due to the more numerous proteins considered. The fractal globule case (the dramatic decay indicates the artificial collapse of the polymer prior to its release) shows instead a swelling that contributes to losing many local contacts.

### Protein clusters form concomitantly with chromosome folding

In all of the above-presented simulations we showed that chromatin associated proteins bind to the genome at specific sites, and then interact with each other to bridge different regions of chromatin. This implies that the distribution of chromatin-associated proteins will change as chromatin will reorganize. We therefore now analyse the dynamics of PRC1 during chromosome folding *in silico*, as the subnuclear distribution and dynamics of this protein complex are relatively well studied experimentally [3, 50].

Using both models and for all initial conditions, we remarkably find that the PRC1 proteins rapidly form clusters, which then grow more slowly and appear to reach a stable steady state within our simulated time window, after which no appreciable coarsening occurs (Fig. 6 and SI Fig. S5). Although we considered several different possible values for the concentration of PRC1, we found that all these, except the smallest one (100 proteins for the whole of chr3R) led to very similar cluster size distributions. Remarkably, by assuming a diameter of about 30 nm per complex, these distributions compare well with those observed by STORM (Fig. 6). Because the affinities of PRC1 are only determined qualitatively from ChIP-seq data [3] (see SI), we further varied the values of high and low affinity sites used in the simulations, and found that, while clustering is observed for all the cases we considered, the distribution only matches well that observed by STORM for values of binding for high affinity sites in the range 8-10 *k**B**T* and for weak sites in the range 2-4 *k**B**T* (see SI Fig. S2). These values are reasonable estimates for DNA-protein interactions, which need to be in a range to allow stable but not irreversible binding. This also appears to be in agreement with about four-fold difference in binding intensity between low and high intensity ChIP-seq peaks (Fig. S1) The clustering occurs through the following positive feedback mechanism: the formation of a small cluster of PRC1 proteins is followed by an increased local concentration of polycomb group binding sites; because of this enhanced concentration, more PRC1 proteins are recruited there leading to a further increase in binding site density. This is an example of a generic mechanism through which DNA-binding proteins with multiple binding sites along the chromatin cluster, and which has previously been dubbed “bridging-induced attraction” [28, 34], because it only works with proteins which can bridge chromatin (or indeed any other polymer to which they bind). In this case we consider that binding is specific, and the positive feedback loop we have outlined further requires that a single chromatin bead can bind to more than one protein complex. Because a chromatin bead represents 3.1 kbp (see SI, Methods), this is a realistic assumption for PRC1, as many of the ChIP-seq peaks are smaller than bead size and each of them can likely bind multiple molecules. We note that the affinity range we considered leads to stable rather than transient foci. This is in line with the slow recovery times observed for polycomb bodies; also note that post-translational protein modification, not included here, may potentially render foci dynamics faster.

![Figure S5:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F12.medium.gif)

[Figure S5:](http://biorxiv.org/content/early/2016/04/12/048116/F12)

Figure S5: 
(**a-b**) Time evolution of the average cluster size. The coarsening dynamics of the binding proteins slows down towards the end of the simulation, indicating that the systems has reached a (quasi-)steady state. (**c-f**) Time evolution of the cluster size for the case of the “polycomb-only” model starting from a random walk configuration and with different number of PH proteins (*N**ph*). The cluster size count reaches a steady state whose average is around 120 nm, remarkably close to the one experimentally observed (see main text). (**g-i**) Coarsening dynamics for the full model. The clusters of PRC1 display a severely slow coarsening towards the end of the simulation also in this case.

![Figure 6:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F6.medium.gif)

[Figure 6:](http://biorxiv.org/content/early/2016/04/12/048116/F6)

Figure 6: Formation of PRC1 clusters in chr3R in computer simulations and STORM experiments.
(**a**)-(**c**) The final steady state distribution of cluster diameters in the three cases considered in this work and for different number of proteins *N**ph* are shown. The three cases correspond to how we initialise the chromatin: as a random walk (**RW**), a stretched random walk (S) or a fractal globule (FG) (see SI). (**d**) The cluster diameter distribution measured from STORM experiments [3] shows a remarkable agreement with the simulations. (**e**)-(**f**) Snapshots from simulations showing the clustering of polycomb proteins (red). (**g**)-(**h**) Images form STORM experiment [3]. The scale bar is 500 nm in all images.

We followed the dynamics of PRC1 during folding of chr3R also using the “full-model” and observed that PRC1 started bridging different segments of chr3R by self-associating and this resulted in the formation of PRC1 clusters. A biphasic kinetics was observed for self-association of PRC1, a fast phase followed by slow phase reaching to steady-state (SI Fig. S5). Assuming a size of 30 nm for PRC1, and the formation of spherical cluster (see SI), we observed that cluster diameter reached 120 nm which is the range of experimentally observed values.

In both the polycomb-only and the full model, the biphasic clustering dynamics of PRC1 mirrors the two-step chromatin folding. In particular, the time-scales associated with the biphasic dynamics compare well quantitatively with the time-scale associated with the faster local folding and the slower non-local folding of the chromatin fibre. The growth of PRC1 clusters also follow approximate power laws, where the apparent exponent is significantly larger for the faster initial dynamics (Fig. S8). The apparent exponents are always lower than 1/3 at late times, which would be the exponent expected of diffusive coarsening for clusters of self-attracting particles [48]: possibly, this discrepancy is due to the fact that the coarsening droplets are made up by DNAbound, rather than freely diffusing, proteins.

![Figure S6:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F13.medium.gif)

[Figure S6:](http://biorxiv.org/content/early/2016/04/12/048116/F13)

Figure S6: 
Time evolution of the whole contact map staring from a mitotic-like configuration. The snapshots are taken at *t* = 0, 2.4, 4.8, 38.4, 91.2 and 120 seconds (for viscosity *η* = 10 cP). One can notice a very defined rosette structure in (a) that is lost to the advantage of the formation of TADs. See Movie M4.

![Figure S7:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F14.medium.gif)

[Figure S7:](http://biorxiv.org/content/early/2016/04/12/048116/F14)

Figure S7: 
Time evolution of domains between 10 and 20 Mbp starting from a mitotic-like configuration. The snapshots are taken at *t* = 0, 2.4, 4.8, 38.4, 91.2 and 120 seconds (for viscosity *η* =10 cP). One can notice a very defined rosette structure in (a) that is lost to the advantage of the formation of TADs. See Movie M5.

![Figure S8:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F15.medium.gif)

[Figure S8:](http://biorxiv.org/content/early/2016/04/12/048116/F15)

Figure S8: 
Cluster distribution, contact probability and a snapshot for the case in which the chromosome 3R of *Drosophila* was initialised as a Random Walk (RW) and with *ϵ*1 = 10 *k**B**T* and *ϵ*2 =5 *k**B**T*. In this case there are clusters with a large fraction of “polymerisation independent binding sites” (PIBS), as highlighted by the blue arrow. In the figure, red and blue beads are respectively, strong and weak Polycomb binding sites while grey beads are neutral. Yellow beads represent PH (or PRC) bridge proteins.

![Figure S9:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F16.medium.gif)

[Figure S9:](http://biorxiv.org/content/early/2016/04/12/048116/F16)

Figure S9: 
Contact probability for the full model starting from RW or mitotic-like configurations. One can notice that for both cases, the sub-megabase contacts decay with an exponent around 0.6 – 0.7 well matching the experimentally observed one [8]. One the other hand, beyond 1 Mbp, the contact probability is strongly affected by the choice of initial state.

## DISCUSSION AND CONCLUSIONS

In summary, we studied the folding dynamics of a *Drosophila* chromosome (chr3R), covering realistic genomic and time scales, by means of Brownian dynamics simulations. These simulations are based on the simple assumption that chromosome structure is maintained by the action of “bridges”: these are architectural proteins, or protein complexes, which can bind chromatin at many sites. We considered three types of chromatin-binding bridges: (i) PRC1, (ii) a generic heterochromatin-associating factor, and (iii) a generic euchromatin-associating factor, bridging enhancers and promoters. Prior to this full model, we also considered a simplified, polycomb-only, version where only the role of PRC1 was modeled. This allowed us to study in more detail the nature and assembly dynamics of polycomb-associated structures. Our modeling is informed by a series of sequencing data, on the basis of which we selected the affinity between a chromatin “bead” and a protein, these include ChIP-seq data (for PRC1), and chromatin state data (for heterochromatin‐ and euchromatin-associating factors). Our model can follow the evolution of both proteins and chromatin, and we compare the nuclear organization we find *in silico* with that found experimentally. An example of typical final configuration that we obtain from our simulations is represented in Fig. 7(a), where we also show the typical 3D folding of heterochromatin (Fig. 7(b)), PcG binding sites (Fig. 7(c)) and euchro-matin (Fig. 7(d)).

![Figure 7:](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2016/04/12/048116/F7.medium.gif)

[Figure 7:](http://biorxiv.org/content/early/2016/04/12/048116/F7)

Figure 7: Snapshots from simulations of the “full model”.
(**a-d**) Show the final configuration of chr3R, as simulated via the “full” model (see text for details). Yellow, black and magenta spheres represent PRC1, heterochromatin-associating, and euchromatin-associating bridging proteins, respectively. White chromatin beads are non-binding; Chromatin beads coloured with hues of blue and hues of green denote weak and strong affinity to euchromatin-associating bridges; Red and orange chromatin beads have strong and weak binding affinity only for PRC1 proteins, respectively; beads coloured with hues of purple denote binding sites for heterochromatin-associating proteins. Beads representing eu and hetero-chromatin can also have some (weak or strong) affinity to PRC1 proteins (resulting from the coarse graining procedure), and are therefore shown with different hues of colour. (**a**) All bead types are shown; (**b**) Shows heterochromatin-associating bridges (black) and their binding sites; (**c**) Shows PRC1 bridges (yellow) and their binding sites; (**d**) Represent active chromatin regions together with their associated protein bridges (magenta).

An important outcome of our work is that the folding dynamics of eukaryotic chromosomes *in silico* proceeds in a two-step fashion. First, local domains (TADs) form fast (Figs. 3-5 and SI Figs. S6-S7) and reproducibly (independent of the initial chromosomal configuration, Fig. 1). Later on, non-local contacts form more slowly, and their structure markedly depends on the initial chromosomal state (Figs. 1 and 3, 5). TADs have emerged as unifying feature of metazoan domains, but the mechanisms driving their folding are not clear. Our simulations suggest that interaction among chromatin bound proteins can play a significant role in driving the folding of TADs. Observation of good correlation between predicted chromatin topology with that of experimentally observed topology demonstrates that 1D binding landscape of chromatin proteins can be helpful in predicting 3D organization.

This is an important difference with respect to the previous work on the folding of *Drosophila* chromosome fragments (1 Mb in size) [7], where significant dependence of the initial condition at the local (intradomain) scale was shown. The large-scale Brownian dynamics simulations reported in this work suggest instead a broader robustness with respect to the initial chromosomal conformation (see also Suppl. Movies M1-M3 and M6-M7). In this previous study [7] effective interactions between chromatin beads were only considered, whereas here we directly model the proteins which mediate such interactions. Therefore we are able to study the self-assembly of chromatin associated proteins like polycomb subnuclear clusters and we can follow directly the consequences of changing the nuclear concentration of such proteins. Furthermore, because we model the chromosome and associated proteins for biologically realistic time-scales, we can directly probe the chromosomal dynamics; on the other hand, in a scaled-down model kinetic features can only be indirectly accessed as the dynamics of the slowest dynamical processes, *i.e*., the formation of non-local interactions with genomic regions outside the simulation domain, are not included.

In addition, while our results for the dynamics of long-range (non-local) contacts can be fitted with a time-scale of about several minutes (Fig. 3 and 5), the length of our simulation run-time allow us to reach a steady state, indicating that the chromosomes become trapped into very long-lived metastable states which can retain a memory of their conformational history. Our findings are therefore also consistent with the conjectured glassy dynamics of chromosomes [6, 45]. More generally, our results suggest that the folding into local TADs is essentially an equilibrium process, driven by the interaction between chromatin and chromatin-associating bridging factors, while the same does not apply to the pattern of non-local contacts, and inter-domain interactions, which both depend more strongly on the initial chromosomal state and on stochasticity, leaving a signature of glassy dynamics (Fig. 1). In other words, we find that inter-TAD interactions may depend on two main factors: (i) the chosen initial configuration, as one can notice from the full contact maps in Fig. 1, and (ii) the (stochastic) temporal sequence of long-range contacts which are established. The latter implies that different simulations may have different inter-TADs contacts as they follow independent trajectories in the phase space. This last observation is fully consistent with results obtained with single cell Hi-C, which displayed cell-to-cell variability due to intrinsic stochasticity [46]. Nonetheless, we find that local interactions, or intra-TADs structures, are largely unaffected by such factors and this strongly supports the importance and stability of bridging-induced attraction in establishing local topological structures in interphase chromosomes.

Our simulations of the two-step relaxation of chr3R starting from a mitotic-like state can be used to suggest a possible model for the complex rearrangements that chromosomes must undergo during the cell cycle. In particular, the simulations predict a dynamics during which chromosomes first establish local contacts upon binding of chromatin proteins which are released from chromatin as cells enter into mitosis [49], and then, later on, non-local contacts form. The parent cell and daughter cells have the same chromatin organization, which in turn regulates gene expression, but during cell division huge reorganization of chromatin takes place. Understanding the mechanism and kinetic pathway by which interphase contacts are re-established after cells exit from mitosis can provide important information in order to address how memory for cellular identity is maintained across cell generations [49]. Our simulations suggest that chromatin associated proteins play an important role in attaining the interphase organization after exiting mitosis.

In contrast to most of the simulations in the literature, as highlighted previously, we followed the dynamics of chromatin-binding proteins during chromosome folding. In our simulations, nuclear bodies made up of chromatin binding proteins arise naturally, through a generic positive feedback loop (also known as “bridging induced attraction” [28, 31, 34]). Notably, this feedback does not proceed indefinitely, and polycomb clusters only grow up to a steady state size in our simulations; the structural and kinetic features of these nuclear bodies also match very well with experimental observations. The size distribution of polycomb clusters in steady state *in silico* is close to that observed in *Drosophila* S2 cells by STORM (Fig. 6). Furthermore, a polycomb cluster can self-assemble in just 1-10 min in our simulations, which is similar to the assembly time found in vivo [50]. Strikingly, the self-assembly dynamics of the nuclear bodies also shows a two-step kinetics with power law growth, as well as arrested coarsening (see SI, Fig. S5).

Assumptions and predictions of our simulations appear reasonable as the final output correlates well with experimental observations. Overall good agreement is observed between the predicted and experimental contact maps (Fig. 1 and SI Fig. S2), which is even more enhanced when the full model is implemented (Fig. 2).

In addition, the comparison with the 4C-seq data (SI Fig. S3) suggests that the three types of bridges which we consider share some of their binding sites, so that they compete with each other to bind to these, and, as a result, knocking out some of the proteins does not necessarily open up the associated chromatin domain. Therefore by incorporation of further one-dimensional information (from ChIP-seq data) better and more robust predictions can be made about three-dimensional chromatin organization.

In conclusion, the current work provides a framework within which to model in a realistic way structural and kinetic features of the 3D organization of chromatin and chromatin-binding proteins. We expect this approach to be generalized in order to predict how the organization is affected when the chromatin-protein binding affinity landscape changes, *e.g*., during development or in diseases, mainly based on the ChIP-seq data of the key bridging factors used here. In addition, in light of the observations reported here and of the size of the *Drosophila* genome, we suggest that, with the aid of supercomputers, it is now theoretically feasible to follow, *in silico*, the folding dynamics of entire eukaryotic genomes on realistic time-scale. This represent a crucial milestone towards achieving a more comprehensive understanding of the key mechanism leading the 3D organization of the eukaryotic nucleus.

## Simulations Details

The entire chromosome 3R of *Drosophila melanogaster* was simulated as a finitely-extensible and semi-flexible [1] bead-spring co-polymer chain at a resolution of 30 nm or 3.1 kbp. The different type of beads composing the polymer represent the specific chromatin states as obtained from ChIP-seq tracks of *Drosophila* S2 type cells [8]. The persistence length of the chain is set to *l**p* = 3*σ* ≃ 10 kbp. The dynamics of the monomers is evolved by means of a Brownian Dynamics (BD) scheme. The friction imposed on each monomer and the stochastic term added by the Brownian Dynamics scheme take into account the presence of a surrounding thermal bath whose temperature is set to *T* = 300 K (see SI for further details). The Langevin equation is then integrated using the LAMMPS package in the canonical (NVT) ensemble. The systems were simulated in dilute regime (volume fraction *ρ* = *Nπσ*3/6*Vol* ≃ 0.05%) and confined in a box to avoid self-interactions through the periodic boundaries. The binding proteins are modelled as spheres with diameter *σ* = 30 nm; these interact sterically with other proteins and the chromatin via a cut-and-shift LJ potential (WCA) allowing for an attraction region between specific chromatin states and corresponding binding proteins. The affinity of the binding proteins to specific chromatin *loci* is varied to probe the robustness of the results (see SI for details). The initial configurations we considered are the most addressed in the literature and range from the self-avoiding random walk to a fractal globule state to model a random interphase conformation and from an elongated (persistent) random walk to a collapsed stack of rosettes to model possible post-mitotic situations (see SI for further details).

## SUPPLEMENTARY MATERIAL

### SIMULATION DETAILS

The finitely-extensible worm-like chain is modelled via the Kremer-Grest model [1] as follows: Let *i* and *d**i,j* ≡ *r**j* – *r**i* be respectively the position of the center of the *i*-th bead and the vector of length *d**i,j* between beads *i* and *j*, the connectivity of the chain is treated within the finitely extensible non-linear elastic model with potential energy, ![Formula][1]</img> for *d**i*,*i*+1 < *R*** and *U**FENE*(*i*, *i* + 1) = ∞, otherwise; here we chose *R* = 1.6 *σ* and *k* = 30 *ϵ*/*σ*2. The bending rigidity of the chain is captured with a standard Kratky-Porod potential, ![Formula][2]</img> where we set the persistence length *l**p* = 3*σ* in order to reproduce the observed stiffness of chromatin, *i.e. l**p* ≃ 100 nm.

The steric interaction between monomer *a* and monomer *b* (of sizes *σ**a* and *σ**b*, respectively), representing either a chromatin region, or bridge proteins is taken into account via a truncated and shifted Lennard-Jones (or WCA) potential ![Formula][3]</img> and 0 otherwise and where *σ**c* = (*σ**a* ‫ *σ**b*)/2 is the minimum distance between beads *a* and *b* and *r**c* is the chosen cut off. This parameter is set to *r**c* = 21/6*σ* in order to model purely repulsive interactions and instead set to *r**c* = 1.8*σ* to include attractive interactions. In both cases, the potential is shifted to zero at the cut-off in order to have a smooth curve and avoid singularities in the forces. Purely repulsive interactions, such as those between proteins, have *ϵ**ab* = 1, while attractive interactions such as those between PRC1 proteins and Polycomb binding sites have been varied to study the robustness of the results. The range of parameters are reported in Table S1.

In order to avoid topologically frustrated configurations we allow the beads forming the chain the pass through themselves with a small probability. This is regulated by a soft potential of the form ![Formula][4]</img> with *A* = 20*k**B**T* being the maximum height of the potential barrier, *i.e*., crossing probability *p* = exp –20.

Denoting by *U* the total potential energy, the dynamic of the beads forming the polymer is described by the following Langevin equation: ![Formula][5]</img> where *ξ* is the friction coefficient and *η* is the stochastic delta-correlated noise. The variance of each Cartesian component of the noise, ![Graphic][6]</img> satisfies the usual fluctuation dissipation relationship ![Graphic][7]</img>

As customary [1] we set *m*/*ξ* = *τ**LJ* = *τ**Br*, with the LJ time ![Graphic][8]</img> and the Brownian time *τ**Br* = *σ*/*D**b* is chosen as simulation time step. Here, *D**b* = *k**B* *T/ξ* is the diffusion coefficient of a bead of size *σ*. From the Stokes friction coefficient of spherical beads of diameters *σ* we have: *ξ* = *3πη**sol**σ* where *η*sol** is the solution viscosity. One can map for th ij to Teal-time units by using the viscosity of nucleoplasm, which ranges between 10 – 100 cP, and by setting *T* = 300 K and *σ* = 30 nm, as previously defined. From this it follows that *τ**LJ* = *τ**Br* = 3*πη**sol**σ*3/*ϵ* ≃ 0.6 – 6 ms. The numerical integration of Eq. (A.2) is performed by using a standard velocity-Verlet algorithm with time step Δt = 0.01*τ**Br* and is implemented in the LAMMPS engine. We perform simulations up to 2 105 *τ**Br* which correspond to 2-20 minutes in real time. The thermal energy *k**B**T* is set to *ϵ*, the energy scale of the WCA potential for the purely repulsive interactions.

### Modelling the Affinities between Proteins and Chromatin

For the “polycomb-only” model we only consider protein PRC1 and PcG binding sites; this means that the polymer is made up of beads that can be one of three types: (i) non interacting, (ii) strongly binding to PRC1 (*ϵ*1) or (iii) weakly binding to PRC1 (*ϵ*2). The polymer is then initialised in a box together with a random distribution of PRC1 proteins, here modelled as spheres of diameter *σ* which can then freely interact to the polymer. In the full model the beads can be in one of the 12 possible states which are made by considering that a bead is a coarse-grained section of chromatin (3.1 kbp) and can therefore encompass more than one chromatin state. If the two states are not mutually exclusive (such as euchromatin and heterochromatin for which only the more abundant state is chosen as the marking type for that bead) then a bead can have have an affinity for both, PRC1 and HP1 or PRC1 and TF. For the first case, one can choose between the 3 combinations of affinities (*ϵ*1, *ϵ*2 or no affinity) for PRC1 and one for the heterochromatin *ϵ*3. For the second case, one has to combine three PRC1 affinities and two TF affinities (*ϵ*4,*ϵ*5)(see Table T1). In total we therefore have 9 mixed types, one neutral and 2 PRC1-only affine. Alongside, three proteins identifying generic binding proteins (PRC1, HP1 and TF) complete the system.

### Pre-Equilibration

In order to avoid numerical blow-up and unwanted metastable states dominated by entangled (knotted) configurations, we model the interaction between beads forming the chromatin via a stiff soft potential, *i.e*. ![Formula][9]</img> with *A*pre(*t*) a ramp function increasing from 0 to 20 *k**B**T* during the pre-equilibration time. This potential allows for frequent strand-crossing events at the beginning of the pre-equilibration and progressively forbids them in time. This aims to reproduce the presence of topological enzymes such as topoisomerases, which can un-entangle knotted chromatin conformations, especially close the mitotic state. The soft potential (with *A*pre(*t*) = *A* = 20*k**B**T*) is also employed later on during the simulations to allow rare strand-crossings due to particularly entangled and frustrated configurations and mimics the presence (although at smaller concentrations) of Topo II.

### INITIAL CONFIGURATIONS

In this work we four different classes of initial conditions: (i) Self-avoiding Random Walk (RW), (ii) Stretched (S), (iii) Fractal globule (FG) and (iv) Mitotic (M). The RW configuration of a *N*-beads long polymer is generated by standard means in which every bond connecting two beads is picked in a random direction, uncorrelated with respect to the previous bond. This is then turned into a self-avoiding polymer by performing a short run in which the beads are subject to the soft potential of eq. (A.3). The stretched configuration is generated by biasing the RW algorithm in order to display a larger persistence length along the z direction and by performing the short soft repulsion run. The FG configuration is generated via a quick collapse of a RW configuration. This is done by extending the cut-off of the WCA potential describing the interaction between the polymer beads. In this way we induce a fast non-equilibrium collapse. A confining sphere is also used to drive the symmetric collapse of the polymer which would, otherwise, display segregated (dumbbell-like) sections. The mitotic conformation has been modelled by using the equations reported in Ref. [6]. The following generalised helix describes a stack of rosettes of length *l**r* = 200*σ* ≃ 600 kpb with 12 “leaves” each, meaning that each loop forming a rosette is around 50 kbp long. ![Formula][10]</img> where *f* × *r**chr* ≃ 0.38 × 19*σ* ≃ 7.22*σ* ≃ 200 nm is the radius of the mitotic cylinder, *p* is the vertical step for each full turn and *k* = 6 in order to get 12 leaves per rosette. This means that 9002 beads, forming a polymer with contour length *L**c* ≃ 9000*σ* ≃ 27.9 Mbp, are laid into planes of 200 beads each and with separation of *σ* between planes, leading to a cylinder of height *h* ≃ 1.3 μm.

Interestingly, we observed that the local folding dynamics is largely independent on the chosen initial configuration. An example is reported in Fig. S2 where we compare a PcG topologically associated domain folding on sub-megabase scales and consistently recovered for several choices of initial conditions and well matching with the experimentally observed structure. On the other hand, we observed that the non-local, long range, folding is instead affected by the choice of initial condition. This can be seen from the full contact maps reported in Fig. 1 of the main text.

### OBSERVABLES

The most important observables that we keep track of are the contact maps, radius of gyration and clustering of PcG proteins. These quantities are reported in main text and SI, and if not otherwise specified, they are produced by averaging over 6 independent replicas. The contact maps are obtained using a cut-off of 3*σ* for 3D neighbouring beads. The radius of gyration computed as ![Formula][11]</img>

The clustering analysis is performed setting a cut-off distance *d**c* of 2*σ*, for which two beads are in the same cluster only if their distance is smaller than *d**c*. All other observables, can be obtained by further processing these quantities.

### ROBUSTNESS OF INTERACTION PARAMETERS

The robustness of the results is probed by repeating the simulations for different values of *ϵ*1 and *ϵ*2. We performed the test only for the “polycomb-only” case. We observed that the cluster distribution is consistent with experiment (see Fig. 6 in main text), *i.e*. it shows a peak at cluster diameters at *d**C* *≃* 120 nm, for a rather spread region of the interaction parameters *ϵ*1 and *ϵ*2. We explored the ranges *ϵ*1 ~ 6 – 10 and *ϵ*2 ~ 2 – 5. In all the cases we could observe the formation of clusters with diameters ranging between 100–150 nm. Nonetheless, the nature of the clusters changed when the difference between *ϵ*1 and *ϵ*2 was in the region of few *k**B**T*s or *ϵ*2 was larger than 4*k**B**T*. In these regimes the clusters could be seen to contain a larger fraction of weakly polymerising polycomb binding sites. At the same time, the contact probability of monomers *s* segments apart would show a power law behaviour strongly deviating from s‒1. This is reported for one case where *ϵ*1 = 10 *k**B**T* and *ϵ*2 = 5 *k**B**T* in Fig. S8.

### SIMULATED 4C EXPERIMENTS

In order to compare our model with experiments we also perform a simulated 4C experiment. By placing a “bait” at 12.77 Mbp, we can record the contact profile along the rest of the chromosome (Fig. S3). Interestingly, the forward and backward profile are not symmetric with respect to the bait. In particular, the backward profile is generally higher than the forward one, indicating that the location of the bait is at the end of a TAD. In addition, we observe that both, “PcG-only” and “full”, models capture the general behaviour of the profile, although the latter is closer to the experimentally probed intensity. This can be due to the fact that considering more proteins types and possible binding sites creates a competing effect that lowers and redistributes the chromosome contacts.

### COARSENING

Another important aspect that has been addressed in the literature [7] is the problem of the stability of the observed TADs. Understanding whether the domains are destined to coalesce at long enough times or whether they have reached a steady state is of paramount importance in both, experiments and simulations and to understand the nature of these domains. In order to investigate this aspect we have looked at the evolution of the cluster sizes and of the contact maps for various initial conditions during the course of simulations. The simulations performed can be mapped to up to 2 or 20 minutes (depending on the viscosity *η*sol ≃ 10 – 100 cP) in real time and can therefore shed some light onto the longtime dynamics and evolution of the clusters. We computed the average size of the clusters of PH proteins by taking the average number of proteins belonging to a cluster and by assuming a spherical shape for the clusters, obtained the average diameter as *d* = 〈*n*〉1/3*σ* (we also assume the proteins to occupy a spherical volume of radius *σ*/2). We observed that the clusters show a slowing down at large times, in that their average size scales very weakly with time (Fig. S5). In addition, the coarsening dynamics is never observed to scale faster than *t*0.2 at large time. In Fig. S5 we report the cluster size count as a function of time for the “polycomb-only” (a-f) and the “full” (g-i) model. In Fig. S6 we also report several contact maps at different simulation time-steps for the “full” model and starting from the mitotic state (see above for the mitotic initialisation and Suppl. Movies M4-M6). Fig. S6 and S7 show that the structures obtained are long lived and seem to have reached a steady state. In the main text (Fig. 8) we also show the evolution of a shorter segment between 23 and 24.5 Mbp from the initial (RW). This again displays a remarkable long-lived structure. These figures (S4 – S7) strongly suggest that the final structures that we obtain are truly stable and long lived. Whether their stability is kinetic or thermodynamic in nature is an open problem that we aim to address in the future as it has particular relevance towards the understanding of the 3D organisation of *in vivo* chromatin, where the environment is strongly driven out-of-equilibrium.

## ACKNOWLEDGEMENTS

DMa and DMi acknowledge support from the ERC (Consolidator Grant THREEDCELLPHYSICS, Ref. 648050). AHW is a Ramanujan fellow, awarded by Department of Science and Technology, India.

*   Received April 11, 2016.
*   Accepted April 12, 2016.


*   © 2016, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/)

## References

1.  [1].Lieberman-Aiden, E., van Berkum, N.L., Williams, L., Imakaev, M., Ragoczy, T., Telling, A., Amit, I., Lajoie, B.R., Sabo, P.J., Dorschner, M.O., et al. (2009). Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326:289–293.
    
    [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzMjYvNTk1MC8yODkiO3M6NDoiYXRvbSI7czozNzoiL2Jpb3J4aXYvZWFybHkvMjAxNi8wNC8xMi8wNDgxMTYuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

2.  [2].Sexton, T., Yaffe, E., Kenigsberg, E., Bantignies, F., Leblanc, B., Hoichman, M., Parrinello, H., Tanay, A., Cavalli, G. (2012). Three-dimensional folding and functional organization principles of the Drosophila genome. Cell 148:458–472.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2012.01.010&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=22265598&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000300225000015&link_type=ISI) 

3.  [3].Dixon, J.R., Selvaraj, S., Yue, F., Kim, A., Li, Y., Shen, Y., et al. (2012). Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485:376–380.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nature11082&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=22495300&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000304099100044&link_type=ISI) 

4.  [4].Rao, S.S.P., Huntley, M.H., Durand, N.C., Stamenova, E.K., Bochkov, I.D., Robinson, J.T., et al. (2014). 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159:1–16.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2014.09.015&link_type=DOI) 

5.  [5].Cremer, T. Cremer, C. (2001) Chromosome territories, nuclear architecture and gene regulation in mammalian cells. Nat. Rev. Gen. 2:292–301.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/35066075&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=11283701&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000167837900014&link_type=ISI) 

6.  [6].Bickmore, W. A. and van Steensel, B. (2013). Genome Architecture: Domain Organization of Interphase Chromosomes. Cell 152:1270–1284.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2013.02.001&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=23498936&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000316192500009&link_type=ISI) 

7.  [7].Cavalli, G. Misteli, T. (2013). Functional implications of genome topology. Nat. Struct. Mol. Biol. 20:290–299.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nsmb.2474&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=23463314&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 

8.  [8].Zane, L., Sharma, V. Misteli, T. (2014). Common features of chromatin in aging and cancer: cause or coincidence? Trends Cell Biol. 24:686–694.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.tcb.2014.07.001&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25103681&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 

9.  [9].Wani, A. H., Boettiger, A. N., Schorderet, P., Ergun, A., Munger, C., Sadreyev, R. I., Zhuang, X., Kingston, R. E. Francis, N. J. (2016). Chromatin topology is coupled to Polycomb group protein subnuclear organization. Nat. Comm. 7, 10291.
    
    
10. [10].Seitan, V. C. et al. (2013) Cohesin-based chromatin interactions enable regulated gene expression within preexisting architectural compartments. Gen. Res. 23: 2066–2077.
    
    [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiZ2Vub21lIjtzOjU6InJlc2lkIjtzOjEwOiIyMy8xMi8yMDY2IjtzOjQ6ImF0b20iO3M6Mzc6Ii9iaW9yeGl2L2Vhcmx5LzIwMTYvMDQvMTIvMDQ4MTE2LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

11. [11].Sleeman, J.E., and Trinkle-Mulcahy, L. (2014). Nuclear bodies: new insights into assembly/dynamics and disease relevance. Curr. Opin. Cell Biol. 28:76–83.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.ceb.2014.03.004&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=24704702&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 

12. [12].Papantonis, A., Cook, P.R. (2013). Transcription factories; genome organization and gene regulation. Chem. Rev. 113:8683–8705.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1021/cr300513p&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=23597155&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000327103200017&link_type=ISI) 

13. [13].Kritikou, E. (2005) Cytogenetics: Exploring chromatin architecture by FISHing in 3D. Nat. Rev. Gen. 6:429.
    
    
14. [14].Spector, D. L. (2004) Stopping for FISH and Chips along the Chromatin Fiber Superhighway. Mol. Cell. 15:844–846.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.molcel.2004.09.013&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=15383273&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 

15. [15].Boettiger AN, et al. (2016) Chromatin Folding for Different Epigenetic States. Nature 529(7586):115.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/529115a&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=26738597&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 

16. [16].Blobel GA, et al. (2009) A Reconfigured Pattern of MLL Occupancy within Mitotic Chromatin Promotes Rapid Transcriptional Reactivation Following Mitotic Exit. Mol Cell 36(6):970–983.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.molcel.2009.12.001&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=20064463&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000273148300010&link_type=ISI) 

17. [17].Fanti L, et al. (2008) The trithorax group and Pc group proteins are differentially involved in heterochromatin formation in Drosophila. Chromosoma 117(1):2539.
    
    
18. [18].Zhao B, Tumaneng K, Guan KL (2011) The Hippo pathway in organ size control, tissue regeneration and stem cell self-renewal. Nat Cell Biol 13(8):877–883.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ncb2303&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=21808241&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000293373700003&link_type=ISI) 

19. [19].Buchenau P, Hodgson J, Strutt H, Arndt-jovin DJ (1998) The Distribution of Polycomb-Group Proteins During Cell Division and Developmentin. Cell 141(2):469–481.
    
    
20. [20].Xing H, Vanderford NL, Sarge KD (2008) The TBP-PP2A mitotic complex bookmarks genes by preventing condensin action. Nat Cell Biol 10(11):1318–1323.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ncb1790&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=18931662&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000260586700014&link_type=ISI) 

21. [21].Fonseca JP, et al. (2012) In vivo Polycomb kinetics and mitotic chromatin binding distinguish stem cells from differentiated cells. Genes Dev 26(8):857–871.
    
    [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiZ2VuZXNkZXYiO3M6NToicmVzaWQiO3M6ODoiMjYvOC84NTciO3M6NDoiYXRvbSI7czozNzoiL2Jpb3J4aXYvZWFybHkvMjAxNi8wNC8xMi8wNDgxMTYuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

22. [22].Follmer NE, Wani AH, Francis NJ (2012) A Polycomb Group Protein Is Retained at Specific Sites on Chromatin in Mitosis. PLoS Genet 8(12).
    
    
23. [23].Steffen P A, Ringrose L (2014) What are memories made of? How Polycomb and Trithorax proteins mediate epigenetic memory. Nat Rev Mol Cell Biol. 15(5):34056.
    
    
24. [24].Nora, E.P., Lajoie, B.R., Schulz, E.G., Giorgetti, L., Okamoto, I., Servant, N., Piolot, T., van Berkum, N.L., Meisig, J., Sedat, J., Gribnau, J., Barillot, E., Blthgen, N., Dekker, J., Heard, E. (2012). Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature 485:381–385.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nature11049&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=22495304&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000304099100045&link_type=ISI) 

25. [25].Baù, D., and Marti-Renom, M.A. (2012). Genome structure determination via 3C-based data integration by the Integrative Modeling Platform. Methods 58:300–306.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.ymeth.2012.04.004&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=22522224&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000312360000012&link_type=ISI) 

26. [26].Zhang, B. Wolynes, P. G. (2015). Topology, structure and energy landscapes of human chromosomes. Proc. Natl. Acad. Sci. USA 112:6062–6067.
    
    
27. [27].Barbieri, M., Chotalia, M., Fraser, J., Lavitas, L.M., Dostie, J., Pombo, A., Nicodemi, M. (2012). Complexity of chromatin folding is captured by the strings and binders switch model. Proc. Natl. Acad. Sci. USA 109:16173–16178.
    
    
28. [28].Brackley, C.A, Taylor, S., Papantonis, A., Cook, P.R., and Maren-duzzo, D. (2013). Nonspecific bridging-induced attraction drives clustering of DNA-binding proteins and genome organization. Proc. Natl. Acad. Sci. USA 110:E3605–3611.
    
    
29. [29].Jost, D., Carrivain, P., Cavalli, G. Vaillant, C. (2014) Modeling epigenome folding: formation and dynamics of topologically associated chromatin domains. Nucl. Ac. Res. 42:9553–9561.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1093/nar/gku698&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25092923&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 

30. [30].Benedetti F., Dorier J., Burnier Y., and Stasiak A. (2014). Models that include supercoiling of topological domains reproduce several known features of interphase chromosomes. Nucleic Acids Res 42:2848–2855.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1093/nar/gkt1353&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=24366878&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000333093600012&link_type=ISI) 

31. [31].Brackley, C.A, Johnson, J., Kelly, S., Cook, P.R., Marenduzzo, D. (2015). Binding of bivalent transcription factors to active and inactive regions folds human chromosomes into loops, rosettes and domains, arXiv:1511.01848.
    
    
32. [32].Brackley C.A., Brown J.M., Waithe D., Babbs C., Davies J., Hughes J.R., Buckle V.J., and Marenduzzo D. (2016). Predicting the three-dimensional folding of cis-regulatory regions in mammalian genomes using bioinformatic data and polymer models. Genome Biol. 17:59
    
    
33. [33].Nicodemi, M. Prisco, A. (2009). Thermodynamic Pathways to Genome Spatial Organization in the Cell Nucleus. Biophys. J. 96:2168–2177.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.bpj.2008.12.3919&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=19289043&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000266376700013&link_type=ISI) 

34. [34].Johnson, J., Brackley, C.A., Cook, P.R., Marenduzzo, D. (2015). A simple model for DNA bridging proteins and bacterial or human genomes: bridging-induced attraction and genome compaction. J. Phys. Condens. Matter 27:064119.
    
    
35. [35].Cheng TM, et al. (2015) A simple biophysical model emulates budding yeast chromosome condensation. Elife 4:122.
    
    
36. [36].Beisel, C., Paro, R. (2011). Silencing chromatin: comparing modes and mechanisms. Nat. Rev. 12:123–135.
    
    
37. [37].Kilic, S., Bachmann, A.L., Bryan, L.C., Fierz, B. (2015). Multivalency governs HP1a association dynamics with the silent chromatin state. Nat. Commun. 6:7313.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ncomms8313&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=26084584&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 

38. [38].Filion, G. J., van Bemmel, J. G., Braunschweig, U., Talhout, W., Kind, J., Ward, L. D., Brugman, W., de Castro, I. J., Kerkhoven, R. M., Bussemaker, H. J. and van Steensel, B. (2010). Systematic Protein Location Mapping Reveals Five Principal Chromatin Types in Drosophila Cells. Cell 143, 212–224.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2010.09.009&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=20888037&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000283052200011&link_type=ISI) 

39. [39].Kharchenko, P. V. et al. (2011). Comprehensive analysis of the chromatin landscape in Drosophila melanogaster. Nature 471:480–485.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nature09725&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=21179089&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000288702200055&link_type=ISI) 

40. [40].Mirny, L. A. (2011). The fractal globule as a model of chromatin architecture in the cell. Chromosome Res. 19:37–51.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1007/s10577-010-9177-0&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=21274616&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000289443700004&link_type=ISI) 

41. [41].Sanborn, A. L. et al. (2015). Chromatin extrusion explains key features of loop and domain formation in wild-type and engineered genomes. Proc. Natl. Acad. Sci. USA early edition, doi: 10.1073/pnas.1518552112.
    
    [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMjoiMTEyLzQ3L0U2NDU2IjtzOjQ6ImF0b20iO3M6Mzc6Ii9iaW9yeGl2L2Vhcmx5LzIwMTYvMDQvMTIvMDQ4MTE2LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

42. [42].Francis NJ (2004) Chromatin Compaction by a Polycomb Group Protein Complex. Science(80) 306(5701):1574–1577.
    
    [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEzOiIzMDYvNTcwMS8xNTc0IjtzOjQ6ImF0b20iO3M6Mzc6Ii9iaW9yeGl2L2Vhcmx5LzIwMTYvMDQvMTIvMDQ4MTE2LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

43. [43].Vandenbunder, B., Fourré, N., Leray, A., Mueller, F., Völkel, P., Angrand, P. O. Héliot, L. (2014). PRC1 components exhibit different binding kinetics in Polycomb bodies. Biol. Cell. 106:111–125.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1111/boc.201300077&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=24460908&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 

44. [44].Rosa, A., Everaers, R. (2008). Structure and dynamics of interphase chromosomes. PLoS Comput Biol 4:e1000153.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1371/journal.pcbi.1000153&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=18725929&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 

45. [45].Kang, H., Young-Gui Yoon, Y.-G., Thirumalai, D., Hyeon, C. (2015). Confinement-Induced Glassy Dynamics in a Model for Chromosome Organization. Phys. Rev. Lett. 115:198102.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1103/PhysRevLett.115.198102&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=26588418&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 

46. [46].Nagano, T., Lubling, Y., Stevens, T.J., Schoenfelder, S., Yaffe, E., Dean, W., et al. (2013). Single-cell Hi-C reveals cell-to-cell variability in chromosome structure. Nature 502:59–64.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nature12593&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=24067610&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000325106000030&link_type=ISI) 

47. [47].Naumova, N., Imakaev, M., Fudenberg, G., Zhan, Y., Lajoie, B. R., Mirny L. A. Dekker, J. (2013). Organization of the Mitotic Chromosome. Science 342, 948–953.
    
    [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIzNDIvNjE2MS85NDgiO3M6NDoiYXRvbSI7czozNzoiL2Jpb3J4aXYvZWFybHkvMjAxNi8wNC8xMi8wNDgxMTYuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

48. [48].Chaikin, P. M. Lubensky, T. C. (1995). Principles of Condensed Matter Physics, Cambridge University Press, Cambridge.
    
    
49. [49].Alberts B., Johnson A., Lewis J., Morgan D., Raff M. (2014) Molecular Biology of the Cell, Taylor & Francis.
    
    
50. [50].Brand, P., Lenser, T. Hemmerich, P. (2010). Assembly dynamics of PML nuclear bodies in living cells. PMC Biophysics 3:3.
    
    
51. [51].Kremer K, Grest GS (1990) Dynamics of entangled linear polymer melts: A molecular-dynamics simulation. J. Chem. Phys. 92(8):5057.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1063/1.458541&link_type=DOI) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=A1990DC48400045&link_type=ISI) 

## References

1.  [1].Kremer K, Grest GS (1990) Dynamics of entangled linear polymer melts: A molecular-dynamics simulation. J Chem Phys 92(8):5057.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1063/1.458541&link_type=DOI) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=A1990DC48400045&link_type=ISI) 

2.  [2].Sexton T, et al. (2012) Three-dimensional folding and functional organization principles of the Drosophila genome. Cell 148(3):458-472.
    
    
3.  [3].Wani, A. H., Boettiger, A. N., Schorderet, P., Ergun, A., Munger, C., Sadreyev, R. I., Zhuang, X., Kingston, R. E. Francis, N. J. (2016) . Chromatin topology is coupled to Polycomb group protein subnuclear organization. Nat. Comm. 7, 10291.
    
    
4.  [4].Filion, G. J., van Bemmel, J. G., Braunschweig, U., Talhout, W., Kind, J., Ward, L. D., Brugman, W., de Castro, I. J., Kerkhoven, R. M., Bussemaker, H. J. and van Steensel, B. (2010). Systematic Protein Location Mapping Reveals Five Principal Chromatin Types in Drosophila Cells. Cell 143, 212–224.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2010.09.009&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=20888037&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000283052200011&link_type=ISI) 

5.  [5].Kharchenko, P. V. et al. (2011). Comprehensive analysis of the chromatin landscape in Drosophila melanogaster. Nature 471:480–485.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nature09725&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=21179089&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000288702200055&link_type=ISI) 

6.  [6].Rosa, A., Everaers, R. (2008). Structure and dynamics of interphase chromosomes. PLoS Comput Biol 4:e1000153.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1371/journal.pcbi.1000153&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=18725929&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 

7.  [7].Jost, D., Carrivain, P., Cavalli, G. Vaillant, C. (2014) Modeling epigenome folding: formation and dynamics of topologically associated chromatin domains. Nucl. Ac. Res. 42:9553–9561.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1093/nar/gku698&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25092923&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 

8.  [8].Sexton, T., Yaffe, E., Kenigsberg, E., Bantignies, F., Leblanc, B., Hoichman, M., Parrinello, H., Tanay, A., Cavalli, G. (2012). Three-dimensional folding and functional organization principles of the Drosophila genome. Cell 148:458–472.
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.cell.2012.01.010&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=22265598&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2016%2F04%2F12%2F048116.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000300225000015&link_type=ISI)

 [1]: /embed/graphic-9.gif
 [2]: /embed/graphic-10.gif
 [3]: /embed/graphic-11.gif
 [4]: /embed/graphic-12.gif
 [5]: /embed/graphic-13.gif
 [6]: /embed/inline-graphic-1.gif
 [7]: /embed/inline-graphic-2.gif
 [8]: /embed/inline-graphic-3.gif
 [9]: /embed/graphic-14.gif
 [10]: /embed/graphic-15.gif
 [11]: /embed/graphic-16.gif