Stochastic gene expression is optimized to drive developmental self-organization

Ritika Giri; Dimitrios K. Papadopoulos; Diana M. Posadas; Hemanth K. Potluri; Pavel Tomancak; Madhav Mani; Richard W. Carthew

doi:10.1101/546911

Summary

Self-organization of cells into tissue patterns is a design principle in developmental biology to create order from disorder. However, order must emerge from biochemical processes within and between cells that are stochastic. Here, we measure noise in expression of the Drosophila senseless gene, a key determinant of sensory cell fate. We show that translation and transcription of senseless produce distinct signatures of protein noise. Repression of senseless by a microRNA uniformly decreases both protein abundance and noise in cells, but does so without affecting the fidelity of self-organization. In contrast, the genomic location of senseless affects protein noise without affecting protein abundance. When noise is greater, tissue patterning is significantly disordered. This suggests that gene expression stochasticity, independent of expression level, is a critical feature that must be constrained during development to allow cells to efficiently and accurately self-organize.

Stochasticity is a fundamental feature of all molecular interactions. During gene expression, stochasticity leads to fluctuations in the number of protein molecules in a cell¹ These fluctuations are perhaps especially relevant during developmental transitions, when cells adopt divergent fates based on the protein abundance of a fate determinant. While probabilistic fate adoption has been observed in rare instances^2–4, generally, fate transitions are thought to be virtually deterministic due to cell lineage and proximity to inductive signals. Therefore, it is unclear how extensively protein fluctuations impinge upon the vast majority of developmental decisions. Given that the emergence of order from disorder is a hallmark of development, how are highly-ordered patterns rendered immune to the underlying stochasticity of gene expression? We explore this problem by focusing on the patterning of sensory bristles in Drosophila.

Sensory organs are often arranged in highly-ordered assemblies, as a means to predictably map environmental stimuli to neural circuitry⁵. Paradoxically, Drosophila sensory bristle development requires expression heterogeneity between progenitor cells to initiate the self-organizing process of pattern formation^6,7 We have focused on sensory bristles located at the anterior margin of the wing, where diffusing Wingless (Wg) molecules induce the expression of senseless (sens) in stripes of cells. This imbues these cells with a bistable fate potential (Fig. 1a)⁸. Cells then either upregulate sens expression and adopt a sensory organ (S) fate or downregulate sens and adopt an epidermal (E) fate^9,10 Cells in each stripe compete with one another to adopt an S fate, which is driven by lateral inhibition of sens expression via Notch signaling⁷ Sustained expression of the Sens transcription factor is sufficient to drive a cell towards an S fate, after which the cell develops into an adult bristle (Fig. 1b)⁹. This key role of Sens¹¹ makes it a logical candidate to study how quantitative changes in gene expression noise impact the final ordered outcome.

Figure 1 Measuring in vivo gene expression stochasticity during sensory organ fate selection.

a, Sensory organ (S) fate selection is a bistable system driven by pro-neural Wingless (Wg) and anti-neural Notch (N) signals to regulate sens expression. When cells are in an unstable state of intermediate expression, they switch to either low stable expression (E fate) or high stable expression (S fate). b, Sens expression marks the resolving proneural field along the Drosophila wing margin. The stripes of unstable intermediate cells refine into a robust periodic pattern of S cells, which can be seen emerging in the micrograph, and E cells, which will emerge from the unstable intermediates interspersed between S cells. This generates the bristle pattern along the adult wing margin. c, Gene expression output is inherently variable due to stochastic synthesis and decay of mRNA and protein molecules. Therefore, single cell protein counts fluctuate stochastically from the deterministic expectation. The magnitude of these fluctuations is determined by the rate constants of individual steps (in blue). d, Stochasticity can be measured by tagging the two alleles of a gene with distinct fluorescent proteins and measuring fluorescence correlation in individual cells. Perfect correlation would indicate no stochastic effects i.e. deterministic behavior. e, A genomic fragment containing sens was N-terminally tagged with either single sfGFP or mCherry tags; or with a tandem sfGFP-mCherry fluorescent tag. These were used to rescue sens null animals by site-specific insertion into genomic location 22A3. f, Single-cell mCherry and sfGFP protein was measured in singly tagged sfGFP-sens/mCherry-sens wing cells. Tandem tagged sfGFP-mCherry-sens wing cells serve as a technical control for non-biological sources of stochasticity.

Measuring stochasticity in sens expression

Protein fluctuations can be observed by counting molecules in single cells over time (Fig. 1c)¹². Alternatively, noise can be estimated by tagging the two alleles of a gene with distinct fluorescent proteins and measuring fluorescence correlation in individual cells (Fig. 1d)^1,13. We modified a 19.2 kb fragment of the Drosophila genome containing sens by singly inserting either superfolder GFP (sfGFP) or mCherry into the amino terminus of the sens ORF¹⁴. These transgenes were precisely landed into the 22A3 locus on the second chromosome, a standard landing site for transgenes (Fig. 1e). Endogenous sens was eliminated using protein-null alleles^9,10,15. The transgenes completely rescued all detectable sens mutant phenotypes and exhibited normal expression (Extended Data Fig. 1). We then mated singly-tagged sfGFP-sens with mCherry-sens animals to generate heterozygous progeny. Wing imaginal discs of these offspring were fixed and imaged by confocal microscopy (Extended Data Fig. 2). Cells were computationally segmented in order to measure intensity of sfGFP and mCherry fluorescence within each cell (Extended Data Fig. 3). Fluorescence values were then converted into absolute numbers of Sens protein molecules. This was made possible by using Fluorescence Correlation Spectroscopy (FCS) to measure the concentrations of sfGFP-Sens and mCherry-Sens protein in live wing discs and deriving a conversion factor from these measurements (Extended Data Fig. 4).

Wing disc cells displayed a wide range of Sens protein expression (Fig. 1f). This was expected since sens transcription is regulated by Wg and Notch signaling^8,9 Although both sfGFP and mCherry tagged alleles contributed equally to total Sens output on average (Extended Data Fig. 4a), individual cells showed significant differences between sfGFP and mCherry fluorescence (Fig.1f). This was due to two independent sources of noise: 1) stochastic gene expression, 2) stochastic processes linked to imaging and analysis. The latter arises from differential fluorescent maturation kinetics, probabilistic photon emission and detection, as well as image analysis errors. To estimate this technical noise, we constructed a third transgene containing both sfGFP and mCherry fused in tandem to the sens ORF (Fig. 1e). sfGFP-mCherry-sens was inserted at locus 22A3, and fluorescence was measured in disc cells from such animals. Since sfGFP and mCherry molecule numbers should be perfectly correlated in vivo when expressed as a tandem-tagged protein, we attributed any decrease in fluorescence correlation to technical noise (Fig. 1f).

Previous studies using dissociated cells have shown that protein output from gene expression is Gamma-distributed^1,16, such that protein noise (expressed as the Fano factor, i.e. variance divided by mean) remains constant as protein output varies. We calculated the Fano factor as a function of Sens protein output in cells expressing either singly-tagged Sens or tandem-tagged Sens (Fig. 2a). To estimate the Fano factor due to stochasticity of Sens expression, we subtracted out the technical contribution as measured in tandem-tagged cells. Contrary to expectation, the Fano factor for Sens expression displayed a complex relationship to protein output, with a peak in cells containing < 300 molecules (Fig. 2b).

Figure 2 Sens protein expression displays signatures of transcriptional bursting.

a, The Sens Fano factor as measured in bins of wing disc cells expressing either tandem-tagged Sens or the singly-tagged allelic pairs of Sens. b, The Fano factor of Sens expression was calculated by subtracting out the Fano factor from tandem-tagged cells. For a and b, shading demarcates 95% confidence intervals. c, Simulations of a gene expression model with a constitutively-active promoter produces a protein Fano factor that is invariant with protein output, like a Gamma process. Transcription rate S_m is being varied from 0.1 to 1 mRNA/min to generate graded Sens expression. d, Simulations of a gene expression model with a bursting promoter with distinct on and off states, and independent rate constants for state conversion. If the rate constant k_on is varied as shown, the simulated protein Fano factor exhibits a biphasic profile as a function of protein output. Above a threshold of protein output, the Fano factor becomes invariant. This complex behavior is observed experimentally in b. For c and d, error bars represent 95% confidence intervals.

Numerical modeling of sens gene expression

To understand the origins of this profile, we created a mathematical model of gene expression using rate parameters that we measured for sens in the wing (Supplementary Table 2). Each reaction in the model was treated as a probabilistic event, reflecting the stochastic nature of these processes^17,18. Thus, simulated protein levels displayed fluctuations. To mimic the experimental set-up, two independent alleles were simulated for each virtual cell. In silico, the results resembled the sens allelic output that we observed in vivo (Extended Data Fig. 5).

We first considered a model in which the promoter was always active. The simulated Fano factor was constant irrespective of protein number, consistent with noise being caused by random birth-death events (Fig. 2c)^17,18. This resembled the in vivo profile seen in cells containing more than 300 molecules of Sens (Fig. 2b). Since there was a weak rise in the experimental Fano factor, we hypothesize that one of the post-transcriptional rate constants slowly varies as a function of protein output (Extended Data Fig. 5d). Notably, the model did not recapitulate the observed Fano peak at lower Sens levels. Thus, Sens fluctuations were not exclusively due to the random birth and death of mRNA and protein molecules.

Therefore, we considered an alternative model (Fig. 2d). The promoter was allowed to switch between active and inactive states such that it transcribed mRNA molecules in bursts. Bursty transcription is a common feature of gene expression in many organisms^19–21. Varying the three transcription parameters in our model (k_on, k_off, and S_m) allowed us to independently tune the frequency (k_on⁻¹ + k_off⁻¹)⁻¹ and size (S_m/k_off) of these virtual bursts. When we systematically varied the gene activation parameter k_on and calculated the resulting Fano factor, the in-silico profile closely resembled the in vivo profile (Fig. 2d). In contrast, varying inactivation rate k_off (Extended Data Fig. 5f) or transcription rate S_m (Fig. 2c) did not yield a Fano peak at lower Sens levels. We surmise from these results that transcription of sens at the wing margin is primarily regulated by modulating promoter burst frequency via k_on. Indeed, burst frequency modulation has been observed for multiple developmental genes²² and might be a conserved mechanism to reduce stochastic noise.

These simulations allow us to conceptually frame Sens protein noise as coming from two distinct sources 1) transcriptional bursting kinetics, and 2) RNA/protein birth-death processes. When transcription bursts are infrequent, cells experience large fluctuations in mRNA and protein numbers, which generates a peak in the protein Fano factor. As promoter activation events become more frequent, they approximate a continuous-rate process such that RNA/protein birth-death processes dominate the protein noise, and the Fano factor becomes more constant.

miR-9a modestly suppresses expression noise

The magnitude of the Fano factor should be proportional to the average number of protein molecules translated from one mRNA molecule in its lifetime, if protein noise is caused by RNA/protein birth-death processes^17,18,23,24. Therefore, we experimentally altered this translation burst size for Sens. We did so by eliminating the post-transcriptional repression of Sens by the microRNA miR-9a (Fig. 3a)^15,25. Loss of miR-9a repression increased Sens protein numbers per cell by 1.8-fold (Fig. 3b, Extended Data Fig. 6a). We then compared the Fano factor in cells with or without miR-9a regulation. Loss of miR-9a regulation increased the Fano factor across the entire range of sens expression (Fig. 3c). We compared this effect to model simulations in which either the translation rate (S_p) or mRNA decay rate (D_m) parameter was altered by 1.8-fold. The model-predicted increase in the Fano factor was comparable to the observed elevation when miR-9a regulation was missing (Fig. 3d).

Figure 3 MicroRNA regulation decreases Sens protein output and noise.

a, The sens transgenes were modified to delete the two miR-9a binding sites in the 3’UTR. Sens output was measured in singly tagged allele pairs with and without miR-9a sites. b, Sens protein output increased 1.80 ± 0.21 fold with loss of miR-9a repression. Error bars are standard error of the mean. c, Loss of miR-9a regulation leads to an increase in Fano factor across the entire range of protein output. Shaded regions are 95% confidence intervals. d, Model simulations with a 1.8 fold increase in translation burst size (defined as S_p/D_m) reproduces the experimental Fano profile. Error bars are 95% confidence intervals.events become more frequent, they approximate a continuous-rate process such that RNA/protein birth-death processes dominate the protein noise, and the Fano factor becomes more constant.

Inter-allelic gene transvection dramatically increases Sens protein noise

Our model had suggested that promoter switching is the dominant source of protein noise in cells with fewer than 300 Sens molecules. To test this hypothesis, we landed the sens transgenes in a different location of the genome (Fig. 4a). We reasoned that a different genomic neighborhood would possibly alter promoter bursting dynamics. We chose 57F5 to land sens, since both 22A3 and 57F5 are widely used landing sites for Drosophila transgenes¹⁴. The sens transgenes inserted at 57F5 were highly comparable to the 22A3 site in their ability to rescue null sens mutations, as well as express Sens protein in the correct pattern and abundance at the wing margin (Extended Data Fig. 2).

Figure 4 Sens protein noise is dependent on genome location of the sens gene.

a, The sens transgenes were inserted into one of two locations on chromosome II - 22A3 or 57F5. b, The noise profiles from cells expressing sens at 57F5 or 22A3. Cells with more than 800 molecules have identical Fano values at both genomic positions. The Fano peaks at lower Sens levels are very different between the genomic locations. Shaded regions are 95% confidence intervals. c, Model simulations in which transcription burst size (defined as S_m/k_off) is set to different values as shown. The Fano peak amplitude and position change as burst size varies, but all relax to a constant basal level. These trends are highly similar to those observed in b. Error bars are 95% confidence intervals. d, The noise profiles from cells expressing sens as paired alleles (22A3/22A3, 57F5/57F5) or unpaired alleles (22A3/57F5). Line averages are shown and shaded regions are 95% confidence intervals.

Cells expressing Sens from 57F5 and containing more than 800 Sens molecules had a Fano factor that was identical to their 22A3 counterparts (Fig. 4b). However, the profile was very different in cells with fewer than 800 molecules; the Fano factor peak was larger and greatly expanded. It was remarkable how different the profiles appeared, and we explored potential mechanistic causes of these differences. When varying the transcriptional parameters in the model we found that even a modest increase in burst size could recapitulate the effect of changing gene location from 22A3 to 57F5 on noise (Fig. 4c).

We looked for local properties of the genome that might explain the difference between 22A3 and 57F5. Metazoan genomes are segregated into Topologically Associated Domains (TADs). TADs are conserved 3D compartments of self-interacting chromatin whose boundaries are demarcated by insulator protein binding. TADs often differ in chromatin condensation and accessibility to transcription factors²⁶. The landing site at 22A3 is in the middle of a large, inaccessible, gene-sparse TAD (Extended Data Fig. 7)²⁷. In contrast, the landing site at 57F5 is in a small, accessible, gene-dense TAD. Moreover, the 22A3 site is 40 kb from bound insulators while the 57F5 site is located less than 1 kb from a TAD boundary. Proximity to insulator elements is associated with transcriptional interactions between paired alleles of Drosophila genes^22,28,29. To test whether altered transcriptional kinetics of sens at 57F5 were allele-intrinsic or due to inter-allelic interactions, we placed a 57F5 allele in trans to a 22A3 allele, generating unpaired alleles. The Fano factor profile of 22A3/57F5 cells was strikingly similar to that of 22A3/22A3 cells (Fig. 4d). In contrast, the model predicted that if the alleles behaved intrinsically, then the Fano factor of 22A3/57F5 cells would have been much greater than that of 22A3/22A3 cells (Extended Data Fig. 5g). This suggests that alleles paired at 22A3 frequently fire independently of one another, but alleles paired at 57F5 exhibit inter-allelic interactions, also called transvection. Loss of homolog pairing in 22A3/57F5 cells presumably reverts the Fano factor profile to mimic the non-interacting 22A3 pair (Fig. 4d).

Sens noise can disrupt patterning order of adult bristles

Although we observed dramatic changes in Sens noise when we altered genomic location, heightened fluctuations were limited to cells with fewer than 800 Sens molecules, far lower than the level of Sens expression in S-fated cells. Therefore, it is possible that these fluctuations do not impact fate transitions and bristle patterning. Remarkably, this is not the case. Instead, we find that increased Sens stochasticity in this regime results in increased pattern disorder in the adult form.

We had measured Sens noise in cells undergoing fate decisions to make chemosensory bristles. Chemosensory bristles are periodically positioned in a row near the adult wing margin, such that approximately every fifth cell is a bristle (Fig. 5a)³⁰. Mechanosensory bristles form in a continuous row at the outermost margin of the adult wing and are selected 8-10 hours after the chemosensory cells are selected³⁰. We reasoned that if Sens numbers fluctuate in sensory progenitor cells near the bistable switching threshold, it might propel erroneous escape from lateral inhibition to generate ectopic sensory organs. Thus, mechanosensory bristles positioned incorrectly in the chemosensory row might be derived from proneural cells that escaped inhibition during chemosensory specification (Fig. 5a). Indeed, when we compared ectopic bristles in 57F5 versus 22A3 adults, the frequency increased ten-fold from 3.2% to 29.1% in 57F5 adults (Fig. 5b, Supplementary Table 3). Ectopic chemosensory bristles were also observed (Figs. 5a, Extended Data Fig. 8). The difference in errors was not attributable to genetic background in the different lines since parental stocks had identical ectopic bristle frequencies (Fig. 5b, Extended Data Fig. 8). Nor was it due to higher Sens protein levels in cells from 57F5 animals since there was no dramatic difference in the Sens levels between 57F5 and 22A3 cells (Extended Data Fig. 2). Moreover, loss of miR-9a regulation increased Sens levels (Fig. 3b) but did not increase ectopic bristle frequency (Fig. 5c, Supplementary Table 3). Finally, we ruled out the possibility that insertion near neurogenic genes was responsible for enhanced ectopic bristles from 57F5. First, none of the genes residing in the 57F5 TAD are annotated as neurogenic (Supplementary Table 4)¹⁵. Second, adults with sens at 22A3/57F5 had an ectopic frequency of 1.7%, not significantly different from 22A3/22A3 adults (Fig. 5b, Supplementary Table 3).

Figure 5 Ordered sensory patterning is disrupted by stochastic gene expression.

a, The dorsal surface of the adult wing margin displays two ordered rows of sensory organs - an outer continuous row of thick mechanosensory bristles (cyan) and an inner periodic row of thin chemosensory bristles (magenta). Disordered patterns are observed when bristles are incorrectly positioned. Instances of ectopic (mis-positioned) mechanosensory bristles in the chemosensory row (center) and ectopic chemosensory bristles, which disrupt periodic spacing (right), were observed and counted. b, Pattern disorder is significantly higher when sens allele pairs are expressed from locus 57F5/57F5 relative to alleles at 22A3/22A3 or 22A3/57F5. This effect is seen regardless of whether miR-9a regulates sens or not. c, Increasing mean Sens levels 1.8-fold by removing miR-9a regulation does not lead to increased mispatterning events at either locus. d, The centroids of Sens-positive cells in 22A3/22A3 and 57F5/57F5 wing discs were mapped and color coded according to Sens expression level (left) and Fano noise level (right). Cells experiencing high Fano noise were distributed throughout the proneural zone of the 57F5 disc where S-fate determination occurs. e, Quantitative model of Sens protein dynamics during pattern formation. Sample simulations use a ID model adapted from Corson et al.,(2017)⁷ that generates periodic E and S cells as a final pattern. When all cells first express Sens, they enter an unstable state in which Sens levels are intermediate. Cells then bifurcate into alternating states with maximal or minimal Sens. The magnitude of noise in Sens expression affects the time taken to resolve the pattern and the accuracy of the pattern.

To understand why 57F5 cells with relatively low Sens numbers and high Sens noise disrupted patterning order, we mapped the location of these cells in the wing disc. Wg induces Sens in two broad stripes of cells, each stripe being 4-5 cell diameters wide (Fig. 5d)^8,31. It is only cells near the center of a stripe that express higher levels of Sens^7,31, and a few of these will switch to an S fate. This pattern was preserved whether sens was transcribed from 22A3 or 57F5 (Fig. 5d, Extended Data Figure 9a). However, the pattern of noise was remarkably different. For the 22A3 gene, cells with the greatest noise were at the edges of each stripe, distant from the central region from which S cells normally emerge (Fig. 5d, Extended Data Figure 9b). In contrast, for the 57F5 gene, cells with high noise were located throughout the stripe, including the central region. Thus, it is likely that a subset of cells encompassed in the Fano peak for the 57F5 gene were close to the bistable switch threshold and therefore susceptible to errors in fate determination due to the enhanced fluctuations in Sens.

Noise in sens expression appears to be a fine-tuned parameter. If noise is too low, then the final bristle pattern will be highly ordered, but cells will take more time to resolve their fates since noise initiates the self-organizing process of pattern formation (Fig. 5e). If noise is too high, then cells will rapidly resolve their fates, but the final pattern will be disordered (Fig. 5e). In this high noise scenario, a cell may experience a random fluctuation large enough to flip the cell into an inappropriate stable state during the resolution of pattern formation. Overall, it suggests that perhaps stochasticity in gene expression is an evolutionarily constrained parameter that allows rapid yet accurate cell fate resolution without requiring large numbers of fate-determining molecules^32–34.

Methods

Generation of sens transgenic stocks

The sens alleles sens^E1 and sens^E2 are protein null mutants^9,35. N-terminal 3xFlag-TEV-StrepII-sfGFP-FlAsH tagged sens originally generated from the CH322-01N16 BAC was a kind gift from K. Venken and H. Bellen¹⁴ and has been shown to rescue sens^E1 and sens^E2 mutations^14,15. To generate mCherry tagged sens, the sfGFP coding sequence in 3xFlag-TEV-StrepII-sfGFP-FlAsh was swapped out for mCherry by RpsL-Neo counter-selection (GeneBridges). The sfGFP-mCherry tandem tagged sens transgene was generated similarly by overlap PCR such that sfGFP and mCherry sequences were separated by a 12 amino acid (GGS)₄ linker. The miR-9a binding site mutant alleles of the tagged sens transgenes were created by deletion of the two identified binding sites in the 607 nt sens 3’ UTR as had been described previously¹⁵ to generate sens^m1m2 mutant transgenes. Cloning details are available on request. All BACs were integrated at PBacy+-attP-3BVK00037 (22A3) and PBacy+-attP-9AVK00022 (57F5) landing sites by phiC31 recombination¹⁴. Transgenic lines were crossed with sens mutant lines to construct stocks in which sens transgenes were present in a sens^E1/sens^E2 trans-heterozygous mutant background. Thus, the only Sens protein expressed from these animals came from the transgenes. All experiments were performed on these stocks.

Adult wing imaging and mispatterning analysis

Adult females from uncrowded vials were collected on eclosion and aged for 1-2 days before being preserved in 70% Ethanol. Wings from preserved animals were plucked out with forceps and kept ventral side up on a glass slide. Approximately 10 pairs of wings were arranged per slide using a thin film of Ethanol to lay them flat. Left and right wings from the same animal were positioned next to each other. Once specimens were arranged as desired, excess ethanol was wiped away. A second glass slide was coated with heptane glue (10 cm² double sided embryo tape dissolved overnight in 4 ml heptane) and pressed down onto the specimen slide to affix them dorsal side up. Then wings were mounted in 70% glycerol in PBS and sealed with nail polish for imaging. Wings were imaged using an Olympus BX53 upright microscope with a 10x UPlanFL N objective in brightfield. To achieve optimal resolution, 8-10 overlapping images were taken for each wing and stitched together in Adobe Photoshop^®.

Wings with at least one mechanosensory bristle placed ectopically in or adjacent to the chemosensory bristle row were counted as mispatterned. The proportion of mispatterned wings was calculated for each genotype (n ≥ 60). Genotypes were compared by calculating the odds ratio of mispatterning and determined to be significantly different from 1 if p < 0.05 using Fischer’s exact test. For chemosensory bristle density, wing images were used to identify and mark chemosensory bristles along the margin in Fiji. The euclidean distance between successive bristles was measured and bristle density was calculated as the inverse of mean spacing. 95% confidence intervals were calculated by bootstrapping and bristle distributions across genotypes were compared statistically using student’s t-test.

Fluorescence microscopy

All fluorescence microscopy experiments used female white pre-pupal animals. The white pre-pupal stage was chosen because it is easily identified and lasts for only 45-60 minutes. Further, wing margin chemosensory precursor selection was observed to be tightly linked to the transition from late third larval instar to pre-pupal stage. Therefore,choosing white pre-pupal animals allowed us to strictly control for developmental stage. Wing discs from staged animals were dissected out in ice-cold Phosphate Buffered Saline (PBS). Discs were fixed in 4% paraformaldehyde in PBS for 20 minutes at 25°C and washed with PBS containing 0.3% Tween-20. Then they were stained with 0.5 μg/ml DAPI and mounted in Vectasheild™. Discs were mounted apical side up and imaged with identical settings using a Leica TCS SP5 confocal microscope. All images were acquired at 100x magnification at 2048 x 2048 resolution with a 75 nm x-y pixel size and 0.42 μm z separation. Scans were collected bidirectionally at 400 MHz and 6x line averaged. Wing discs of different genotypes were mounted on the same microscope slide and imaged in the same session for consistency in data quality.

Immunohistochemistry

Discs were dissected and fixed as above before incubating with the primary antibodies of interest. Tissues were washed thrice for 5-10 minutes each and incubated with the appropriate fluorescent secondary antibodies (diluted 1:250) for 1 hour. They were then stained with DAPI, washed in PBS-Tween and mounted for imaging. Guinea pig anti-Sens antibody (gift from H. Bellen) was used at 1:1000 dilution.

Image analysis

Cell Segmentation

For each wing disc, five optical slices containing proneural cells were chosen for imaging and analysis. A previously documented custom MATLAB script was used to segment nuclei in each slice of the DAPI channel^36,37. Briefly, high intensity nucleolar spots were smoothed out to merge with the nuclear area to prevent spurious segmentation. Next, cell nuclei were identified by thresholding based on DAPI channel intensity. Segmentation parameters were optimized to obtain nuclei with at least 100 pixels and no more than 4000 pixels. For each nuclear area so identified, the average signal intensity for the sfGFP and mCherry channels was recorded along with the relative position of its centroid in x and y. Since segmentation was based exclusively on the nuclear signal, it identified all cells present in the imaged area (Extended Data Fig. 3a).

Background fluorescence normalization

The majority of cells imaged did not fall within the proneural region and therefore displayed background levels of fluorescence scattered around some mean level (Extended Data Fig. 3b). Sens expressing cells were present in the right hand tail of the distribution. The background was channel specific and varied slightly from disc to disc (Extended Data Fig. 3c). Therefore, we calculated the mean channel background for each channel in each disc individually. We did this by fitting a Gaussian distribution to the population and finding the mean of that fit. In order to separate Sens positive cells, we chose a cut-off percentile based on the normal distribution, below which cells were deemed Sens negative. We set this cut-off at the 84^th percentile for all analysis (Extended Data Fig. 3d).

This was determined empirically by mapping cell positions relative to the pronueral region. At and above the 84^th percentile, mapped cells followed the proneural striped pattern. Lowering the cut-off led to addition of cells randomly scattered across the imaging field. Increasing the cut-off led to progressive narrowing of the proneural stripes. From this we inferred the fluorescence level at 84^th percentile as a tolerant but specific threshold to identify Sens positive cells. Thus, to normalize measurements across tissues and experiments, this value was subtracted from the total measured fluorescence for all cells in that disc and channel. Only cells with values above the threshold for both mCherry and sfGFP fluorescence were assumed Sens positive (usually 30% of total cells) and carried forward for further analysis (Extended Data Fig. 3e).

mCherry and sfGFP fluorescence units scaling

We required the relative fluorescence of the mCherry and sfGFP channels to be scaled in equivalent units. To do this, we fit a linear equation as shown, and derived best-fit values for slope and constant intercept.

To preserve data integrity, the slope and constant was calculated for each wing disc separately. Linear correlation coefficients were consistenly high between mCherry and sfGFP fluorescence, ranging from 0.85 to 0.95. Finally, to rescale single cell mCherry fluorescence in units of sfGFP-Sens fluorescence, we applied the following transformation to each cell’s mCherry reading (Extended Data Fig. 3f).

Once the two-channel RFUs were made equivalent, they were summed to obtain total Sens RFU for each cell as shown.

Stochastic noise and Fano factor calculation according to gene expression level

We used the following formula to calculate intrinsic noise¹. Mathematically, it is the variance remaining after the co-variance term of two variables is subtracted from their total variance. This value is then normalized to the squared mean (η² = σ²/μ²) to obtain the following dimensionless quantity:

Here x and y represent RFU_sfGFP and ScaledRFU_mcherry. Angled brackets denote averages over the cell population. This term provides a single value of intrinsic noise for the entire cell population. Since Sens expression varies over three orders of magnitude, we partitioned cells into smaller bins according to their total Sens expression. We then calculated intrinsic noise for each binned sub-population. Sens expression RFU was log-transformed and we used a bin width of 0.02 log(RFU) to partition cells (Extended Data Fig. 3g-h).

For each bin, we calculated the intrinsic noise η² as well as mean Sens expression μ. These were multiplied together to calculate the Fano factor for each bin.

Given that the number of cells in each bin was not constant, and that variance estimates are affected by sample size, we calculated confidence intervals around the calculated Fano factor for each bin by bootstrapping. We resampled bin populations 50,000 times with replacement. The 2.5^th and 97.5^th percentile estimates were used to construct a 95% confidence interval for that bin’s Fano factor (Extended Data Fig. 3i).

Technical noise subtraction

Intrinsic noise and the Fano factor were calculated as described above for tandem-tagged sfGFP-mCherry-sens wing discs. Intrinsic noise was identical for tandem-tagged sens genes inserted at either 22A3 or 57F5. This would be expected if the intrinsic noise from this transgene was caused by stochastic processes related to photon detection and counting. Therefore, we pooled data generated from both locations before binning into sub-populations. In order to construct a statistical model for technical noise at each level of Sens fluorescence, we used a Lowess regression to fit a continuous line through the data (as seen in Fig. 2a). The Lowess algorithm fits a locally weighted polynomial onto x-y scatter data and therefore does not rely upon specific assumptions about the data itself. The local window used to calculate a fit was kept constant for all Lowess fits. Using our statistical model, we generated a predicted Fano factor that was due to technical noise for each bin. This predicted value was subtracted from the Fano factor that was due to both technical and gene expression noise for each bin. The difference obtained is an estimate of the Fano factor due to noise in sens gene expression.

Fluorescent tag similarity

As an additional control, we checked by various means if indeed sfGFP-Sens and mCherry-Sens proteins behaved similarly in vivo such that the nature of the protein tag did not affect quantitative assays.

First, we measured the molecule counts of sfGFP-Sens and mCherry-Sens in the same cells using Fluorescence Correlation Spectroscopy (FCS). As can be seen in Extended Data Fig. 4a, we obtained similar numbers of Sens molecules irrespective of which fluorescent tag was attached. This indicated that both alleles express equal numbers of protein in vivo.

Second, using the microRNA repression assay detailed in Extended Data Fig. 6a, we sensitively assayed whether the nature of the tag affects protein output quantitatively. If one tag were differentially expressed relative to the other, we would expect the fold-repression values calculated using mCherry tagged sens alleles to be different from sfGFP tagged sens alleles. This was not observed.

Third, to ensure that we did not under-sample stochastic noise due to Fluorescence Resonance Energy Transfer (FRET), we imaged tandem tagged sfGFP-mCherry-Sens samples in both channels after exciting only the donor (sfGFP) molecules. As shown in Extended Data Fig. 6b, there is negligible FRET from sfGFP to mCherry when using imaging parameters identical to experimental runs.

Fluorescence Correlation Spectroscopy (FCS)

FCS sample calibration and measurement

White pre-pupal wing discs were dissected in PBS and sunken into LabTek 8-well chambered slides containing 400 μl PBS per well³⁸. Discs were positioned such that the pouch region was facing the bottom of the well to be imaged. FCS measurements were made using an inverted Zeiss LSM780, Confocor 3 instrument with APD detectors. A water immersion 40x objective with numerical aperture of 1.2 (which is optimal for FCS measurements) was used throughout. Fast image scanning was utilized for identification of cell nuclei to be measured by FCS. Prior to each session, we used 10 nM dilute solutions of Alexa488 and CF586 dyes to calculate the average number of particles, the diffusion time and define the structural parameters and z₀. Using these we calibrated the Observation Volume Element (OVE) whose volume can approximated by a prolate ellipsoid . Measurements were performed in Sensory Organ Precursor cells (SOPs or S-fated), as well as first and second order neighbors, residing dorsally or ventrally of the S-fated cell (Extended Data Fig. 4a). Measurements were subjected to analysis and fitting, using a two components model for three dimensional diffusion and triplet correction as follows:

FCS measurements were excluded from analysis if they exhibited marked photobleaching or low CPM i.e. counts per molecule (CPM < 0.5 kHz per molecule per second). Due to the higher CPM of sfGFP, it was expected that Sens-sfGFP measurements are more accurate. We, nevertheless, observed fairly similar molecular numbers for both sfGFP-Sens and mCherry-Sens. Normalized auto correlation curves allowed us to compare the differential mobilities of the tagged Sens protein molecules in the nucleus and their degree of interaction with chromatin. Consistently, for both sfGFP and mCherry tagged transcription factors, we observed similar amplitudes and decay times of the slow FCS component, suggesting that the interaction with chromatin is not substantially different for differently tagged Sens molecules or even at different Sens concentrations.

Conversion from relative fluorescence to molecule counts

We compared Sens protein concentrations as measured by FCS to single cell fluorescence data from confocal imaging of the fixed tissue. All comparisons were done for the genotype shown below since all FCS measurements were made in these animals.

First, we looked at the extremes of Sens expression. Since FCS was only performed on Sens positive nuclei as visible by eye, we did not consider the lowest Sens expressing cells comparable to the confocal measured minimum Sens. However, we expected cells with highest Sens to be of similar magnitude between the two methods. To mitigate the effect of extreme outliers on the maxima, we looked at Sens expression profiles of individual discs (Extended Data Fig. 4b). As can be seen from FCS data, for both sfGFP and mCherry channel measurements, the highest Sens levels are no greater than 250 nM (per channel). The highest Sens positive cells, as measured by fluorescence microscopy, display approximately 25 RFU Sens (per channel). This gave us a rough conversion factor of 1 RFU equivalent to 10 nM.

Next we looked at the first and second order neighbors of S-fated cells. While S-fated cells show a large range of Sens expression, their E-fated neighbors display relatively less dispersion. FCS analysis showed that most of these cells expressed Sens in the range of 25nM - 125 nM per channel. Summing up the signal from sfGFP-Sens and mCherry-Sens, this corresponds to a total nuclear concentration of 50-250 nM. Based on this we divided Sens positive nuclei into three categories as shown in Supplementary Table 5.

We then mapped the labelled cell types to the original images. We expected that categorizing cells and mapping their positions should recreate the wing margin pattern i.e. S-fated cells (yellow) dispersed periodically along both sides of the wing margin surrounded by 1° and 2° neighbors (cyan) (Extended Data Fig. 4c). Indeed, we observe this pattern reproducibly if 1 RFU is assumed equivalent to 10 nM Sens (center column Extended Data Fig. 4d).

To further test this conversion factor, we made an order of magnitude estimation. Assuming 1 RFU = 3.3 nM (left column) or 1 RFU = 30 nM (right column), we again labelled cells according to the FCS observed cell types for different concentrations of Sens. As shown, increasing or decreasing the conversion factor three-fold does not reproduce the expected spatial pattern. This is most notable in the S-fated category where we see either none (3.3 nM) or a near-continuous stripe (30 nM) (Extended Data Fig. 4d).

Thus we chose 10 nM as a reasonable conversion factor from fluorescence to nanomolars for our data. Next, we converted from nanomolars to molecule numbers. Assuming a measured average wing disc cell nuclear volume of 22.99 x 10⁻¹⁵ L, each nanomolar of Sens corresponds to 13.8 molecules³⁸. Therefore, we converted relative fluorescence units to molecules per nucleus as follows:

miR-9a repression measurements

In order to measure the fold-decrease in Sens protein output due to miR-9a repression of sens mRNA, we compared the ratio of mCherry-Sens to sfGFP-Sens in the following genotypes:

Only mCherry-sens resistant to miR-9a repression
Neither mCherry-sens or sfGFP-sens resistant to miR-9a repression
Only sfGFP-sens resistant to miR-9a repression

Single cell fluorescence values were obtained after cell segmentation and background subtraction as described earlier. Cells from individual discs were pooled together and red-green fluorescence was linearly correlated using least squares fit (QR factorization) to determine a slope and intercept for each disc. Next the average slope was calculated for each genotype (shown above).

Fold reduction in mCherry-Sens protein output due to miR-9a was calculated as the ratio of slope-(A) to slope-(B) with relative errors propagated. Similarly, fold reduction in sfGFP-Sens protein output due to miR-9a was calculated as the ratio of slope-(B) to slope-(C)(Extended Data Fig. 6a).

Topological Domain Structure

Heat maps of aggregate Hi-C data were used to calculate chromosomal contact frequency for embryonic nc14 datasets²⁷ (Extended Data Fig. 7, data from Stadler et al., 2017) for landing sites at 22A3 and 57F5. DNase accessibility data³⁹ (from Li et al., 2011), and ChIP-seq of the insulator proteins CP190, BEAF-32, dCTCF, GAF and mod(mdg4) (from Négre et al., 2010)⁴⁰ for the corresponding coordinates is shown as well.

Experimental estimation of rate constants

mRNA decay rate D_m

Female pre-pupal wing discs were dissected in WM1 medium⁴¹ at room temperature. To inhibit RNA synthesis, discs were incubated in WM1 plus 5 μg/ml Actinomycin D in light protected 24-well dishes at room temperature. Approximately 20 discs were collected at 0, 10, 20 and 30 minutes post-treatment and were homogenized with 300 μl Trizol for RNA extraction and qPCR analysis. Long-lived Rpl21 mRNA was used to normalize mRNA levels across time points. Similar results were obtained when 18S rRNA was used for normalization. mRNA decay was assumed exponential and a curve fit across all time-points was used to calculate the decay constant D_m to be 0.0462 mRNA/min corresponding to a half-life t_1/2 = 15.75 minutes (R² = 0.91). Hsp70 mRNA decay was also measured as an additional short-lived control with known half-life (observed t_1/2 = 35 mins). All qPCR primers used are listed in Supplementary Table 6

Protein decay rate D_p

Homozygous 3xFlag-TEV-StrepII-sfGFP-FlAsH-sens (in a sens null background) female pre-pupal wing discs were dissected in WM1 medium at room temperature. Discs were incubated in WM1 plus 100 μg/ml cycloheximide for 0, 1, 2 and 3 hours at room temperature. Ten discs were harvested at each time-point and snap frozen in liquid nitrogen. To assay Sens protein abundance, we used an indirect sandwich ELISA (enzyme-linked immunosorbent assay) protocol as follows. Frozen discs were homogenized in 150 μl PBS containing 1% Triton-X, centrifuged to remove crude particulate matter and then incubated with rabbit anti-GFP (1:5000) overnight at 4°C in anti-Flag antibody coated wells. Wells were washed with PBS with 0.2% Tween 20 and incubated with HRP linked goat anti-rabbit (1:5000) antibody for 2 hours at 37°C. Wells were subsequently washed and incubated with 100μl 1-Step Ultra TMB-ELISA substrate. HRP activity was terminated after 30 minutes with 100μl 2M H₂SO₄ and absorbance measured at 450 nm. Protein decay was assumed exponential and a curve fit across all time-points was used to estimate the decay constant D_p to be 0.12 proteins/hr, corresponding to t_1/2 = 5.09 hours (R² = 0.84).

Protein synthesis rate S_p

As has been theorized previously^17,18,23 and also suggested by our experimental data (Fig.3), a constant Fano factor is related to the translation burst size b as follows

Here b is defined by the post-transcriptional rate constants as:

The Fano factor in the constant regime for Sens is ~20 molecules (Fig. 3c). Assuming b =19 and substituting the measured values for D_m and D_p, we estimate that S_p ~ 0.9 proteins/mRNA/min. When miR-9a binding sites are deleted from the gene, Sens protein output is 1.80 ± 0.21 fold higher and the Fano factor is ~ 35 molecules (Fig. 3b-c). This makes the miR-9a resistant protein synthesis rate S_p ~ 1.7 proteins/ mRNA/min. Thus, we fixed S_p at 0.9 proteins/mRNA/min or at 1.7 proteins/mRNA/min to simulate sens alleles with and without miR-9a binding sites respectively.

Stochastic Simulation Model

We modeled the various steps of gene expression, based on central dogma, as linear first order reactions (Extended Data Fig. 5a). To simulate the stochastic nature of reactions, we implemented the model as a Markov process using Gillespie’s stochastic simulation algorithm (SSA)⁴². A Markov process is a memoryless random process such that the next state is only dependent on the current state and not on past states. Simple Markov processes can be analyzed using a chemical master equation to provide a full probability distribution of states as they evolve through time. The master equation defining our three-variable gene expression Markov process is as follows:

Here n_P and n_M denote the number of protein and mRNA molecules respectively. n_{G_total} is the total number of genes of which n_G are genes in the ‘ON’ state capable of transcription. Therefore, n_G/n_{G_total} is the fraction of active genes. Time is denoted by t. The rate constants are defined in Supplementary Table 2.

As the Markov process gets more complex, the master equation can become too complicated to solve. Gillespie’s SSA is a statistically exact method which generates a probability distribution identical to the solution of the corresponding master equation given that a large number of simulations are realized.

Simulation set-up and algorithm

The gene expression model is comprised of six events (Extended Data Fig. 5a) and their associated reaction rates shown in Supplementary Table 2. Unless specified, the events and rate constants were kept identical between sfGFP-sens and mCherry-sens alleles simulated in the same cell. At any given instance, for a given allele, either of these six events could take place.

Gillepsie’s SSA is based on the fact that the time interval between successive events can be drawn from an exponential distribution with mean 1/r_total where i.e the sum total of reaction rates for all i events. Further, the identity of the event that will occur is drawn from a point probability defined as

The algorithm proceeded as follows:

We initialized all simulations to start with state
- Promoter state = off
- mRNA molecules = 0
- protein molecules = 0
- simulation time = 0 minutes
r_total was determined by calculating the individual rates r_i at current time t which depend on the number of substrate molecules and the rate constants in Supplementary Table 2.
A random time interval τ was picked from the exponential distribution with mean 1/r_total
A random event i was picked with probability P(i) as described above.
The cellular state was changed in accordance with the chosen event. The possible state changes were as follows
- Promoter state from off → on
- Promoter state from on → off
- mRNA molecule count increased by 1
- mRNA molecule count decreased by 1
- protein molecule count increased by 1
- protein molecule count decreased by 1
Simulation time was updated as t + τ
Steps 2 to 6 were iterated until total simulation time reached 5 hours.

Fano factor calculation

We ran simulations for 5 hours to approximate steady state expression, at the end of which protein and mRNA molecules produced from each allele in each cell were counted. A minimum of 5000 such ‘cells’ were simulated for each set of parameters. For simulations that tested the effect of parameter gradients on Sens noise, we divided the graded parameter into 20 discrete levels. Each level was simulated separately after which cells from all levels were pooled to generate a whole population. This population was binned into 25-30 bins based on total Sens level, and the Fano factor was calculated for each bin. Bootstrap with resampling was used to determine 95% confidence intervals for each bin’s Fano factor.

Parameter constraints

To keep simulations computationally feasible, we adjusted the slowest rate parameter, the protein decay rate D_p, from 0.002 proteins/min to 0.01 proteins/min (half-life from 5 hours to 1 hour). This is because we conducted simulations until protein conditions reached steady state, which is approximately five-fold longer than the half-life for the slowest reaction. For 25-hour simulations, this was resource and time-intensive. We compared the noise trends in simulations with either D_p of 0.002 proteins/min or to 0.01 proteins/min, and found both generated similar noise trends to one another. This indicates that protein decay is a not a major source of intrinsic noise in this model. Therefore, we kept D_p at 0.01 proteins/min.

The transcriptional parameters S_m, k_on and k_off were varied in accordance with the specific hypothesis being tested. We constrained them loosely to be within an order of magnitude of reported values for these rates from the literature⁴³. We also constrained these rates so as to produce steady state protein numbers and Fano factors similar to experimental data. The minimum and maximum values used are listed in Supplementary Table 2.

Modeling sens regulation by a morphogen gradient

As seen in Fig. 1e, Sens-positive cells display a wide range of expression and they are patterned in space as stripes. This is due to signaling via Wg, which is secreted from the presumptive wing margin and diffuses to form a bidirectional morphogen gradient. Wg signaling directly activates transcription of the sens gene⁸. We assumed that at least one of the three transcriptional rate parameters (S_m, k_on or k_off) in our model must be responsive to Wg signaling. We systematically varied one of the parameters while keeping the others constant. In all cases, varying one parameter did produce a spectrum of Sens expression levels.

We next calculated the Fano profile for each case. Only a free variation in k_on produced a Fano profile that resembled the experimental data, with a Fano peak at the lowest Sens levels which dramatically declines as Sens levels increase. Thus, to recreate a Sens gradient in silico we kept S_m, D_m, S_p, D_p and k_off constant and varied k_on from 0.025 to 5 min⁻¹. Since 1/k_on defines the average time the promoter is inactive, this varied from 12 seconds to 40 minutes in our model.

Impact of transcription burst kinetics

Given that average time the promoter is ‘off’ is 1/k_on and average time it is ‘on’ is 1/k_off, we define transcription burst size and burst frequency as follows

It is worth noting these values define the average burst size or frequency across exponentially distributed values. We independently varied burst size with S_m (Fig. 2c) and burst frequency with k_on (Fig. 2d).

As described previously, a gradient in k_on can re-create the experimentally observed noise profile. Together, these observations suggest that perhaps the morphogen gradient translates into a gradient of sens promoter burst frequencies - at low morphogen concentrations, burst frequency is low and at high concentration, the promoter switches states rapidly. In general, we found that as promoter state switching time-scales get smaller with respect to mRNA or protein lifetimes, bursting dynamics negligibly contribute to expression stochasticity (Extended Data Fig. 5e). This is expected since frequent individual transcription bursts get time-averaged on the scale of long lived mRNA or proteins¹⁸.

From above, it is clear that either of k_on or k_off could be rate-limiting to determine burst frequency. Therefore, we also tested the effect of only varying k_off while keeping the other 5 parameters constant (k_on = 1/min i.e. non-limiting). Interestingly, a gradient of k_off produced a very distinct Fano profile which peaked at approximately half-maximal protein expression (Extended Data Fig. 5f). k_off is a coupled parameter that simultaneously affects both transcription burst size and frequency. From this we speculate that perhaps developmental genes are preferentially regulated by modulating k_on rather than k_off to ensure invariant protein production at higher expression levels.

After recreating the graded expression of Sens, we next sought to understand which burst parameter(s) could explain the effect of genomic position on Fano factor. Modulating burst frequency simply regenerated the noise profile seen before, as expected. Increasing burst size with S_m (or even with the coupled parameter k_off) mimicked the higher and larger Fano peak change as seen for sens at 57F5 (Fig. 4c). Thus, from simulation results, we inferred that transcription burst size for sens is greater at position 57F5 than at 22A3. To simulate cells of type 22A3/57F5, we simply simulated cells with two sens alleles with different S_m values corresponding to a burst size of either 5 or 10 mRNAs. As before, alleles were simulated independent of each other to generate the Fano factor profile (Extended Data Fig. 5g).

Relationship between protein level and ‘constant’ Fano factor

If k_on and k_off are not limiting i.e. promoter switching events do not contribute significantly to expression noise; and the promoter is at 100% occupancy, the steady state protein level is described as:

Thus, once the promoter is fully occupied, protein expression must be increased by regulating the birth-death rate constants. Correspondingly, the Fano factor will be:

If b ≫ 1, then we have:

Thus the Fano factor must rise with protein level if these rate constants are perturbed. When we freely vary S_p, D_m or D_p in simulations, we recreate this linear relationship (Extended Data Fig. 5d) such that if the rate constant is biased towards greater Sens protein accumulation, the corresponding Fano factor increases. We also observe signatures of a slowly rising Fano factor in our data (Fig. 3c, 4b) in the regime we describe as ‘constant’ Fano noise. We therefore speculate that S_p, D_m or D_p might vary across the developmental field to expand the range of steady state Sens accumulation independent of the sens promoter.

1D fate selection model

The mathematical model for self-organization of Drosophila proneural tissue and fate selection is as described by Corson et. al.⁷ shown below

Here u describes the state of each cell and can range from 0 (low u or E-fate) to 1 (high u or S-fate). For our purposes, we assume u represents the fractional concentration of fate determinant Sens molecules. Inhibitory signals received from neighbor cells are summed and represented as s and τ is the time-scale of cell dynamics. All functional forms and parameter values were kept identical to Corson et. al.⁷ with the exception of the Brownian noise term η(t) and simplification of the model to a 1D array of competing cells with periodic boundary conditions. Pre-pattern noise was set to zero. Different levels of Sens stochasticity were simulated by running the model with the Brownian noise values drawn from distributions centred at zero, but with different standard deviations. The standard deviation in units of u for low, intermediate and high noise were set to 10⁻⁶, 10⁻³ and 10⁻² respectively.

Acknowledgments

Fly stocks from Hugo Bellen and the Bloomington Drosophila Stock Center are gratefully appreciated. Antibodies were gifts from Hugo Bellen and purchases from the Developmental Studies Hybridoma Bank. We thank Koen Venken for extensive help with BAC recombineering protocols and reagents. We thank Michael Stadler and Michael Eisen for generously providing HiC maps of the 22A3 and 57F5 loci from their studies. We thank Jessica Hornick and the Biological Imaging Facility for help with imaging, and Lionel Fiske for help implementing the fate selection model.

Funding

Financial support was provided from the Northwestern Data Science Initiative (R.G.), Robert H. Lurie Comprehensive Cancer Center (R.G.), Pew Latin American Fellows Program (D.M.P.), Max Planck Society (D.K.P. and P.T.), Chancellors Fellowship of University of Edinburgh (D.K.P.), NIH (R35GM118144, R.W.C.), NSF (1764421, M.M and R.W.C.), and the Simons Foundation (597491, M.M. and R.W.C.). M.M. is a Simons Foundation Investigator.

Author contributions

The experimental work was conceived by R.G. and R.W.C. D.M.P. and H.P. helped R.G. in the recombineering of sens transgenes. Fluorescence correlation spectroscopy was performed by D.K.P., under the supervision of P.T. All other experimental work was performed by R.G., under the supervision of R.W.C. Mathematical data analysis and modeling was conceived by R.G. and M.M. All data and model analysis was performed by R.G. under the supervision of M.M. The manuscript was written by R.G. and R.W.C. with input from all authors.

Competing interests

Authors declare no competing interests.

Data and materials availability

All stocks and molecular reagents will be made freely available upon request. Raw data of segmented cell fluorescence and position for all experiments as well as R and Matlab code used for analysis is available on request.

References

↵
Elowitz, M. B., Levine, A. J., Siggia, E. D. & Swain, P. S. Stochastic Gene Expression in a Single Cell. Science 297, 1183–1186, doi:10.1126/science.1070919 (2002).
OpenUrl Abstract/FREE Full Text
↵
Wernet, M. F. et al. Stochastic spineless expression creates the retinal mosaic for colour vision. Nature 440, 174–180, doi:10.1038/nature04615 (2006).
OpenUrl CrossRef PubMed Web of Science
Chang, H. H., Hemberg, M., Barahona, M., Ingber, D. E. & Huang, S. Transcriptome-wide noise controls lineage choice in mammalian progenitor cells. Nature 453, 544–547, doi:10.1038/nature06965 (2008).
OpenUrl CrossRef PubMed Web of Science
↵
Abranches, E. et al. Stochastic NANOG fluctuations allow mouse embryonic stem cells to explore pluripotency. Development 141, 2770–2779, doi:10.1242/dev.108910 (2014).
OpenUrl Abstract/FREE Full Text
↵
Kaas, J. H. Topographic maps are fundamental to sensory processing. Brain Res Bull 44, 107–112 (1997).
OpenUrl CrossRef PubMed Web of Science
↵
Greenwald, I. LIN-12/Notch signaling: lessons from worms and flies. Genes Dev 12, 1751–1762 (1998).
OpenUrl FREE Full Text
↵
Corson, F., Couturier, L., Rouault, H., Mazouni, K. & Schweisguth, F. Self-organized Notch dynamics generate stereotyped sensory organ patterns in Drosophila. Science 356, doi:10.1126/science.aai7407 (2017).
OpenUrl Abstract/FREE Full Text
↵
Eivers, E. et al. Mad is required for wingless signaling in wing development and segment patterning in Drosophila. PLoS One 4, e6543, doi:10.1371/journal.pone.0006543 (2009).
OpenUrl CrossRef PubMed
↵
Nolo, R., Abbott, L. A. & Bellen, H. J. Senseless, a Zn finger transcription factor, is necessary and sufficient for sensory organ development in Drosophila. Cell 102, 349–362, doi:10.1016/S0092-8674(00)00040-4 (2000).
OpenUrl CrossRef PubMed Web of Science
↵
Jafar-Nejad, H. et al. Senseless acts as a binary switch during sensory organ precursor selection. Genes Dev 17, 2966–2978, doi:10.1101/gad.1122403 (2003).
OpenUrl Abstract/FREE Full Text
↵
Jafar-Nejad, H., Tien, A. C., Acar, M. & Bellen, H. J. Senseless and Daughterless confer neuronal identity to epithelial cells in the Drosophila wing margin. Development 133, 1683–1692, doi:10.1242/dev.02338 (2006).
OpenUrl Abstract/FREE Full Text
↵
Cai, L., Friedman, N. & Xie, X. S. Stochastic protein expression in individual cells at the single molecule level. Nature 440, 358–362, doi:10.1038/nature04599 (2006).
OpenUrl CrossRef PubMed Web of Science
↵
Raser, J. M. & O’Shea, E. K. Control of stochasticity in eukaryotic gene expression. Science 304, 1811–1814, doi:10.1126/science.1098641 (2004).
OpenUrl Abstract/FREE Full Text
↵
Venken, K. J., He, Y., Hoskins, R. A. & Bellen, H. J. P[acman]: a BAC transgenic platform for targeted insertion of large DNA fragments in D. melanogaster. Science 314, 1747–1751, doi:10.1126/science.1134426 (2006).
OpenUrl Abstract/FREE Full Text
↵
Cassidy, J. J. et al. miR-9a minimizes the phenotypic impact of genomic diversity by buffering a transcription factor. Cell 155, 1556–1567, doi:10.1016/j.cell.2013.10.057 (2013).
OpenUrl CrossRef PubMed Web of Science
↵
Bar-Even, A. et al. Noise in protein expression scales with natural protein abundance. Nat Genet 38, 636–643, doi:10.1038/ng1807 (2006).
OpenUrl CrossRef PubMed Web of Science
↵
Thattai, M. & van Oudenaarden, A. Intrinsic noise in gene regulatory networks. Proc Natl Acad Sci U S A 98, 8614–8619, doi:10.1073/pnas.151588598 (2001).
OpenUrl Abstract/FREE Full Text
↵
Paulsson, J. Models of stochastic gene expression. Phys Life Rev 2, 157–175, doi:10.1016/j.plrev.2005.03.003 (2005).
OpenUrl CrossRef
↵
Golding, I., Paulsson, J., Zawilski, S. M. & Cox, E. C. Real-time kinetics of gene activity in individual bacteria. Cell 123, 1025–1036, doi:10.1016/j.cell.2005.09.031 (2005).
OpenUrl CrossRef PubMed Web of Science
Suter, D. M. et al. Mammalian Genes Are Transcribed with Widely Different Bursting Kinetics. Science 332, 472–474, doi:10.1126/science.1198817 (2011).
OpenUrl Abstract/FREE Full Text
↵
Bothma, J. P. et al. Dynamic regulation of eve stripe 2 expression reveals transcriptional bursts in living Drosophila embryos. Proc Natl Acad Sci U S A 111, 10598–10603, doi:10.1073/pnas.1410022111 (2014).
OpenUrl Abstract/FREE Full Text
↵
Fukaya, T., Lim, B. & Levine, M. Enhancer Control of Transcriptional Bursting. Cell 166, 358–368, doi:10.1016/j.cell.2016.05.025 (2016).
OpenUrl CrossRef PubMed
↵
Tkacik, G., Gregor, T. & Bialek, W. The role of input noise in transcriptional regulation. PLoS One 3, e2774, doi:10.1371/journal.pone.0002774 (2008).
OpenUrl CrossRef PubMed
↵
Schmiedel, J. M. et al. Gene expression. MicroRNA control of protein expression noise. Science 348, 128–132, doi:10.1126/science.aaa1738 (2015).
OpenUrl Abstract/FREE Full Text
↵
Li, Y., Wang, F., Lee, J. A. & Gao, F. B. MicroRNA-9a ensures the precise specification of sensory organ precursors in Drosophila. Genes Dev 20, 2793–2805, doi:10.1101/gad.1466306 (2006).
OpenUrl Abstract/FREE Full Text
↵
Sexton, T. & Cavalli, G. The role of chromosome domains in shaping the functional genome. Cell 160, 1049–1059, doi:10.1016/j.cell.2015.02.040 (2015).
OpenUrl CrossRef PubMed
↵
Stadler, M. R., Haines, J. E. & Eisen, M. B. Convergence of topological domain boundaries, insulators, and polytene interbands revealed by high-resolution mapping of chromatin contacts in the early Drosophila melanogaster embryo. eLife 6, 1–29, doi:10.7554/eLife.29550 (2017).
OpenUrl CrossRef PubMed
↵
Fujioka, M., Mistry, H., Schedl, P. & Jaynes, J. B. Determinants of Chromosome Architecture: Insulator Pairing in cis and in trans. PLoS Genet 12, e1005889, doi:10.1371/journal.pgen.1005889 (2016).
OpenUrl CrossRef PubMed
↵
Szabo, Q. et al. TADs are 3D structural units of higher-order chromosome organization in Drosophila. Sci Adv 4, eaar8082, doi:10.1126/sciadv.aar8082 (2018).
OpenUrl FREE Full Text
↵
Hartenstein, V. & Posakony, J. W. Development of adult sensilla on the wing and notum of Drosophila melanogaster. Development 107, 389–405 (1989).
OpenUrl Abstract
↵
Alexandre, C., Baena-Lopez, A. & Vincent, J. P. Patterning and growth control by membrane-tethered Wingless. Nature 505, 180–185, doi:10.1038/nature12879 (2014).
OpenUrl CrossRef PubMed Web of Science
↵
Fraser, H. B., Hirsh, A. E., Giaever, G., Kumm, J. & Eisen, M. B. Noise minimization in eukaryotic gene expression. PLoS Biol 2, e137, doi:10.1371/journal.pbio.0020137 (2004).
OpenUrl CrossRef PubMed
Tanase-Nicola, S. & ten Wolde, P. R. Regulatory control and the costs and benefits of biochemical noise. PLoS Comput Biol 4, e1000125, doi:10.1371/journal.pcbi.1000125 (2008).
OpenUrl CrossRef PubMed
↵
Metzger, B. P., Yuan, D. C., Gruber, J. D., Duveau, F. & Wittkopp, P. J. Selection on noise constrains variation in a eukaryotic promoter. Nature 521, 344–347, doi:10.1038/nature14244 (2015).
OpenUrl CrossRef PubMed
↵
Nolo, R., Abbott, L. A. & Bellen, H. J. Drosophila Lyra mutations are gain-of-function mutations of senseless. Genetics 157, 307–315 (2001).
OpenUrl Abstract/FREE Full Text
↵
Qi, J. et al. Drosophila Eye Nuclei Segmentation Based on Graph Cut and Convex Shape Prior. Int Conf Signal Process Proc, 670–674, doi:10.1109/ICIP.2013.6738138 (2013).
OpenUrl CrossRef
↵
Pelaez, N. et al. Dynamics and heterogeneity of a fate determinant during transition towards cell differentiation. Elife 4, doi:10.7554/eLife.08924 (2015).
OpenUrl CrossRef
↵
Papadopoulos, D. K. et al. Control of Hox transcription factor concentration and cell-to-cell variability by an auto-regulatory switch. Development 146, doi:10.1242/dev.168179 (2019).
OpenUrl Abstract/FREE Full Text
↵
Li, X. Y. et al. Transcription factors bind thousands of active and inactive regions in the Drosophila blastoderm. PLoS Biol 6, e27, doi:10.1371/journal.pbio.0060027 (2008).
OpenUrl CrossRef PubMed
↵
Negre, N. et al. A comprehensive map of insulator elements for the Drosophila genome. PLoS Genet 6, e1000814, doi:10.1371/journal.pgen.1000814 (2010).
OpenUrl CrossRef PubMed
↵
Restrepo, S., Zartman, J. J. & Basler, K. Cultivation and Live Imaging of Drosophila Imaginal Discs. Methods Mol Biol 1478, 203–213, doi:10.1007/978-1-4939-6371-3_11 (2016).
OpenUrl CrossRef
↵
Gillespie, D. T. Exact Stochastic Simulation of Coupled Chemical-Reactions. J Phys Chem-Us 81, 2340–2361, doi:DOI 10.1021/j100540a008 (1977).
OpenUrl CrossRef PubMed
↵
Milo, R., Jorgensen, P., Moran, U., Weber, G. & Springer, M. BioNumbers--the database of key numbers in molecular and cell biology. Nucleic Acids Res 38, D750–753, doi:10.1093/nar/gkp889 (2010).
OpenUrl CrossRef PubMed Web of Science

View the discussion thread.

Posted April 05, 2019.

Download PDF

Citation Tools

Subject Area

Developmental Biology

Subject Areas

All Articles

Animal Behavior and Cognition (5214)
Biochemistry (11745)
Bioengineering (8751)
Bioinformatics (29195)
Biophysics (14971)
Cancer Biology (12095)
Cell Biology (17411)
Clinical Trials (138)
Developmental Biology (9421)
Ecology (14179)
Epidemiology (2067)
Evolutionary Biology (18306)
Genetics (12245)
Genomics (16802)
Immunology (11867)
Microbiology (28083)
Molecular Biology (11592)
Neuroscience (60965)
Paleontology (451)
Pathology (1870)
Pharmacology and Toxicology (3238)
Physiology (4959)
Plant Biology (10427)
Scientific Communication and Education (1683)
Synthetic Biology (2885)
Systems Biology (7339)
Zoology (1651)

[1] ↵
Elowitz, M. B., Levine, A. J., Siggia, E. D. & Swain, P. S. Stochastic Gene Expression in a Single Cell. Science 297, 1183–1186, doi:10.1126/science.1070919 (2002).
OpenUrl Abstract/FREE Full Text

[2] ↵
Wernet, M. F. et al. Stochastic spineless expression creates the retinal mosaic for colour vision. Nature 440, 174–180, doi:10.1038/nature04615 (2006).
OpenUrl CrossRef PubMed Web of Science

[3] Chang, H. H., Hemberg, M., Barahona, M., Ingber, D. E. & Huang, S. Transcriptome-wide noise controls lineage choice in mammalian progenitor cells. Nature 453, 544–547, doi:10.1038/nature06965 (2008).
OpenUrl CrossRef PubMed Web of Science

[4] ↵
Abranches, E. et al. Stochastic NANOG fluctuations allow mouse embryonic stem cells to explore pluripotency. Development 141, 2770–2779, doi:10.1242/dev.108910 (2014).
OpenUrl Abstract/FREE Full Text

[5] ↵
Kaas, J. H. Topographic maps are fundamental to sensory processing. Brain Res Bull 44, 107–112 (1997).
OpenUrl CrossRef PubMed Web of Science

[6] ↵
Greenwald, I. LIN-12/Notch signaling: lessons from worms and flies. Genes Dev 12, 1751–1762 (1998).
OpenUrl FREE Full Text

[7] ↵
Corson, F., Couturier, L., Rouault, H., Mazouni, K. & Schweisguth, F. Self-organized Notch dynamics generate stereotyped sensory organ patterns in Drosophila. Science 356, doi:10.1126/science.aai7407 (2017).
OpenUrl Abstract/FREE Full Text

[8] ↵
Eivers, E. et al. Mad is required for wingless signaling in wing development and segment patterning in Drosophila. PLoS One 4, e6543, doi:10.1371/journal.pone.0006543 (2009).
OpenUrl CrossRef PubMed

[9] ↵
Nolo, R., Abbott, L. A. & Bellen, H. J. Senseless, a Zn finger transcription factor, is necessary and sufficient for sensory organ development in Drosophila. Cell 102, 349–362, doi:10.1016/S0092-8674(00)00040-4 (2000).
OpenUrl CrossRef PubMed Web of Science

[10] ↵
Jafar-Nejad, H. et al. Senseless acts as a binary switch during sensory organ precursor selection. Genes Dev 17, 2966–2978, doi:10.1101/gad.1122403 (2003).
OpenUrl Abstract/FREE Full Text

[11] ↵
Jafar-Nejad, H., Tien, A. C., Acar, M. & Bellen, H. J. Senseless and Daughterless confer neuronal identity to epithelial cells in the Drosophila wing margin. Development 133, 1683–1692, doi:10.1242/dev.02338 (2006).
OpenUrl Abstract/FREE Full Text

[12] ↵
Cai, L., Friedman, N. & Xie, X. S. Stochastic protein expression in individual cells at the single molecule level. Nature 440, 358–362, doi:10.1038/nature04599 (2006).
OpenUrl CrossRef PubMed Web of Science

[13] ↵
Raser, J. M. & O’Shea, E. K. Control of stochasticity in eukaryotic gene expression. Science 304, 1811–1814, doi:10.1126/science.1098641 (2004).
OpenUrl Abstract/FREE Full Text

[14] ↵
Venken, K. J., He, Y., Hoskins, R. A. & Bellen, H. J. P[acman]: a BAC transgenic platform for targeted insertion of large DNA fragments in D. melanogaster. Science 314, 1747–1751, doi:10.1126/science.1134426 (2006).
OpenUrl Abstract/FREE Full Text

[15] ↵
Cassidy, J. J. et al. miR-9a minimizes the phenotypic impact of genomic diversity by buffering a transcription factor. Cell 155, 1556–1567, doi:10.1016/j.cell.2013.10.057 (2013).
OpenUrl CrossRef PubMed Web of Science

[16] ↵
Bar-Even, A. et al. Noise in protein expression scales with natural protein abundance. Nat Genet 38, 636–643, doi:10.1038/ng1807 (2006).
OpenUrl CrossRef PubMed Web of Science

[17] ↵
Thattai, M. & van Oudenaarden, A. Intrinsic noise in gene regulatory networks. Proc Natl Acad Sci U S A 98, 8614–8619, doi:10.1073/pnas.151588598 (2001).
OpenUrl Abstract/FREE Full Text

[18] ↵
Paulsson, J. Models of stochastic gene expression. Phys Life Rev 2, 157–175, doi:10.1016/j.plrev.2005.03.003 (2005).
OpenUrl CrossRef

[19] ↵
Golding, I., Paulsson, J., Zawilski, S. M. & Cox, E. C. Real-time kinetics of gene activity in individual bacteria. Cell 123, 1025–1036, doi:10.1016/j.cell.2005.09.031 (2005).
OpenUrl CrossRef PubMed Web of Science

[20] Suter, D. M. et al. Mammalian Genes Are Transcribed with Widely Different Bursting Kinetics. Science 332, 472–474, doi:10.1126/science.1198817 (2011).
OpenUrl Abstract/FREE Full Text

[21] ↵
Bothma, J. P. et al. Dynamic regulation of eve stripe 2 expression reveals transcriptional bursts in living Drosophila embryos. Proc Natl Acad Sci U S A 111, 10598–10603, doi:10.1073/pnas.1410022111 (2014).
OpenUrl Abstract/FREE Full Text

[22] ↵
Fukaya, T., Lim, B. & Levine, M. Enhancer Control of Transcriptional Bursting. Cell 166, 358–368, doi:10.1016/j.cell.2016.05.025 (2016).
OpenUrl CrossRef PubMed

[23] ↵
Tkacik, G., Gregor, T. & Bialek, W. The role of input noise in transcriptional regulation. PLoS One 3, e2774, doi:10.1371/journal.pone.0002774 (2008).
OpenUrl CrossRef PubMed

[24] ↵
Schmiedel, J. M. et al. Gene expression. MicroRNA control of protein expression noise. Science 348, 128–132, doi:10.1126/science.aaa1738 (2015).
OpenUrl Abstract/FREE Full Text

[25] ↵
Li, Y., Wang, F., Lee, J. A. & Gao, F. B. MicroRNA-9a ensures the precise specification of sensory organ precursors in Drosophila. Genes Dev 20, 2793–2805, doi:10.1101/gad.1466306 (2006).
OpenUrl Abstract/FREE Full Text

[26] ↵
Sexton, T. & Cavalli, G. The role of chromosome domains in shaping the functional genome. Cell 160, 1049–1059, doi:10.1016/j.cell.2015.02.040 (2015).
OpenUrl CrossRef PubMed

[27] ↵
Stadler, M. R., Haines, J. E. & Eisen, M. B. Convergence of topological domain boundaries, insulators, and polytene interbands revealed by high-resolution mapping of chromatin contacts in the early Drosophila melanogaster embryo. eLife 6, 1–29, doi:10.7554/eLife.29550 (2017).
OpenUrl CrossRef PubMed

[28] ↵
Fujioka, M., Mistry, H., Schedl, P. & Jaynes, J. B. Determinants of Chromosome Architecture: Insulator Pairing in cis and in trans. PLoS Genet 12, e1005889, doi:10.1371/journal.pgen.1005889 (2016).
OpenUrl CrossRef PubMed

[29] ↵
Szabo, Q. et al. TADs are 3D structural units of higher-order chromosome organization in Drosophila. Sci Adv 4, eaar8082, doi:10.1126/sciadv.aar8082 (2018).
OpenUrl FREE Full Text

[30] ↵
Hartenstein, V. & Posakony, J. W. Development of adult sensilla on the wing and notum of Drosophila melanogaster. Development 107, 389–405 (1989).
OpenUrl Abstract

[31] ↵
Alexandre, C., Baena-Lopez, A. & Vincent, J. P. Patterning and growth control by membrane-tethered Wingless. Nature 505, 180–185, doi:10.1038/nature12879 (2014).
OpenUrl CrossRef PubMed Web of Science

[32] ↵
Fraser, H. B., Hirsh, A. E., Giaever, G., Kumm, J. & Eisen, M. B. Noise minimization in eukaryotic gene expression. PLoS Biol 2, e137, doi:10.1371/journal.pbio.0020137 (2004).
OpenUrl CrossRef PubMed

[33] Tanase-Nicola, S. & ten Wolde, P. R. Regulatory control and the costs and benefits of biochemical noise. PLoS Comput Biol 4, e1000125, doi:10.1371/journal.pcbi.1000125 (2008).
OpenUrl CrossRef PubMed

[34] ↵
Metzger, B. P., Yuan, D. C., Gruber, J. D., Duveau, F. & Wittkopp, P. J. Selection on noise constrains variation in a eukaryotic promoter. Nature 521, 344–347, doi:10.1038/nature14244 (2015).
OpenUrl CrossRef PubMed

[35] ↵
Nolo, R., Abbott, L. A. & Bellen, H. J. Drosophila Lyra mutations are gain-of-function mutations of senseless. Genetics 157, 307–315 (2001).
OpenUrl Abstract/FREE Full Text

[36] ↵
Qi, J. et al. Drosophila Eye Nuclei Segmentation Based on Graph Cut and Convex Shape Prior. Int Conf Signal Process Proc, 670–674, doi:10.1109/ICIP.2013.6738138 (2013).
OpenUrl CrossRef

[37] ↵
Pelaez, N. et al. Dynamics and heterogeneity of a fate determinant during transition towards cell differentiation. Elife 4, doi:10.7554/eLife.08924 (2015).
OpenUrl CrossRef

[38] ↵
Papadopoulos, D. K. et al. Control of Hox transcription factor concentration and cell-to-cell variability by an auto-regulatory switch. Development 146, doi:10.1242/dev.168179 (2019).
OpenUrl Abstract/FREE Full Text

[39] ↵
Li, X. Y. et al. Transcription factors bind thousands of active and inactive regions in the Drosophila blastoderm. PLoS Biol 6, e27, doi:10.1371/journal.pbio.0060027 (2008).
OpenUrl CrossRef PubMed

[40] ↵
Negre, N. et al. A comprehensive map of insulator elements for the Drosophila genome. PLoS Genet 6, e1000814, doi:10.1371/journal.pgen.1000814 (2010).
OpenUrl CrossRef PubMed

[41] ↵
Restrepo, S., Zartman, J. J. & Basler, K. Cultivation and Live Imaging of Drosophila Imaginal Discs. Methods Mol Biol 1478, 203–213, doi:10.1007/978-1-4939-6371-3_11 (2016).
OpenUrl CrossRef

[42] ↵
Gillespie, D. T. Exact Stochastic Simulation of Coupled Chemical-Reactions. J Phys Chem-Us 81, 2340–2361, doi:DOI 10.1021/j100540a008 (1977).
OpenUrl CrossRef PubMed

[43] ↵
Milo, R., Jorgensen, P., Moran, U., Weber, G. & Springer, M. BioNumbers--the database of key numbers in molecular and cell biology. Nucleic Acids Res 38, D750–753, doi:10.1093/nar/gkp889 (2010).
OpenUrl CrossRef PubMed Web of Science