ABSTRACT
One goal of modern day neuroscience is the establishment of molecular maps that assign unique features to individual neuron types. Such maps provide important starting points for neuron classification, for functional analysis and for developmental studies aimed at defining the molecular mechanisms of neuron identity acquisition and neuron identity diversification. In this resource paper, we describe a nervous system-wide map of the potential expression sites of 244 members of the largest gene family in the C. elegans genome, rhodopsin-like (class A) GPCR chemoreceptors, using classic gfp reporter gene technology. We cover representatives of all sequence families of chemoreceptors GPCRs, some of which were previously entirely uncharacterized. Most reporters are expressed in a very restricted number of cells, often just in single cells. We assign GPCR reporter expression to all but two of the 37 sensory neuron classes of the sex-shared, core nervous system. Some sensory neurons express a very small number of receptors, while others, particularly nociceptive neurons, co-express several dozen GPCR reporter genes. GPCR reporters are also expressed in a wide range of inter- and motorneurons, as well as nonneuronal cells, suggesting that GPCRs may constitute receptors not just for environmental signals, but also for internal cues. We observe only one notable, frequent association of coexpression patterns, namely in one nociceptive amphid (ASH) and two nociceptive phasmid sensory neurons (PHA, PHB). We identified GPCRs with sexually dimorphic expression and several GPCR reporters that are expressed in a left/right asymmetric manner. We identified a substantial degree of GPCR expression plasticity; particularly in the context of the environmentally-induced dauer diapause stage when one third of all tested GPCRs alter the cellular specificity of their expression within and outside the nervous system. Intriguingly, in a number of cases, the dauer-specific alterations of GPCR reporter expression in specific neuron classes are maintained during postdauer life and in some case new patterns are induced post-dauer, demonstrating that GPCR gene expression may serve as traits of life history. Taken together, our resource provides an entry point for functional studies and also offers a host of molecular markers for studying molecular patterning and plasticity of the nervous system.
AUTHOR SUMMARY Maps of gene expression patterns in the nervous system provide an important resource for neuron classification, for functional analysis and for developmental studies that ask how different neurons acquire their unique identities. By analyzing transgenic gfp reporter strains, we describe here the expression pattern of 244 putative chemosensory receptor-encoding genes, which constitute the largest gene family in C.elegans. We show that, as expected, chemoreceptor expression is enriched in chemosensory neurons but it is also expressed in a wide range of interneurons, motorneurons, as well as non-neuronal cells, suggesting that putative chemosensory receptors may not just sense environmental signals but also internal cues. We find that each chemoreceptor is expressed in a few neuron types, often just one, but each neuron type can express a large number of chemoreceptors. Interestingly, we uncovered that chemoreceptor expression is remarkably plastic, particularly in the context of the environmentally-induced dauer diapause stage. Taken together, this molecular atlas of chemosensory receptors provides an entry point for functional studies and offers a host of markers for studying neuronal patterning and plasticity.
INTRODUCTION
Molecular markers selectively expressed in individual neuron types represent invaluable tools to understand how cellular diversity in a nervous system is genetically encoded. Molecular markers that are constitutively and invariably expressed throughout the life of a specific neuron type provide static views of neuronal identity and hence provide entry points to study how invariable identity features are acquired during neuronal differentiation [1]. In contrast, some molecular features of a neuron display a remarkable plasticity in that their expression may be regulated by neuronal activity or in response to specific environmental cues. Such genes serve as markers to understand the nature of the gene regulatory programs that govern such dynamic features of a neuron. We reasoned that a significant expansion of the expression analysis of chemosensory G-protein-coupled receptors (GPCRs), initiated more than 20 years ago [2] using gfp-based reporter gene technology [3], may yield a significantly expanded resource of molecular markers that may label various aspects of neuronal identity and neuronal plasticity in the C. elegans nervous system.
Animal genomes encode five major classes of GPCRs, of which the rhodopsin class (or “class A”) is the largest class [4, 5](Table 1). Rhodopsin class GPCRs can be subdivided into phylogenetically deeply conserved neurotransmitter receptors (neuropeptides, acetylcholine, biogenic amines) as well as non-conserved, chemosensory-type GPCRs (from here on referred to as “csGPCRs”)(Table 1). The csGPCRs have independently expanded in distinct animal phyla where they serve to respond to diverse, physiologically relevant external and, supposedly, internal cues [4, 6, 7]. The genome of the nematode C. elegans encodes an exceptionally large battery of chemosensory-type csGPCRs composed of 1,341 protein-coding genes (Table 2)[2, 7, 8], a remarkable number given the small size of its nervous system (302 neurons constituting 118 anatomically defined neuron types)[9]. These csGPCRs have been subdivided by sequence into families and superfamilies, as summarized in Table 2 [2, 7].
Wormbase contains expression data for 131 csGPCRs, however for only 76 of them the expression site has been defined with single cell resolution (S1 Table). The majority of these 76 reporters revealed expression in chemosensory neurons [2]. Functional studies have linked a small subset of these receptors to the sensation of specific environmental or pheromonal cues [12–21], but in the absence of concerted de-orphanization efforts like those seen in other organisms [22, 23], the number of receptors with assigned ligands is still remarkably low.
Intriguingly, a subset of the previously characterized csGPCR genes were also expressed in non-sensory neurons [2, 24–28] suggesting that they may also function as receptors of internal ligands of unknown identity. Providing some hints to the identity of these ligands, one csGPCR subclass, encoded by the srw genes, displays sequence similarities to peptide receptors [11, 29]. The expression of csGPCRs in interneurons also prompted efforts to identify the function of some of these genes. Even though its ligand remains unknown, AIY-expressed sra-11 was found to be involved in the associative learning paradigm, olfactory imprinting [30], while sra-13 acts in the vulva to control vulval development, which is affected by food signals [26].
In spite of the relative paucity of known ligands, the previously published expression patterns of csGPCRs provided molecular indicators for a number of intriguing and generally very poorly understood nervous system features: (1) the expression pattern of the GPCR gene str-2 revealed a left/right asymmetry in the two AWC olfactory neurons [31]; this lateralization phenomenon was later found to be required for olfactory discrimination [32] and spurred a host of studies aimed at revealing how this left/right asymmetry is developmentally programmed [33]. (2) The expression of several csGPCRs revealed a remarkable plasticity in response to changes in the environment. For example, expression of srd-1 and str-2 and str-3 changes in ASI neurons in response to dauer pheromone [34], and expression of srh-34 and srh-234 in ADL is different in fed versus starved animals [35]. Using these dynamic reporter gene patterns, mechanisms controlling csGPCR plasticity have been elucidated [35, 36]. (3) The csGPCR genes srd-1, srj-54 and odr-10 have been found to be expressed in a sexually dimorphic manner in sex-shared sensory neurons, suggesting that sexual identity impinges on sensory perception [2, 37, 38].
In this resource paper, we examined the expression of 244 reporter transgenes that monitor expression of previously uncharacterized csGPCR genes (for simplicity, from here on referred to as “GPCRs”. Our explicit goal in this analysis was to (1) generate more neuronal identity markers, (2) test the hypothesis that many more sensory neurons may be lateralized, (3) identify more markers of neuronal plasticity, (4) identify more markers of sexual dimorphism and (5) examine the extent of expression in non-sensory and non-neuronal cells (suggesting roles as receivers of internal signals). Based on the molecular classification of csGPCRs into defined families, we were also interested in determining whether the expression of specific subfamilies – particularly those whose expression has not previously examined – may reveal specific common themes (i.e., patterns of co-expression or expression in specific cells) that may provide a hint to their function. We synthesize our findings with those of previous expression pattern analyses to carve out a number of general features of csGPCR expression patterns.
MATERIALS AND METHODS
Mutant Strains
Strains were maintained by standard methods [39]. Mutant alleles used in this study were: pha-1(e2123) [40], him-5(e1490) [41], unc-43(n1186lf) [42], unc-43(n498gf)[43] and nsy-5(ky634) [44].
Reporter and transgenic strain generation
GFP reporters were generated using a PCR fusion approach [45] and injected without being subcloned. Genomic fragments were fused to the GFP coding sequence, which was followed by the unc-54 3’ untranslated region. A list of primers for all constructs can be found in the Supplementary Methods. Amplicons were injected at 50ng/ml with the pha-1 rescuing plasmid (pBX) as a co-injection marker (50ng/ml). Reporters were injected into a pha-1(e2123) or pha-1(e2123);him-5(e1490) mutant background strain [40], resulting in transgenic arrays with little mosaicism. For each construct 2 independent lines were scored. Reporter strains provided by the Vancouver Consortium were generated as described [46]. Further details and primer sequences used by the Vancouver Consortium can be found at http://www.gfpworm.org. A list of all reporter strains generated by us or provided by the Vancouver Consortium can be found in the Supplementary Methods.
Microscopy
Worms were anesthetized using 100mM sodium azide (NaN3) and mounted on 5% agarose on glass slides. Images were acquired using an automated fluorescence microscope (Zeiss, AXIO Imager Z.2). Acquisition of several z-stack images (each ~1 mm thick) was performed with the ZEN 2 pro software. Representative images are shown following max-projection of Z-stacks using the maximum intensity projection type. Image reconstruction was performed using Fiji software [47].
Neuron identification
Neurons were identified either by labeling subsets of sensory neurons with DiD (Thermo Fisher Scientific) or by crossing reporter transgenes with landmark reporter strains in which known neuron types are labeled with a red fluorescent reporter. For dye filling, worms were washed with M9, incubated with DiD (1:500) in M9 for 1 hour at room temperature, washed 3 times with M9, and plated on agar plates coated with food for 1-3 hours before imaging. Red fluorescent reporter strains used for cell identification are: otIs263[ceh-36p::TagRFP, rol-6(su1006)], vyIs51[str-2p::2xnls::TagRFP; ofm-1p::DsRed][48], otIs518[eat-4Fosmid::sl2::mCherry::h2b][49], otIs544[cho-1Fosmid::sl2::mCheriy::h2b][50], otIs564[unc-47Fosmid::sl2::mCherry::h2b][51], otIs612[flp-18p::NLG-1::GFP11, gpa-6p::NLG-1:::GFP1-10, flp-18p::mCherry, nlp-1p::mCherry], hdIs30[glr-1p::DsRed], otIs521[eat-4prom8::tagRFP; ttx-3::gfp].
Hierarchical clustering of neurons by GPCR reporter expression
Clustering was performed on binary expression data from 272 neuron-expressed GPCR reporters for which we had cell ID information. Expression data was from our own analysis and available data from wormbase.org [52]. Only positive neuronal cell ID information per GPCR reporter was included in the binary expression matrix with no distinction between the absence of expression and unknown expression per neuron. Data were clustered using the R pvclust package (https://cran.r-project.org/web/packages/pvclust/pvclust.pdf) [53] using the euclidean distance metric with average linkage, bootstrap 1000, and relative sample size ranging from a proportion of 0.5 to 1.4 of the original sample size. The relative proportion was incremented by 0.1 for each bootstrap resampling. Bootstrap Probability value (BP) and Approximately Unbiased p-values (AU) are derived from the multiscale-multistep bootstrap resampling. AU support values > 95 indicate well-supported clusters and should be considered when evaluating dendrogram cluster relationships. Alternative distance and linkage methods showed clustering of the PHA, PHB and ASH neurons in all cases (42 out of 84 cases had strong support with AU/BP >95).
Upstream intergenic distances and intron length calculations
GPCR upstream intergenic regions and intron lengths were extracted from C. elegans exon coordinates, version WS220 using a python script. Non-coding RNA exons were excluded from the intergenic distance calculations so that intergenic distances represent the nucleotide sequence distance between coding genes. The average intron length per gene was calculated by summing the intron sequence lengths for each gene and dividing by the total number of introns. Average intron lengths for genes with multiple isoforms were calculated for each isoform and then averaged, resulting in one average intron length per gene.
Generation of dauers and analysis of changes in expression
To analyze GPCR reporter gene expression in dauers, mixed populations of respective strains were allowed to exhaust food for 5-7 days at 20°C. Dauers were isolated from starved plates by treatment with 1% SDS for 30 min and imaged within 1-2 hours of isolation. The cellular identity of expression changes in dauers were confirmed with red landmark strains, as mentioned above.
RESULTS
Selection of csGPCRs for expression analysis and method of analysis
We chose to examine csGPCR expression patterns using gfp-based reporter gene technology, the standard tool of gene expression analysis in C. elegans [3, 54]. The obvious shortcoming of this technology is that reporter genes may not capture the full cis-regulatory content of the respective GPCR-encoding locus, but as we will describe in more detail below, most GPCR-encoding loci are compact with small intergenic regions and introns. We emphasize that our approach is not necessarily aimed at identifying the complete set of cells expressing a GPCR, but, following ample precedent, is rather aimed at identifying novel and informative patterns of expression, as incomplete as these patterns may be.
We utilized two sources of csGPCR reporters. A consortium at the University of British Columbia (Vancouver) has generated a valuable, large panel of reporters for 1886 genes in the C. elegans genome [46]. However, the site of expression of these reporters has not been determined with single cell resolution in the nervous system. We obtained 100 reporters from this collection that targeted GPCR loci and for every reporter that produced a stable pattern of expression, we undertook a detailed analysis of their sites of expression in the nervous system.
In addition to these 100 reporter genes, we generated 144 of our own reporter genes. We adhered to the following principles in the choice of genes and design of reporters: First, we aimed to cover all 23 classes of chemoreceptor genes defined by Thomas and Robertson [7](Table 2). Using phylogenetic trees assembled by Thomas and Robertson, we sampled each gene family evenly, generally avoiding the examination of close sequence paralogues, which we anticipated to reveal similar expression patterns.
Our own reporters mostly contain all 5’ intergenic regions fused to gfp and contain at most 4 kb of sequence. The rationale behind this choice lies in the overall organization of GPCR loci (summarized in S1 Fig). 89% of the ~1,300 csGPCR loci contain 5’ intergenic regions of less than 4kb. We chose all of our samples from this pool and the reporters generated by us capture the full intergenic region. The reporters from the Vancouver consortium contain about 3 kb of 5’ intergenic region at most [46]. Furthermore, csGPCR loci tend to have small introns (average size 432 bp; almost half of them <200 bp; S1 Fig), indicating that relatively little cis-regulatory information resides in these introns, which provided the basis for our focus on intergenic regions. For some genes with very short upstream intergenic regions (less than 500 bp) we included the first intron (if this was 300 bp or larger) in order to increase the regulatory space contained in the reporters. The coordinates for all reporter constructs can be found in the Supplementary Material.
Sites of expression within the nervous system were determined mainly for those reporters with most robust expression and was based on stereotyped cell position, cellular and process morphology and co-labeling with either DiD (which labels a subset of sensory neurons) or by crossing with landmark strains in which specific neuron types are labeled with a red fluorescent protein (see Material and Methods). All cell identification was initially done in young adult hermaphrodite animals. As we will describe in detail later, a number of these reporter strains were also subjected to analysis at different stages, under different conditions and in the two different sexes.
GPCRs are expressed in restricted patterns within and outside the nervous system
In our ensuing description of expression patterns of reporter genes, we summarize the expression observed with the previously described reporters, as well as the additional reporters analyzed by us. All of our expression analysis is summarized in a tabular form in S1 Table. Three overall features of the 375 csGPCR reporters are immediately apparent (Fig 1): first, 92% of analyzed reporters are expressed in the nervous system; second, expression is not restricted to the nervous system: 33% of the reporters are expressed both within and outside the nervous system and 8% are expressed exclusively in non-neuronal cells, and third, the vast majority of csGPCR reporters are expressed in very restricted number of cells (Fig 1A,B). Of the neuronally expressed reporters, 24% are expressed in single neuron pairs, 27% in 2 neuron pairs, 26% in 3-4 neuron pairs, 19% in 5-10 neuron pairs and the remaining 4% in more than 10 neuron pairs.
Expression outside the nervous system will be described in a later section. Within the nervous system, expression is most prominent in sensory neurons (Fig 1C). 84% of the reporters are expressed in amphid sensory neurons (which are made up of 12 pairs of neurons), 20% in phasmid sensory neurons (made up of 2 pairs of neurons, PHA and PHB), and 17% in other sensory neurons. We find that every sensory neuron, except for URY and ADE neurons, expresses at least one GPCR (Fig 1D; Table S2). The number of GPCRs expressed in a given neuron shows a striking range. The ASI neuron expresses an impressive 99 GPCR reporters. After ASI, the nociceptive neurons ADL and ASH together with the phasmid neurons PHA and PHB are the sensory neurons with higher number of GPCRs, expressing 72, 51, 51 and 49 reporters respectively. Outside the amphid and phasmid neurons, the number of reporters expressed in sensory neurons dramatically drops, with all other sensory neurons expressing less than 10 GPCRs, in some cases only a single GPCR (Fig 1; Table S2). Of course, it needs to be kept in mind that we only consider expression of a fraction of the csGPCR loci and hence each of these total numbers is expected to increase by several fold once all csGPCR expression patterns are identified.
24% of the GPCR reporters for which we have information about neuron numbers are exclusively expressed in a single neuron class and in all these cases, the neuron class is a sensory neuron class (Fig 2; Table S3). In total, however, only nine sensory neurons express single-neuron specific GPCRs. The most striking one of them is the ADL nociceptive neuron, which expresses 23 single neuron-specific GPCR reporters (and an additional 49 GPCR reporters expressed in additional neurons). The ADL-expressed, single neuron-specific GPCRs do not fall into a specific GPCR subfamily but rather cover 7 distinct families. A small subset of the single neuron type specific GPCRs are expressed outside the nervous system as well (genes with asterisk in Fig 2A). This may indicate that these receptors do not detect external cues, but rather sense internal signals.
Notably, expression of the csGPCR reporter collection is clearly not restricted to sensory neurons. A striking 35% of the csGPCR reporters are expressed in inter- and motorneurons (Fig 1, Fig 3; Table 3; S1 Table). There is no unifying feature of the inter- or motorneurons that express GPCR reporters. They range from ventral cord motor neurons to head interneurons, and to command interneurons in the ventral cord. One interneuron, PVT, displays a very large number of expressed csGPCR reporters (57 different reporters); however, PVT expression is generally observed in an unusually large amount of reporter genes and may, like posterior gut expression, be a reporter gene artifact that relies on cryptic regulatory elements in the reporter.
97% of inter and/or motorneuron-expressed csGPCR reporters are also expressed in sensory neurons so only 3% of them show expression exclusively in inter or motorneurons. In light of the inter/motorneuron expression of csGPCR reporters, one can imagine that csGPCR reporters that are expressed in sensory neurons may actually not function as receptors for external sensory cues, but may rather function as they likely do in inter/motorneurons, i.e. as receptors of internal signals.
We asked whether csGPCR expression profiles cluster by neuron type. To this end, we undertook unsupervised hierarchical clustering of expression profiles. The bootstrap value for most associations was very weak with two exceptions: csGPCR reporters are often coexpressed in the two tail phasmid neuron classes PHA and PHB (AU/BP>95) and expression in either or both of the phasmid neurons is associated with the expression in the head neuron ASH (AU/BP>95) (Fig 4). These associations are striking since all these 3 neuron classes are nociceptive neurons that respond to some similar cues and integrate sensory inputs from the head and tail [55, 56] and that directly innervate command interneurons involved in reversal behavior [9]. While GPCRs expressed in these neurons are likely involved in sensing nociceptive cues, it is notable that these co-expressed csGPCR came from a widely distinct sets of GPCR families (Fig 4).
Left/right asymmetric expression of csGPCR reporters
One major motivation for undertaking the csGPCR reporter analysis was to identify more lateralized neuron pairs in the nervous system. In vertebrates, there is a striking dearth of molecular correlates for widespread functional lateralization of the brain. In C. elegans the chance discovery of left/right asymmetric sensory receptor expression has opened up new vistas on lateralization of the C. elegans nervous system [58]. Specifically, the lateralized expression of several csGPCRs in the AWC olfactory neuron pair [31] and guanylyl cyclase receptors in the gustatory ASE neuron pair [59] revealed a common theme of lateralization providing means of sensory discrimination [32, 60, 61]. Since lateralization provides an elegant, straight-forward means for sensory discrimination, we speculated that such lateralization may be wide-spread in the nervous system and therefore took particular care in examining whether csGPCR reporters that we analyzed are expressed in a left/right asymmetric manner.
We indeed identified eight csGPCR reporters with left/right asymmetric gene expression in an otherwise bilaterally symmetric neuron pair. However, this laterality was only observed in the context of the AWC sensory neuron pair, which was previously known to express several GPCRs in a left/right asymmetric manner [31, 62]. Using previously described sets of mutants, we found that the asymmetry of these GPCR reporters is controlled by the same calcium-dependent signaling pathway [33] that controls all other previously known asymmetric GPCRs in the AWC neurons (Fig 5). Of course, our limited analysis does not exclude the existence of left/right asymmetrically expressed GPCR genes in other neuron types, but it may not be as widespread as we initially hypothesized.
Sexually dimorphic expression of csGPCR reporters
Apart from brain lateralization, another domain of nervous system research displays a striking dearth of molecular markers. While the existence of sex-specific neurons is widely appreciated in the nervous system of most animals, including C. elegans [64], it is much less clear to what extent neurons that are shared by the two sexes of a given species display molecular differences. Recent anatomical work in C. elegans revealed intriguing synaptic wiring differences between sex-shared neurons in the two sexes [65], but even in C. elegans there is a dearth of dimorphic molecular markers of sex-shared neurons. Given the distinct priorities that males and hermaphrodites display toward food and mate searching [66] and given that a number of sex-shared sensory neurons are known to respond to different cues in a sex-specific manner [49, 67], we hypothesized that we may discover a multitude of sex-specifically expressed GPCRs. We indeed identified several GPCRs that are expressed in hermaphrodite-specific neurons (HSN, VC motor neurons) or in several male-specific neurons (Fig 6); however, we did not detect differences in GPCR expression in sex-shared neurons. We emphasize here, however, that we did not systematically analyze all 245 reporters that we analyzed in the hermaphrodite for differences in expression in the male, but rather focused on those GPCRs that show expression in 1-3 pairs of neurons in the hermaphrodites.
csGPCR reporter expression outside the nervous system
Moving outside the nervous system, we found expression of individual GPCRs in essentially all tissue types (Fig 7 shows examples; summarized in Table 4). As we already mentioned above, the non-neuronal expression is often quite specific and there are only a few GPCRs that are expressed broadly in many different cell types (e.g. srbc-58, srr-4). Specific sites of non-neuronal expression include subsets of muscle cells, hypodermal cells, specialized epithelial cells, cells of the somatic gonad (distal tip cells), individual cells of the excretory system, glial cells and others (Fig 7, Table 4). There are no obvious, specific associations of non-neuronal expression with expression in a specific set of neuron types. Also, non-neuronally expressed GPCR receptors are not biased toward a single subfamily. GPCRs expressed in non-neuronal tissues that are exposed to the environment, e.g. epidermis, could be involved in sensing external cues but other non-neuronal cells will rather respond to internal signals. As a cautionary note, we can not presently exclude that nonneuronal expression may be the result of lack of repressor elements in the reporter constructs, but we note that in C.elegans there is presently little evidence for non-neuronal repressor mechanisms restricting gene expression to the nervous system (e.g. [68]).
Reporter gene analysis of entire csGPCR gene families
Do any of the patterns described above cluster with sequence similarity (i.e. family membership) of the receptors? As described above, specific features of csGPCR expression patterns do not correlate with family membership, but we wanted to pursue this issue further via a more comprehensive analysis of entire chemoreceptor gene families. As defined by sequence analysis [7], chemoreceptor gene families have very different sizes, ranging from a single gene per family (srn family) to 223 genes per family (srh family)(Table 2). We analyzed reporter gene expression patterns of all members of two small families to examine whether there are common themes in their expression patterns, their genomic location and cis-regulatory control regions. We also analyzed the expression of the one family, the srn family, which only has a single member, which is highly conserved in Caenorhabditis species, to assess whether it may show an unusual expression pattern. However, we find the srn-1 reporter gene to be mainly expressed in amphid sensory neurons, like many other GPCRs (Fig 8).
The two small families for which we generated and analyzed reporter genes for all family members are the previously uncharacterized srm (6 members) and srr (9 members). Five out of the six srm family genes are syntenic to other family members (Fig 8). As these direct genomic adjacencies suggest local gene duplication, we could ask the question whether such local duplications also resulted in duplication of the 5’ cis-regulatory control regions and to what extent such duplicated cis-regulatory control regions retained similar expression profiles. We find that the adjacent srm-1 and srm-2 genes are expressed in a small set of mostly sensory neurons; some of these neurons are the same, others are different. The same theme applies to the adjacent srm-4, srm-5 and srm-6 genes. Their 5’ upstream regions direct expression to distinct, but partially overlapping sets of neurons.
The srr gene family is composed of 9 members. Reporter genes for all members displayed expression in diverse sets of neuron types with no common theme emerging. Outside the nervous system, it is notable that half of the family members are expressed in distinct cell types of the pharynx (Fig 8), suggesting a role for these genes in sensing food.
Temporally regulated csGPCR reporter genes
We also sought to examine dynamic aspects of csGPCR expression. We focused on dynamics that relate to developmental timing and the response to harsh environmental conditions. To facilitate the identification of changes in expression, we focused our analysis on GPCRs that are robustly expressed in the adult in a small number of neurons (in most cases not more than 1-3 neuron pairs in the head and/or 1-2 neuron pairs in the tail). At the first larval stage we didn’t detect any differences in expression in 79 out of 82 examined reporters. Due to the limitations of multicopy array based fluorescent reporters, moderate intensity changes within a cell type might be difficult to notice and could have been missed. Three reporter genes, srh-11, sru-48, and sra-28, show striking differences in first larval versus adult stages: All three reporter genes show expression in the ASK neuron at the L1 stage, but not at the adult stage (Fig 9). Additionally, srh-11 is expressed brightly in the ASI neuron at the L1 stage but dimly at the adult stage (Fig 9). Furthermore, dim expression of srh-11 and sra-28 reporter genes in the tail phasmid PHB and PHA neurons respectively, is only observed at the L1 stage but not at the adult stage (Fig 9).
GPCR reporter gene expression changes in dauers
We found that a substantial number of GPCR reporter genes were dynamically expressed when animals enter the dauer stage, an environmentally controlled diapause arrest stage that is accompanied by substantial cell, tissue and behavioral remodeling [69, 70]. Initially again focusing on reporters that are expressed in a restricted number of neurons under well-fed conditions, we found that 16 out of 46 examined reporters show a diverse set of changes in animals that were sent into the dauer stage via a standard starvation/crowding protocol (see Experimental Procedures). Many of the changes entail striking changes in the cellular specificity of GPCR reporter expression (Fig 10, Table 5). The vast majority of differences are observed in the nervous system, but some changes also occur outside the nervous system. Changes in GPCR reporter expression in the dauer stage have previously been described for two GPCR reporters [34](summarized with our novel patterns in Table 5), but the patterns we observe here are much broader and more complex. They can be summarized as follows:
(1) In most cases, there is stable and unchanged expression in several neuron classes in dauer and non-dauers, but upon dauer entry, expression is either turned on in additional neuron classes (“type I” regulation) or becomes undetectable in subsets of specific neuron classes (“type II” regulation) (Table 5; Fig 10). There are also combinations of both changes (type III regulation): In one particularly striking example, the srh-71 reporter is expressed in some sensory neurons in both dauer and non-dauers, but undergoes a striking respecification in dauers. Reporter expression becomes undetectable in the lateral IL2, PHA and an additional pair of tail neurons in dauer, and instead is turned on in the AIZ and ASG neurons (and increases expression levels in ASI). This hints toward the re-routing of internal sensory information.
(2) In a number of cases reporter expression is strongly downregulated, becoming undetectable in all neurons in which the reporter is expressed (Table 5; Fig 10).
(3) The changes outside the nervous system concern three tissue types, muscle, the excretory cell and epithelial blast cells (Fig 10). In two cases, expression of a specific csGPCR reporter is turned on in the dauer stage, while in another case expression becomes undetectable. These findings indicate that these tissue types now became receptive to signals in a dauer-specific manner, an unanticipated finding.
(4) The most recurrent set of changes in the expression of distinct reporters concern nociceptive neurons, namely the ASH, ADL and phasmid tail neurons. Of particular note is the PHA phasmid neuron, which shows the most consistent pattern of changes: 4 csGPCRs are turned off or strongly downregulated specifically in the dauer stage.
(5) The most unusual novel expression pattern observed in dauer stage animals concerns the PVP tail interneuron pair. We found that in dauers, expression of the sri-9 reporter is turned on in a left/right asymmetric manner, only in the PVPL neuron. The cellular identity of sri-9 expression (as well as other expression changes) was corroborated by examining overlap of GPCR gfp-based reporters with rfp-based landmark strain (see Experimental Procedures).
Some csGPCRs serve as molecular markers of life history
Do reporter expression changes observed in dauers recover upon re-feeding to the pattern observed in control fed animals? Examining csGPCR reporter expression in well-fed adult animals that had passed through the dauer arrest stage during larval development, we found that the expression of 11 of the 18 reporters, which showed dauer-specific gene expression changes, recovers to that of the fed state, i.e. in these 11 cases, expression in the adult is independent on whether the animals had passed through the dauer stage or not.
For 7 csGPCR reporters we discovered intriguing, cell-type specific alterations in animals which have passed through the dauer stage (Fig 11, Table 5). We observed two types of changes:
(1) For 4 reporters (for sri-9, sra-25, srh-71 and sru-12), we observed that expression which was induced in specific neuron types exclusively in dauers, retained this expression in postdauer animals: Dauer-induced expression of the sri-9 reporter in PVPL, of the sra-25 reporter in ADL, of the sru-12 reporter in PVQ as well as of the srh-71 reporter in AIZ and two ventral ganglion head motor neuron pairs is retained in post-dauer adults. In contrast, dauer-induced loss of srh-71 expression in the lateral IL2 pair does not recover.
(2) In 4 cases (sre-43, srh-71, sru-12, sra-25 reporters) we observed induction of expression in additional cell types exclusively in postdauer animals. sru-12 reporter expression is specifically induced in the PLN neurons of postdauer animals, sre-43 expression in dimly observed in the ASH neurons of postdauer animals, sra-25 expression is dimly observed in the ASJ neurons in postdauer animals and srh-71 reporter expression was induced in a non-neuronal pair, pharyngeal gland cells, in post-dauers.
In addition to these two categories, we found two instances, in which a sporadic and weak expression observed in animals that have not passed through the dauer stage will become highly penetrant and stable if they have passed through the dauer state (sra-25 in BAG neurons, sre-43 in ASJ, srh-71 in ASI).
Note that all of the reporters for which we observe changes in postdauer recovery do recover their “fed patterns” in other neuron classes (these could be considered as internal controls that argue against the changes in expression being a reporter gene artefact). Taken together, adult animals show neuron-type specific differences in the expression of GPCR reporters depending on whether they have passed through periods of distress. GPCR reporters therefore serve as reporters of life history traits.
L1 starvation recapitulates some but not all csGPCR reporter changes
We tested five of the 16 csGPCR reporters that displayed changes in the dauer stage for whether their expression also changes in another starvation-induced arrest stage, the starvation-induced L1 arrest stage. Comparing expression in 2 day-starved L1 (egg prep into M9 medium) to fed L1, we find that two reporters (str-114 and sra-25) show the same changes as observed in dauer animals (Fig 12). In contrast, two reporters (str-84 and srg-32) that change their expression in dauers, do not show changes in starved L1 vs. fed L1 (Fig 12). One reporter, srh-15, in addition to dauer-specific expression in ASK, is also expressed in ASI in starved L1. Hence, the response of GPCR expression to arrest conditions is diverse.
DISCUSSION
Together with previously published analyses, there are now reporter transgenes that monitor the expression of 373 of the ~1,300 chemosensory GPCR genes encoded in the C. elegans genome. One intrinsic limitation of reporter genes is that they do not necessarily capture the full complement of cis-regulatory control elements of a gene. However, given the compact nature of csGPCR loci, the inclusion of all 5’ regions in most reporters and the small size of introns, the number of inaccuracies may be quite limited. Irrespective of whether the reporters are a reflection of the complete expression of a csGPCR, they nevertheless function as highly valuable molecular markers of cellular identity and plasticity. Meaning, reporter gene analysis decodes cis-regulatory information and provides read-out of regulatory states of specific cell types. The key conclusions of the expression patterns inferred from the reporter genes are as follows:
Restricted expression
Most csGPCRs show a very restricted expression in few cell types. Many GPCRs are expressed in single neuron classes. Those csGPCRs that express in multiple neuron types do not display a coherent set of coexpressing neurons, with one notable exception: the nociceptive ASH, PHA and PHB neurons display similar (but not identical) sets of csGPCR expression.
csGPCR coexpression within a neuron class
Some neurons display a remarkably large number of GPCRs. The ASI neuron displays the most csGPCR genes at 99, followed by many distinct types of nociceptive neurons. While csGPCRs have been found for all but two sensory neurons (URY and ADE), there is a striking disparity in the number of csGPCRs coexpressed in sensory neuron types. Amphid sensory neurons clearly coexpress the largest number of GPCRs, while other sensory neurons express many fewer csGPCRs. The nociceptive ADL stands out in the list of amphid neurons, as it is the neuron expressing the most single-neuron specific GPCR reporters.
Expression in sensory and non-sensory neuron classes
While expression of csGPCRs clearly predominates in sensory neurons, they are also expressed in inter- and motorneurons and in a diverse set of non-neuronal cells. In most cases, each GPCR is restrictively expressed, suggesting that many different cell types in an organism show very distinct and cell-type specific responses, likely to internal signals. The similarity of one GPCR family, the srw family, to peptide receptors of other animal species provides hints to the nature of these ligands [11, 29]. The expression of many members of the srr family in pharyngeal tissues suggests another source of ligands; perhaps these receptors respond to cues from ingested bacteria. In vertebrates, chemosensory GPCRs are now also becoming increasingly appreciated as being expressed in non-neuronal cells [6].
Polymodality of sensory neurons
csGPCRs were detected in sensory neurons that are known to express distinct types of sensory receptors and engage in non-chemosensory behavior, e.g., in gas-sensing neurons or different types of mechanosensory neurons. The expression of csGPCRs in these neuron classes may hint toward these neurons perceiving different sensory inputs, i.e., they are likely polymodal. However, as discussed above, csGPCRs expressed in these neurons may not be involved in detecting external sensory cues, but measuring internal states.
Absence of gene family themes
The absence of any overarching expression theme within gene families is striking. We did not observe that the expression of family members clusters in specific neuron types or share any other specific expression features. Specifically: (a) left/right asymmetrically expressed csGPCRs in the AWC neurons do not fall into the same family; (b) csGPCR reporters that are differentially regulated in larval stages or in the dauer stage do not come from a single family; (c) GPCRs that share specific expression pattern themes (e.g., coexpression in the nociceptive ASH, PHA and PHB neurons) do not derive from specific families; (d) non-sensory neuron-expressed or non-neuronal expressed GPCRs do not fall into a specific family. The only glimpse of perhaps some common function is observed in the small srr family (9 genes), half of which appear to be expressed in nonneuronal pharyngeal tissue. An important note of caution is that these conclusions are based only on reporter genes. However, the substantial sample size on which these conclusions are based lends some credence to these conclusions.
Combinatorial complexity
GPCRs generally act as homo- or heterodimers [71], thereby hugely increasing the amount of distinct sensory receptor complexes expressed in a cell. This combinatorial activity also makes prediction of function of a given GPCR very difficult in that a GPCR may have one function expressed in one cell (in combination with another GPCR), while it may have a very different function in another cell (in combination with yet another GPCR).
Left/right asymmetric csGPCR expression patterns
While we recovered novel csGPCR genes expressed in a left/right asymmetric manner in the AWC neuron pair, we were surprised to find no other obvious left/right asymmetries in other sensory neuron pairs. Of course such asymmetries may still be found with currently not analyzed GPCR genes, but the number of AWC asymmetries recovered suggest that AWC neurons may be exceptional in their extent of lateralization.
The only other asymmetry that we found revealed itself not in a sensory, but an interneuron and only in an non-anticipated context. The sri-9 reporter transgene becomes induced in dauer animals in PVPL, but not PVPR, and PVPL expression is retained in postdauer animals. Molecular asymmetries in PVP neurons have not previously been reported but can perhaps be inferred by the fact that PVPL and PVPR are innervated in a left/right asymmetric manner by unilateral neurons. Specifically, PVPL, but not PVPR, is innervated by the unilateral DVB neuron. Perhaps sri-9 may play a role in this synaptic signaling context, but why this should be dauer-specific is unclear.
Plasticity of csGPCR expression
One notable feature of our analysis was the extent of plasticity that csGPCR reporters show in the context of the dauer stage. Dauer animals are thought to remodel most tissue types and significantly alter behavioral patterns. Changes in csGPCR expression and hence changes in the external and internal signal perception fit very well into the mold of organismal plasticity and illustrate the plasticity of many different tissue types (note, for example, the changes in csGPCR expression in muscle). We find it particularly intriguing that several csGPCRs represent markers of life history. Some of the changes in GPCR reporter gene expression in dauers is retained in postdauer animals and some csGPCR reporters turn on only in postdauer animals. Animal-wide expression transcriptomic analysis has previously identified large cohorts of transcripts that, like our csGPCR reporters, serve as marker of dauer life history, i.e. transcript changse in dauers and these transcript changes persisting in post-dauer animals [72]. However, due to the whole animal nature of this analysis, this previous study lacked cellular resolution. Our findings add a novel spatial component to these previous findings, since we find the life history traits to be strikingly neuron-type specific. The expression of the TRP channel gene osm-9 has also previously been shown to be modulated during dauer and postdauer stages in a neuron class-specific manner; in this case, osm-9 expression is downregulate in the ADL (but not AWA) chemosensory neurons and the repression is retained post-dauer, using RNAi-and chromatin-based mechanisms [73]. In all except one case that we report here, we observe the opposite post-dauer effect; reporters that are turned on in dauers, persist in non-dauers. The mechanistic basis of this may hence be distinct from the osm-9 case.
It is important to remember here that the life history trait observations are based on transcriptional reporter genes which, on the one hand, may not accurately reflect expression of the endogenous locus, but, on the other hand, clearly provide a definitive molecular “read out” of changes in the “regulatory state” of specific neuron types, depending on whether they have passed through the dauer stage or not. Moreover our transcriptional reporter also argue that the life history regulatory phenomenon must be transcriptional in nature. These GPCR reporters will therefore provide excellent starting points to analyze the molecular mechanisms controlling this plasticity.
Future uses of the csGPCR expression map
The csGPCR reporter atlas can be put to a number of future uses. The sites of expression of specific csGPCRs point to potential functions of the csGCPRs, guiding the future analysis of csGPCR knockout strains – many available by knockout consortia or easily constructable by CRISPR/Cas9 technology. For example, csGPCRs expressed in the polymodal nociceptive ASH, ADL and phasmid neurons may be mediating the response to a number of distinct sensory cues processed by these neurons [56, 74].
csGPCR expression patterns point to perhaps unexpected cellular sites of internal signal perception that warrant further investigation. For example, the excretory canal cell expresses at least six csGPCRs reporters (considering that we only examined reporters for ~20% of GPCR loci, this number may increase several fold). The relevance of this expression could be tested through the excretory cell-specific expression of dominant negative versions of G-protein downstream signaling components. Similarly, the cellular dynamics in csGPCR expression patterns point to specific cells undergoing changes that warrant future characterization. For example, the induction or suppression of csGPCR reporter expression during the dauer stage in specific sensory and interneurons that were not previously associated with dauer-specific functions may warrant a closer examination of other molecular and functional changes of these neurons during the dauer stage.
Because csGPCR reporter fusions also link precisely delineated sequences (used for reporter construction) to specific cellular sites of gene expression, patterns of coexpression of GPCRs can be used to extract cis-regulatory information, which in turn may point to trans-acting factors involved in controlling GPCR gene expression. A proof of principle for this type of analysis has already been conducted, pointing to a critical function of, for example, a bHLH factor in controlling csGPCR expression in the ADL nociceptive neuron [28] and with now substantially more expression information available can be further extended to additional cell types.
Lastly, GFP reporter transgenes have generally served as invaluable starting points for genetic mutant screens in which the genetic control of specific biological processes can be investigated. The GPCR reporter collection provides a multitude of entry points. For example, the postdauer expression of multiple reporter genes can be used to screen for mutants in which these life history traits fail to be properly expressed. GFP reporter genes have also served as invaluable cellular identity markers and here again the csGPCR reporter collection can be used to assess how the identity of specific cell types is genetically controlled.
ACKNOWLEDGEMENTS
We thank Q. Chen for generating transgenic strains, the Vancouver gene expression consortium for generating strains and the CGC for providing these strains to us. We thank Neda Masoudi and Sarah Finkelstein for communicating unpublished results.