Phenotype-driven transitions in regulatory network structure

Megha Padi; John Quackenbush

doi:10.1101/142281

Abstract

Complex traits and diseases like human height or cancer are often not caused by a single mutation or genetic variant, but instead arise from multiple factors that together functionally perturb the underlying molecular network. Biological networks are known to be highly modular and contain dense “communities” of genes that carry out cellular processes, but these structures change between tissues, during development, and in disease. While many methods exist for inferring networks, we lack robust methods for quantifying changes in network structure. Here, we describe ALPACA (ALtered Partitions Across Community Architectures), a method for comparing two genome-scale networks derived from different phenotypic states to identify condition-specific modules. In simulations, ALPACA leads to more nuanced, sensitive, and robust module discovery than currently available network comparison methods. We used ALPACA to compare transcriptional networks in three contexts: angiogenic and non-angiogenic subtypes of ovarian cancer, human fibroblasts expressing transforming viral oncogenes, and sexual dimorphism in human breast tissue. In each case, ALPACA identified modules enriched for processes relevant to the phenotype. For example, modules specific to angiogenic ovarian tumors were enriched for genes associated with blood vessel development, interferon signaling, and flavonoid biosynthesis. In comparing the modular structure of networks in female and male breast tissue, we found that female breast has distinct modules enriched for genes involved in estrogen receptor and ERK signaling. The functional relevance of these new modules indicate that not only does phenotypic change correlate with network structural changes, but also that ALPACA can identify such modules in complex networks.

Significance statement Distinct phenotypes are often thought of in terms of unique patterns of gene expression. But the expression levels of genes and proteins are driven by networks of interacting elements, and changes in expression are driven by changes in the structure of the associated networks. Because of the size and complexity of these networks, identifying functionally significant changes in network topology has been an ongoing challenge. We describe a new method for comparing networks derived from related conditions, such as healthy and disease tissue, and identifying emergent modules associated with the phenotypic differences between the conditions. We show that this method can find both known and previously unreported pathways involved in three contexts: ovarian cancer, tumor viruses, and breast tissue development.

INTRODUCTION

We tend to think of phenotypes as being characterized by differentially expressed genes or mutations in particular genes. However, the individual genes that show the greatest changes in expression in a phenotype do not tend to be drivers of that phenotype (1, 2). Despite the increasing power and depth of sequencing studies, identifying the causal mutations and Single Nucleotide Polymorphisms (SNPs) that are responsible for determining heritable traits and disease susceptibility remains challenging. Indeed, many studies have found thousands of genetic variants of small effect size contribute to common traits (3–5). It has become apparent that phenotypes are driven by complex regulatory interactions between multiple genes and variants that together define the state of the cell. Modeling these phenotypes requires that we have a clearer picture of how genes and proteins work together to perform normal cellular functions, and how remodeling the interactions between genes can cause changes in phenotype including disease.

In this context, we need to subtly shift our understanding and think of a phenotype as being defined by a network of interacting genes and gene products. Exploring the topology of such networks can provide important biological insight into phenotypic properties. For example, high-degree “hubs” in protein-protein interaction (PPI) networks are enriched for genes essential to growth (6). Biological networks are known to have modular structure and contain closely interacting groups of nodes, or “communities,” that work together to carry out cellular functions (7–9). There are many analytical and experimental methods for inferring network models associated with different phenotypic states (10–12). However, the most significant questions we can ask of biological networks – how networks differ from each other, and how differences in network structure relate to functional changes – remain largely unanswered.

Most analyses of so-called “differential networks” have focused on determining which edges are altered relative to a reference network (13). While the advantage of this approach is its simplicity, there are several issues that arise in such an edge-based analysis. First, biological network inference has a relatively high rate of false negatives due to noise in both the experimental data that are used and in the network inference methods themselves. Consequently, it can be difficult to determine whether the appearance or disappearance of a single edge is “real.” The uncertainty in the estimate of the difference between two edge weights is the sum of the uncertainties in each individual edge, which inflates noise in the final differential network. Second, the perturbed network will in general contain both positive and negative changes in edge weight relative to the reference network, and it is challenging to analyze and interpret a differential network with mixed signs. If we only consider the new edges associated with a phenotype, we would miss the functional effects of decreases in edge activity. Third, by focusing only on the altered edges and discarding common edges, the differential interactions are taken out of their functional context, making it challenging to connect them to global cellular changes. For example, adding or deleting ten scattered edges in a network may have very different consequences on the phenotype than would the same number of changes concentrated in a local functional neighborhood of the network.

One way to address these issues and find more robust differences between networks is to identify changes in groups of nodes, rather than in individual edges. Computational methods that have been developed to do this fall into several categories. First there are methods that evaluate differences in pre-specified network features, like user-defined gene sets, small regulatory motifs or global topological characteristics. For example, Gamberdella et al. evaluated the statistical significance of differences in co-expression of a user-defined gene set between two conditions (14). Similarly, the coXpress method defines clusters using co-expression in the reference condition, and tests for significant changes in each cluster under a new condition (15). Landeghem et al. developed a method for inferring the best differential network that contrasts two datasets, and Gill et al. and Danon et al. introduced new measures to test whether global modular structure and degree characteristics are different between two networks (16–18). However, these methods are limited to examining pre-defined gene modules and network features, and fail to take full advantage of the network structure. As such, they lack the ability to discover new pathways and network modules that functionally distinguish different phenotypes.

Other methods have been developed to discover de novo gene modules that differ between conditions. The DiffCoEx algorithm iteratively groups genes that are differentially co-expressed to find new modules (19, 20). Valcarcel et al. compared metabolite correlation networks to discover groups of metabolites that changed their correlation pattern between normal weight and obese mice (21). These methods are based on first computing the most differential edges and then grouping them together, which increases the uncertainty of each edge estimate and does not incorporate functional edges that are present in both conditions (13, 22), thus losing network context.

Another class of methods attempts to identify “active modules,” which are groups of genes that are differentially expressed in a particular disease or condition and also highly connected in a reference network, such as the protein-protein interaction network (23). However, the “active modules” framework only uses differential gene expression and so focuses on the nodes rather than accounting for changes in the strength of regulatory edges.

We present a new graph-based approach called ALtered Partitions Across Community Architectures (ALPACA) that compares two networks and identifies de novo the gene modules that arise in the networks as the phenotype changes. ALPACA is based on modularity maximization, a technique commonly used to find communities in a single graph. As applied previously, modularity is a measure of the observed edge density of the communities as compared to their expected density in a degree-matched random graph. Although this technique is powerful, it has a “resolution limit” because communities can only be identified if they are larger than the typical cluster size in random graph configurations (24). This lack of resolution is especially disadvantageous when studying transcriptional networks, which tend to have a dense and hierarchical structure, and whose functional units only become evident under different environmental conditions (25). A framework based on modularity maximization has been created to find common community structure among multiple networks (26), but the only way to detect differences is to apply modularity maximization to each network separately, followed by brute-force comparison of the two resulting community structures.

In ALPACA, we adapt the modularity framework to compare condition-specific networks to each other rather than to a random graph null model. We define a score called the “differential modularity” that compares the density of modules in the “perturbed” network to the expected density in a matched “baseline” network, allowing us to contrast, for example, networks from disease and healthy tissue samples and partition the nodes into optimal differential modules, without relying on predefined gene sets or pathways. In contrast to methods that simply cluster the most differential edges, ALPACA compares the full network structures active in each condition and reduces the noise from individual edges by estimating an aggregated null model. And because the null model is based on the properties of a known reference network rather than on a random graph, the usual “resolution limit” does not apply, and ALPACA can detect small disease modules otherwise hidden within larger regulatory programs associated with normal cellular functions.

To demonstrate the utility of ALPACA, we apply it to compare simulated networks, as well as transcriptional network pairs from non-angiogenic and angiogenic subtypes of ovarian cancer, normal human fibroblasts and fibroblasts expressing tumor virus oncogenes, and male and female breast tissue from the Genotype-Tissue Expression (GTEx) project. We find that ALPACA produces higher resolution and robustness than other network approaches and identifies modules enriched in biological processes relevant to the phenotypes we are comparing. Although we have focused on transcriptional networks, the framework we present here is mathematically general and could be applied to find the differences in modular structure between any two networks.

RESULTS

Modularity maximization and comparing community structures

Many methods for determining the community structure of a network are based on maximizing the modularity (27):

Here, A_ij indicates the adjacency matrix of the network, m is the number of edges, d_i is the degree of node i, and C_i is the community assignment of node i. The modularity represents to what extent the proposed communities have more edges within them than expected in a randomly connected graph with the same degree properties; this null expectation is represented in the second term of the equation above. The modularity is optimized over the space of all possible partitions {C} and the value of C_i corresponding to the maximum modularity then determines the community structure of the network. An exhaustive search is not possible for large networks, but many methods have been developed to find locally optimal community structure, including ones based on edge betweenness, label propagation, and random walks (27–29). The Louvain algorithm is a particularly efficient way to find high-quality local optima of the modularity function (30).

Community comparison and edge subtraction

Having arrived at a pair of inferred networks corresponding to different phenotypic states, there are two straightforward ways to compare the community structures based on the modularity metric (Figure 1). One method, which we will call “community comparison,” consists of using modularity maximization to find the community structure for each network individually, and then finding the nodes that alter their community membership between the two networks. Another method, which we will call “edge subtraction,” is to compute the differences in the edge weights between the two networks, and then apply modularity maximization to the resulting subtracted weights.

Figure 1. Methods to compare networks and find changes in modular structure.

“Community comparison” identifies communities separately in each network and looks for nodes that change their community membership. “Edge subtraction” finds communities by subtracting the networks and finding communities in the resulting differential edges (red arrows). ALPACA looks for groups of genes that are more interconnected in the perturbed network than expected given the community structure of the baseline network. Flowchart shows the major steps in the implementation of ALPACA.

Both methods can detect large, dramatic changes in network structure. However, there are important differences in these methods. “Community comparison” is limited in its ability to detect structural changes smaller than the average community size in each individual network. In contrast, “edge subtraction” acts on the difference of the edge weights, which reduces the density of the network and increases the resolution, but this method is also more strongly affected by noise in the individual edges. Further, only positive edge weight differences can be used to run modularity maximization in the subtracted network, so edges that are lost are not appropriately accounted for; incorporating both positive and negative edge weight differences requires more complex techniques (31, 32).

ALPACA: A new method for detecting changes in community structure

To overcome some of the limitations of the community comparison and edge subtraction methods, we developed ALPACA, a new algorithm based on modularity maximization. The unique aspect of ALPACA is that, rather than comparing edge distributions to a random null model, we compare edges of the “perturbed” network to a null model based on the “baseline” network to find differential gene modules between the two networks (Figure 1). ALPACA optimizes a new quantity called “differential modularity,” which we define as

This score compares the number of edges in a module M in the perturbed network – whose adjacency matrix is given by and total edge weight is m_P – to the expected number of edges based on the pre-computed community structure {C} of the baseline network. Here, N_ij is defined as where C_i is the community assignment of node i in the baseline network, and is the normalized weight of the edge between node a and node b in the baseline network: . For the normalization, we have chosen to globally scale the edge weights of the baseline network so that the total matches m^P, the sum of the edge weights in the perturbed network. This allows a fair comparison between two networks that could be derived from two datasets of differing quality or sample size and may have different global sensitivity properties. To identify the modules {M} that maximize the differential modularity, we use the following two-step procedure. First, we determine the community structure of the baseline network using established methods (9, 30). Second, we compute the differential modularity matrix D_ij and apply the Louvain optimization algorithm to iteratively aggregate the nodes into modules (30).

Note that the equation above is presented in a form that applies to weighted bipartite networks, as we will be applying it to analyze transcription factor (TF)-gene interactions. It can be easily adapted to analyze other types of networks. More details about the implementation of all three methods – community comparison, edge subtraction, and differential modularity – are presented in the Materials and Methods section.

Evaluating the performance of ALPACA on simulated networks

We reasoned that ALPACA would be more sensitive to small changes in modular structure than methods based on standard community detection, because the null model is computed using detailed properties of the baseline network rather than relying on random graphs. We also believed that ALPACA would be less sensitive to noise in individual edge weights than edge subtraction, because the null model is estimated by averaging over communities in the baseline network. We set out to test these properties in a setting that resembles real biological networks as much as possible, but where we have control over the changes in modularity.

To do this, we constructed a baseline network and then created new modules through the “addition” of new edges, resulting in a perturbed network. For the noiseless version of this simulation, we inferred a regulatory network by integrating known human transcription factor (TF) binding sites with gene expression data in normal human fibroblasts using the algorithm PANDA (33) (see Materials and Methods for further details). After thresholding the edge weights and applying CONDOR (9), a method for community detection in bipartite networks, we found that the baseline network had five communities of varying sizes. Next, we simulated a set of perturbed networks by choosing a random subset of TFs and genes and adding new edges between them, thus artificially creating a new module. The new module consisted of between 3 to 21 TFs, and five times as many genes as transcription factors.

To these simulated networks, we applied three differential community detection methods – community comparison, edge subtraction, and ALPACA – and ranked the nodes by their contribution to the final score for each method. We then used Kolmogorov-Smirnov and Wilcoxon tests to evaluate whether the “true” module ranked higher than expected by chance in each ranked list. The edge subtraction method demonstrated superior performance for recovering modules of all sizes (Figure 2A); this is to be expected, since the only new edges added to the networks were within the new modules. Examining the results from the other two methods, we observed that ALPACA is substantially better than community comparison at detecting smaller modules. Specifically, in a network with a total of ~2500 nodes, community comparison was unable to detect new modules with less than ~110 nodes, whereas ALPACA could reliably detect modules as small as 66 nodes.

Figure 2. Performance of three methods on simulated networks with added module.

Network at left visualizes the regulatory network derived from normal human fibroblasts, with purple, yellow, orange, pink and blue denoting the pre-existing community structure, and red nodes depicting the synthetically added module. Bar graphs show performance of each method – ALPACA, edge subtraction or community comparison – on three random and independent network simulations with (A) or without (B) resampling of edges among the preexisting communities. P-values computed using Wilcoxon test.

We then introduced edge noise into the “addition” simulation while retaining the modular structure of the underlying network. To do this, we made another series of perturbed networks where, in addition to introducing the new module as described above, we also randomly resampled the edges from the baseline network while retaining the inter- and intra-community edge density. In this more realistic set of simulations, we found that ALPACA outperformed the other methods across a range of module sizes (Figure 2B).

To check that these results are independent of the particular optimization algorithm used, we repeated the analysis using the Louvain method instead of CONDOR for initial community detection in the community comparison and edge subtraction methods. The results were very similar in both cases (Supplementary Figure 1). This indicates that the superior performance of ALPACA is not due to the optimization method used, but rather arises directly from the definition of the differential modularity.

While the edge subtraction method works well to detect “added” modules under low noise conditions, it becomes problematic if edges are deleted or if their weights decrease in the perturbed state relative to the control, because most network clustering methods are only formulated for positive edge weights. One might suggest transformation of edge weights, but any simple transformation of negative edge weights to make them positive (for example, by exponentiation or a linear shift) would bias the results. Algorithms that directly incorporate negative edge weights are complex and involve multiple steps and assumptions (31, 32). In contrast, ALPACA’s differential modularity matrix D_ij contains both negative and positive values, corresponding to areas of decreasing and increasing edge density relative to the baseline network and its community structure. By optimizing over the sum of D_ij, ALPACA incorporates positive and negative changes in edge density in a symmetric fashion.

As a simple demonstration of ALPACA’s ability to detect community structure changes with negative weights, we created “subtracted” simulations in which selected edges in a baseline network are reduced in weight to produce a substantially different perturbed network structure (Figure 3 and Supplementary Figure 2; see Materials and Methods for more details). In Figure 3, for example, the network consists of two dense node groups, A and B, which are more strongly connected together in the baseline condition (edge weight 0.8) than in the perturbed condition (edge weight 0.2). Therefore, the perturbation causes groups A and B to separate and perform distinct functions; intuitively, this means groups A and B characterize the change in modular structure between the two networks. Because the only change in edge weights is the decrease in edges between A and B, the edge subtraction method results in a network with negative edge weights.

Figure 3. Performance of three methods on perturbations that decrease edge density.

Left hand side shows a network transition involving a decrease in edge weights between nodes in Groups A and B. All other edges remain the same. Right hand side shows the results of three methods when comparing these two networks, with the computed differential community structure indicated by node coloring. Note that the “edge subtraction” method needs to be applied in the reverse manner, comparing the baseline network against the perturbed network, in order to have positive differential edge weights.

If instead we reverse the process and subtract the perturbed network from the baseline network, the resulting positive edge weight network produces two modules, one consisting of TFs in group A linked with genes in group B, the other consisting of TFs in group B linked with genes in group A. This does not match the intuitive result we are looking for. The community comparison method detects no change because both the baseline and perturbed networks are composed of the same two node communities. However, ALPACA correctly identifies groups A and B as the differential modules characterizing this transition.

An example with three node groups is shown in Supplementary Figure 2. Again, we find that ALPACA identifies the key change in modular structure and edge subtraction cannot. Although these examples are simple, such areas of decreased edge density will be locally embedded in any realistic biological network and will strongly influence the identification of neighboring modules.

Angiogenic vs. non-angiogenic ovarian cancer tumors

Ovarian cancer is the second most common cause of cancer death among women in the developed world. Available treatment options for ovarian cancer, such as platinum-based therapies, often lead to chemoresistance and recurrence. Ovarian cancer tumors can be stratified by gene expression profile, tissue of origin, or other characteristics, in order to better understand heterogeneity and predict patient-specific therapeutic strategies. We previously found that a gene signature associated with angiogenesis is able to classify ovarian cancer patients into a poor-prognosis subtype (34).

We classified 510 ovarian cancer patients from The Cancer Genome Atlas into 188 angiogenic and 322 non-angiogenic tumors and used PANDA to infer separate gene regulatory networks for the two subtypes, as described in (35). We then applied a variety of methods to look for changes in community structure associated with the angiogenic tumors, ranked the nodes by their contribution to the total score for each method (see Materials and Methods), and evaluated the core genes in each set for functional enrichment. In order to evaluate the unique contributions of ALPACA, we first applied standard community detection techniques to identify communities in each subtype-specific network, using both the Louvain method and CONDOR, and we looked for GO terms that were statistically enriched in the angiogenic network but not in the non-angiogenic network. Next, we applied edge subtraction, community comparison, and ALPACA to directly identify differential modules associated with angiogenic tumors. The GO term enrichment with P_adj < 0.05 for each method is presented in full in Supplementary Table 1.

Consistent with what we observed in the simulated networks, ALPACA had higher resolution than the other methods and identified 17 modules specific to the angiogenic network. Strikingly, ALPACA was the only method that identified a gene module enriched in “blood vessel development,” the pathway that we know drives the phenotypic difference between these two ovarian cancer subtypes. Standard community detection methods did not find such a cluster. The non-angiogenic network communities were enriched for histone methylation, embryo development, G-protein coupled receptor signaling, interferon signaling, and chromatin assembly, whereas the angiogenic communities were enriched for cAMP biosynthetic process, response to fibroblast growth factor, MAPKK activity and interferon signaling (Supplementary Table 1). The community comparison method did not yield any enriched GO terms. The edge subtraction method resulted in four large modules enriched for general processes like regulation of cell shape, extracellular matrix organization, nucleosome assembly, and immune response (Supplementary Table 1).

ALPACA led to more specific GO term enrichment than the other methods, suggesting that it was able to more carefully refine differential module structure. For example, instead of general GO terms like “immune response,” the ALPACA modules were enriched for particular immune-related pathways like Type I interferon response, interleukin production, and regulation of the NFκB pathway, and inflammation. Other enriched pathways included JAK-STAT and growth hormone signaling, urogenital development, triglyceride homeostasis, flavonoid glucuronidation, and cell migration. Some of these pathways, like JAK-STAT and cell migration, have already been associated with ovarian tumor progression, while others like flavonoids and triglycerides have only tentative connections with risk of ovarian cancer. We note that most of the ALPACA GO term results could not be found by running community detection on the angiogenic network alone, which shows that ALPACA partitions nodes in a novel manner that does not merely reflect the underlying community structure of the disease network but instead highlights the changes in modular structure between conditions (Figure 4, inset).

Figure 4. ALPACA modules associated with angiogenic ovarian tumors.

Right hand side shows five of the modules, with nodes colored by their membership. Edge opacity is proportional to its contribution to the differential modularity. Network is annotated with representative enriched GO terms with P_adj < 0.05, and the genes annotated by the shown GO terms are labeled in larger font. Left hand side shows the relationship between the ALPACA modules (denoted by M) and the community structure of the angiogenic network (denoted by ANG). Edge thickness depicts the fraction of genes in that differential module that are present in a particular angiogenic network community. The size of each node is proportional to the number of genes in that module or community. Bottom inset: Same networks as above, but colored by community membership in the angiogenic network rather than by membership in the ALPACA modules.

We also note that running ALPACA in reverse, to find modules present in the non-angiogenic network as compared to the angiogenic network, results in a substantially smaller set of enriched GO terms, which fall mostly into the metabolic and immune categories, with no enrichment in blood vessel development (Supplementary Table 1). ALPACA therefore selectively identifies biological signals associated with the specific phenotype under study.

We examined the ALPACA modules and their connections to ovarian cancer in more detail, focusing on non-redundant GO terms that had an overlap of three genes or more with the module in which it was enriched (Figure 4). Module 4 was enriched for “flavonoid glucuronidation” and contains the UDP glucuronosyltransferases UGT2B15, UGT1A8, and UGT2B17, enzymes that can help metabolize flavonoids and regulate hormones. Studies have hinted that dietary intake of flavonoids may reduce the risk of ovarian cancer (36–38) but the association is not statistically robust, and the mechanism is unknown. Our results suggest that the UGT family of enzymes may mediate the connection between flavonoids and ovarian cancer. Module 5 is enriched in “urogenital system development” and contains several genes that are highly relevant to ovarian cancer. HNF1B is known to be a subtype-specific ovarian cancer susceptibility gene (39). Its expression level and promoter methylation status is predictive of clear cell and invasive serous subtypes of epithelial ovarian cancer. ESR1 is the estrogen receptor and is central to breast and ovarian cancer. IQGAP1 is a scaffold protein whose expression appears to drive invasion and progression of ovarian cancer tumors (40–42). SOX11 acts as a tumor suppressor in ovarian cancer, and its expression is regulated by methylation and predicts patient survival (43, 44).

We found that module 12 was enriched in triglyceride homeostasis. Although it is not known whether there is a dietary effect of triglycerides on ovarian cancer risk, several studies have noted that ovarian carcinomas have distinctive lipid profiles and metabolic characteristics (45). Our results suggest that metabolic pathways involving hepatic lipase C (LIPC) and glucokinase regulator (GCKR) may be mobilized differently in poor-prognosis ovarian cancer. Finally, modules 16 and 17 were enriched for various terms involving interferon response, interleukins, and regulation of the NFκB pathway, consistent with the theory that chronic inflammation is associated with risk of cancer (46). Specifically, the interleukin IL6 has been proposed as a therapeutic target, and IL12 is a prognostic factor in ovarian cancer (47–49). Interferons have cytotoxic properties in ovarian cancer cells (50, 51). NFκB activation is correlated with poor prognosis in ovarian cancer, and blocking the NFκB pathway can reduce anchorage-independent growth and invasiveness in cell culture assays (52).

Module 7 was enriched in “blood vessel development” and “positive regulation of cell migration,” reflecting the invasive and angiogenic characteristics of poor-prognosis ovarian tumors. The apoptosis gene PDCD6 is a member of both GO terms and is topologically central to this module. Interestingly, it is a known predictor of progression free survival in ovarian cancer and synergizes with cisplatin to inhibit ovarian cancer cells in vitro (53–55). CYR61, also a member of both GO terms, is an extracellular matrix (ECM) signaling protein that is overexpressed in poor prognosis ovarian carcinoma (56, 57). CTGF (connective tissue growth factor) is an angiogenic ECM protein, and it appears to have an inverse relationship with CYR61; high CTGF expression correlates with low CYR61 in low-grade tumors with increased survival (58). Overall, this suggests that the ALPACA modules contain functional groups of prognostic genes that may interact with each other to produce distinct phenotypes. ALPACA could therefore be a useful feature selection step to isolate small groups of pathway genes and build more complex predictive models.

Module 7 was also enriched in growth hormones and the JAK-STAT cascade. The JAK-STAT pathway is constitutively active in breast, ovarian and prostate cancers, and nuclear localization of activated STAT3 is associated with worse survival and chemoresistance in ovarian cancer. Treatment with JAK2 inhibitor reduces tumor burden in ovarian cancer xenografts (59). Members of module 7 that are annotated with this GO term include growth hormones 1 and 2 (GH1 and GH2). This pathway is already known to be a drug target in ovarian cancer, and growth hormone-releasing hormone (GHRH) antagonists reduce proliferation of ovarian cancer cells both in vitro and in vivo (60–62).

Tumor virus perturbations in primary human cells

DNA viruses hijack the host cell cycle to jumpstart viral genome replication. Tumor viruses can do this so effectively that they lead to aberrant cell proliferation and tumorigenesis, and studying tumor viruses can shed light on the molecular mechanisms behind cancer. Previously, we expressed a panel of 63 proteins from four families of DNA tumor viruses – Epstein-Barr virus (EBV), human papillomaviruses (HPV), polyomaviruses, and adenovirus – in IMR90 primary human fibroblasts and generated gene expression profiles for each cell line (63). To construct regulatory networks, we divided the gene expression data into two groups, the first corresponding to the 37 viral proteins classified as “transforming” due to their tumorigenic properties, and the second corresponding to all the control cell lines that contain either empty vectors or GFP. We used PANDA to infer networks by combining gene expression from each sample group with a prior map of cell type-specific DNase-I-hypersensitive TF binding sites (33).

We first ran standard community detection on each network, using the Louvain method for modularity maximization. The control network contained communities enriched in cell migration, axon guidance, and wound response (Supplementary Table 2). The communities in the transforming viral oncogene network were enriched for epithelial-mesenchymal transition, cell migration, axon guidance and wound response. Since an important function of fibroblasts is to migrate and heal wounds, many of the results from standard community detection appear to be cell type-specific processes that are not specific to viral oncogenes. The genes with the biggest changes in community assignment were enriched in BMP response and natural killer cell development (Supplementary Table 2). Applying the edge subtraction method using Louvain or CONDOR optimization methods resulted in enrichment for chromatin modification, the Toll-like Receptor (TLR) pathway, and immune response. We then applied ALPACA to compare the two networks. Like the edge subtraction method, ALPACA also revealed changes in immune response and chromatin modification but, importantly, it also found significant enrichment for “mitotic cell cycle,” which is the main process we expect to be perturbed by tumor viruses (Figure 5). Consistent with this, we had previously found that fibroblast cell lines expressing transforming viral oncogenes have significantly altered growth rates (63).

Figure 5. ALPACA modules associated with transforming viral oncogenes.

Network shows five modules, with nodes colored by membership in differential modules. Edge opacity is proportional to its contribution to the differential modularity. Network is annotated with representative enriched GO terms with P_adj < 0.05. Genes annotated by the shown GO terms are labeled in large font.

ALPACA was also the only method to identify communities representing several cancer pathways that are known to be targeted by tumor viruses, including extracellular matrix (ECM) organization, NFκB signaling, and embryonic development. Module 1 is enriched in “cellular calcium ion homeostasis” and “regulation of NIK/NF-kappaB signaling.” NFκB and Nuclear factor of activated T-cells (NFAT) are two important cancer-related pathways that activate immune cells, and NFAT activity is modulated primarily through intracellular calcium levels. Merkel cell polyomavirus and EBV LMP1 are both known to functionally perturb the NFκB pathway through different mechanisms (64, 65). EBV, HPV16 and several polyomaviruses target the genes CHI3L1, TLR9, and SOCS1, which are all among the top-scoring nodes in this module (66–71). EBV and HPV infections both alter calcium signaling in the host cell (72, 73). Tumor viruses use these pathways in a variety of ways to increase cell growth and manipulate the innate immune response.

We found that module 4 was enriched in many terms related to “embryonic morphogenesis” and development. We previously found that tumor viruses co-opt the Notch pathway, which is central to embryonic development, in order to promote cell growth and tumorigenesis (63). The GO term enrichment among the target genes in module 4 is driven by the homeobox (HOX) TFs, whose expression is regulated by EBV LMP1 and HPV E7 through differential methylation (74–76). Module 4 was also enriched in “extracellular matrix organization.” The epithelial-to-mesenchymal (EMT) transition is a key step in epithelial tumorigenesis, and cells undergoing EMT often acquire the ability to degrade extracellular matrix (ECM) proteins and increase their invasive potential (77). In particular, the transforming HPV E6 and E7 proteins are able to upregulate matrix metalloproteinases (MMPs) in order to degrade ECM and increase cell migration, thus leading to cellular transformation (78).

Both ALPACA and the edge subtraction method detected a difference in the regulation of histones, suggesting that epigenetic changes may be a key factor in the transformation of human cells by viral oncogenes. ALPACA also identified a separate module (module 8) that was enriched in proteins involved in “DNA conformation change.” Indeed, the importance of epigenetics in transformation has already been demonstrated for many tumor viruses. HPV16 E7 induces histone 3 lysine 27-specific demethylases (76), EBV LMP1 and LMP2A modulate the activity of DNA methyltransferases and interact with histone modifiers (79), and adenovirus E1A causes sweeping changes in histone acetylation (80).

Sexual dimorphism in normal breast tissue

The Genotype-Tissue Expression (GTEx) consortium has generated gene expression data using tissue collected from 51 body sites and in nearly 600 individuals. Not surprisingly, the tissue with the greatest difference between males and females in autosomal gene expression is the breast (81). We used PANDA to create tissue-specific regulatory networks to study the effect of sex on regulatory networks in breast tissue (81). We first applied the Louvain method to detect communities separately in the networks derived from male and female breast tissue and tested for functional enrichment of GO terms in the male and female communities. We found that both the networks were enriched for the same biological processes: GTPase-mediated signal transduction and protein catabolic process (Supplementary Table 3). Therefore, despite what one might expect to be substantially different, the global structure of the male and female networks failed to identify sex-specific patterns of regulation. We also used the edge subtraction method to search for modular differences between the sexes and tested modules for GO term enrichment, but this too failed to identify any significant GO biological processes.

We then tested whether ALPACA could find sex-specific modular structure in the breast regulatory network (Figure 6). We first compared the male regulatory network against the female regulatory network and found 18 male-specific differential modules (Supplementary Table 3). Module 2 was highly enriched in developmental processes, including “nervous system development,” “response to BMP,” and “blood vessel development.” Similarly, module 8 was enriched for “muscle organ morphogenesis.” These results are not surprising and reflect the fact that male and female breast tissues have significant differences in their developmental trajectory. We note that many of the developmental genes in these modules are associated with breast cancer. Among genes annotated with “nervous system development,” the fibroblast growth factor receptor (FGFR) is often amplified or dysregulated in breast cancer, the HES5 locus is repositioned in invasive breast cancer, and VLDLR is often upregulated in metastatic breast cancer (82–85). The blood vessel development category included genes such as GATA6, a known oncogene that may drive EMT in the breast; TBX3, which appears to repress the tumor suppressor p14ARF and drive metastatic breast cancer; PRRX2, which increases invasiveness in breast tumors; and RASA1, whose expression is associated with poor prognosis in breast cancer (86–89). Among the BMP response and muscle development groups, there are several genes, like TWSG1, VANGL2, and GSC, which are relevant to both normal breast development and breast cancer (90–92). Module 7 was enriched for terms related to rRNA processing and module 14 contained genes relevant to chromatin assembly, suggesting that transcription and translation are reorganized at a global level between males and females.

Figure 6. Sexually dimorphic ALPACA modules in human breast tissue.

Networks show four modules specific to either female (left-hand side) or male (right-hand side) breast tissue. Nodes are colored by membership in differential modules. Edge opacity is proportional to its contribution to the differential modularity. Networks are annotated with representative enriched GO terms with P_adj < 0.05. Genes annotated by the shown GO terms are labeled in large font.

Next, we compared the female breast regulatory network against the male network and found 17 female-specific regulatory modules. Among those, ALPACA identified a module (module 15) that is enriched in “intracellular estrogen receptor signaling pathway,” the hormonal process one would expect to be critical for female breast development and overall function. The highest-scoring gene in this pathway, PPARGC1B, is a co-activator of the estrogen receptor and is a genetic risk factor for estrogen receptor-positive breast cancer (93, 94). Module 10 was enriched for “positive regulation of ERK1 and ERK2 cascade.” ERK1/2 signaling is a major pathway involved in estrogen-induced cell proliferation and breast cancer (95, 96). This module contains the growth arrest-specific gene GAS6, which is induced by estrogen and is associated with chemoresistance and metastasis in breast cancer (97–99), and the chemokine CCL5, which has been proposed as a therapeutic target for estrogen-dependent breast cancer (100, 101). Module 10 was also enriched for Type I interferon response, which may be a result of the increased blood and lymphatic penetrance in normal female breast development.

We found module 17 to be enriched for “negative regulation of cell-substrate adhesion” and contained SPOCK1 and NOTCH1, both known markers of invasion and breast cancer progression (102–105). Finally, module 5 was enriched in transcriptional regulation factors, similar to the enrichment in chromatin remodeling found in the male breast network.

Consistent with expectations based on both the functional differences of male and female breast, and the profound differences in gene expression, ALPACA was able to identify major biological processes associated with differences in breast development between females and males, many of which are also known to be dysregulated in breast cancer.

DISCUSSION

Biological networks have complex modular and hierarchical topologies that allow organisms to carry out the functions necessary for survival. Various perturbations, such as environmental conditions or mutations, can alter regulatory networks, leading to changes in the phenotype of the organism. Techniques such as differential expression analysis can be used to characterize the transition between different cellular states, but changes in gene expression are ultimately driven by changes in regulatory pathways. If we are to build predictive models of complex phenotypes and diseases, it is essential that we understand how the regulatory network also changes with phenotype. ALPACA is an algorithm that compares genome-scale networks using a metric we call the “differential modularity” to find groups of nodes that drive changes in modular structure. ALPACA differs from other network methods in that it compares the structure of networks to each other rather than to a random background network and is thus more likely to detect subtle differences in network modular structure. This potentially can allow detection of gene modules that function together in particular conditions or in disease.

We evaluated the performance of ALPACA on simulated networks and compared it to two available approaches for detecting changes in network modular structure: (i) “community comparison,” where one applies standard community detection to the baseline and perturbed networks separately and contrasts the resulting communities, and (ii) “edge subtraction,” which involves subtracting the two networks edge by edge, and clustering the resulting differential network. ALPACA was able resolve smaller differential modules than the community comparison method. Intuitively, this is because modularity maximization in its standard form penalizes the splitting of a large dense community into smaller ones, whereas the differential modularity score used in ALPACA penalizes the formation of large communities similar to those present in the baseline network. In addition, ALPACA was more robust to noise in individual network edges than the edge subtraction method. In the edge subtraction method, the uncertainty of the edges in the “differential” network is the sum of the uncertainties in the corresponding edges of the original networks. Instead, ALPACA aggregates the signals coming from multiple edges in the baseline network communities to derive a null model for edge density, so it is less sensitive to the uncertainty in individual edges.

ALPACA’s differential modularity metric directly compares the edges that one sees within a community to what you would expect based on the topology of a corresponding reference network. This adapts the well-established modularity maximization method to infer subtle changes in the community structure that arise when comparing distinct complex phenotypes. Unlike other methods that simply subtract networks, ALPACA preserves those secondary interactions that exist in both networks but allows them to shift their functional context as the edges around them change, which can capture new modular structures. The differential modularity also incorporates increased and decreased edge weights across the entire network into a single, simple framework for module detection. And unlike community comparison, ALPACA can detect new modules that form on top of a background of globally active regulatory programs that are present in both the baseline and perturbed networks.

We applied ALPACA to transcriptional networks that were constructed from gene expression and TF binding data using the PANDA network inference algorithm. PANDA does not explicitly use the expression correlation between regulators and the target genes, and can therefore model TFs that are not changing in mRNA expression but whose activity is controlled through other mechanisms, like post-translational modification. PANDA also incorporates changes in promoter activity that could alter regulatory targeting patterns. Comparing angiogenic to non-angiogenic subtypes of ovarian cancer, we found functional modules that were enriched in expected disease pathways like blood vessel development, interleukin production, and JAK-STAT signaling. We also found enrichment for less expected processes including nutritional pathways like flavonoid biosynthesis and triglyceride homeostasis, which have been speculated to be relevant for ovarian cancer, but for which the underlying molecular pathways are not known (36–38, 45). These biological processes were specific to the angiogenic subtype and uniquely revealed by ALPACA; they could not be found through standard community detection in the individual angiogenic and non-angiogenic networks or in an edge-subtracted network, or by running ALPACA in reverse on the non-angiogenic network compared to the angiogenic network.

In another test of the method, we compared normal male and female breast tissue to find sex-specific patterns of regulation. Many of the modules we found were enriched in known processes related to breast development and breast cancer, like ERK and Rho GTPase signaling. Perhaps most strikingly, the female breast network contained a differential module enriched for estrogen receptor signaling, which is one of the main sex-specific pathways known to be active in breast tissue. Once again, these results could not be found using other community detection and network comparison methods.

ALPACA requires a minimum input of two graphs. It is easily generalizable and could be applied to many types of biological networks, including metabolic, protein-protein interaction (PPI), and expression Quantitative Trait Loci (eQTL) networks, all of which exhibit highly functional modular structures (9, 106, 107). For example, we could imagine applying ALPACA to compare community structure in PPI networks with mutation-driven “edgetic” perturbations, in order to discover functional changes in protein complexes and signaling associated with disease (108). ALPACA could also be applied to compare eQTL networks in patient cohorts with differing pathologies to prioritize sets of SNPs and genes that influence complex traits (9).

ALPACA builds on our growing understanding of how networks define phenotype. Differential expression is driven by changes in the activity and structure of gene regulatory networks. But adding or subtracting edges does more than change individual regulatory interactions. With enough individual changes occurring in the right places in the starting network, changes in edges can lead to the creation or destruction of functional communities of genes and their regulators. While the global structure of the network may be largely unchanged, these new functional communities provide insight into coherent processes that differentiate one phenotype from another.

As more genome-wide studies of molecular interactions and multi-omics data are generated, better statistical models for network analysis will be critical to making differential network biology a robust and reproducible platform for studying complex diseases (13). To transform this large volume of data into clinically useful predictions and hypotheses, we need rigorous methods that can integrate heterogeneous data types and extract the functional elements that are key parameters for modeling disease transitions and the genotype-phenotype relationship. ALPACA is the first method to make direct comparisons between networks to identify changes in their modular structure in a rigorous manner and is an important step forward in methodology in the statistical analysis of networks.

MATERIALS AND METHODS

ALPACA algorithm

ALPACA is implemented in R and is freely available for download through Github at https://github.com/meghapadi/ALPACA. It is comprised of the following two steps:

Step 1: The input network consists of edges between regulators and target genes. We first label the nodes that act as regulators and targets separately. In particular, a gene that encodes a transcription factor (TF) becomes two separate nodes depending on whether we are modeling its mRNA expression level (target node) or protein activity (regulator node). For the weight of each edge, we use the final z-score output by the PANDA network inference algorithm. We then take the edges that have positive weight in the baseline condition, and run bipartite weighted network community detection using either CONDOR or the Louvain method.
Step 2: Compute D_ij for the perturbed network, using the definition in the main text and the baseline communities found in Step 1. It is possible that the numerator and denominator of N_ij are both zero, meaning that there were no edges between the communities C_i and C_j. This can happen if, for example, at least one of the nodes i or j were not connected to the baseline network to begin with. In this case, we define N_ij to be zero, since the “expected” number of edges between the two nodes is zero. We next apply a generalized Louvain procedure to assign nodes into communities based on D_ij (18). Briefly, the Louvain method works as follows: (i) Start with every node in its own community, (ii) go through each node iteratively, and merge it with the node that produces the biggest increase in differential modularity, (iii) after reaching a local optimum, treat each of the resulting groups as “metanodes” in a new “metanetwork” and recalculate an effective adjacency matrix, and (iv) repeat steps (ii) and (iii) until convergence. For the purpose of reporting reproducible results, we iterate through the nodes in the same pre-determined order every time, and we break ties by selecting the first member of the set.
In an optional third step, we can evaluate the core genes in each module for enrichment in known biological pathways.
Step 3: The core genes are those that are most important to the integrity of the module and therefore potentially the most robust and essential members. To define the core genes, we score each node according to its contribution to the differential modularity of the module that it belongs to:

We ranked the target genes in each module by their scores S_i. Since the size of typical modules found in ALPACA ranged from about 50 to 200 genes, we chose to use the top 50 core genes from each module to evaluate functional enrichment in an equitable manner across all the modules. We also repeated each analysis using the top 100 core genes in order to test the dependence of the enrichment on the cutoff. GO term enrichment was calculated using the GOstats package in R, with the following parameters: the gene universe is defined to be the set of all possible target genes in the initial networks, and the p-value calculation is conditioned on the GO hierarchy structure. In each module, the p-values were adjusted for multiple testing using the Benjamini-Hochberg method.

Edge subtraction method

For each edge, the edge weight of the baseline network was subtracted from the edge weight in the perturbed network to compute Δw_ij, and only edges with Δw_ij > 0 were retained. We then used the Δw_ij values as new edge weights to perform community detection using CONDOR or Louvain optimization (9, 30).

Community comparison method

We first used either CONDOR or Louvain method to find the community structure of the baseline and perturbed networks, in each case keeping only edges that had positive z-scores. We next aimed to efficiently map the two community structures to each other. To find the best approximation of a linear mapping, we computed R in the equation B = AR, where A is the N x q matrix of node membership for the baseline community structure, and B is the corresponding matrix for the perturbed community structure (here N is the number of genes and q is the number of communities). To invert the matrix A, we used singular value decomposition to compute the pseudoinverse A^P = VD⁻¹U^T, where A = UDV^T, and then computed R = A^PB. The entries of the q × q matrix R represent an approximate linear transformation that maps the communities in the baseline network to the communities of the perturbed network. Finally, we scored each node according to how much its community membership remains the same between baseline and perturbed conditions, using the formula . Nodes were ranked from low to high values of for further analysis. Low-scoring nodes represent the nodes that participate in altered community structure in the perturbed network.

Creating simulated networks and evaluating differential community methods

To simulate “addition” networks, we started with the GFP-control network from the tumor virus dataset (see section on “Data preprocessing and network inference” for details on how this network was constructed) and thresholded the edges at a z-score of 2.7 (for noiseless simulation) or 2.9 (for noisy simulation). The threshold was chosen such that the resulting edges would form an unweighted network with a similar community structure as the full weighted network. We found that applying CONDOR to the GFP-control network at a threshold of 2.7 resulted in five communities containing 1336, 833, 781, 1018, and 44 nodes each. To add a module, we randomly chose a subset of these nodes and added all possible new edges between them. To add noise in the second set of “addition” networks, we also resampled edges as follows: (i) start with an empty network with the same nodes as the GFP-control network, (ii) count the number of edges between TFs in community C_i and target genes in community C_j, for each pair i and j, in the GFP-control network, and (iii) add a matching number of edges randomly between the TFs in community C_i and target genes in community C_j in the new network.

We evaluated the results of each method on the simulated networks by comparing the ranks of true positives (the target genes in the added module) against a background consisting of target genes not in the added module. We used Kolmogorov-Smirnov and Wilcoxon tests to look for significant differences in the distribution of the ranks. Both tests gave similar results, and in the figures we present the Wilcoxon p-values.

To create the “subtracted” simulation with two node groups, we started with a fully connected network containing 100 nodes, with all edge weights set to a default value of 0.1. We then defined two node groups, A and B, each containing 10 TFs and 40 genes. Edges within each of these groups were set to edge weight 1.0.

Next, to create the baseline network we set the weights of all edges between groups A and B to be 0.8. To create the perturbed network we set the weights of all edges between groups A and B to be 0.2. To create the three-group “subtracted” network, we first created a fully connected network containing 125 nodes, with all edge weights set to a default value of 0.1. We then defined three node groups A, B, and C containing 50, 25, and 50 nodes respectively (of which 10, 5, and 10 were TFs). Edges within each group were set to weight 1.0, and all edges between groups B and C were set to weight 0.2. For the baseline network, the edges between groups A and B were set to weight 0.8 and for the perturbed network, the edges between groups A and B were set to weight 0.2.

Data preprocessing and network inference

Preprocessing and network inference for ovarian cancer data was carried out as described in (35). Briefly, we ran the network inference algorithm PANDA (Passing Attributes between Networks for Data Assimilation) to integrate gene expression data with transcription factor binding sites to create regulatory networks for each subtype (33). The prior network of binding sites for 111 TFs were defined as the occurrence of the corresponding motif in the promoter, defined as [−750,+250] base pairs around the transcription start site (TSS).

The viral oncogene gene expression data were normalized and batch-corrected, and a map of high-probability TF binding sites was created by combining cell-type-specific DNase-I hypersensitivity data with motif occurrence in the promoters defined as [-25kb, 25kb] around each TSS, as described in (63). The binding sites and gene expression were combined to infer networks using PANDA with default parameters, as described in (1).

Sex-specific and tissue-specific transcriptional networks for the GTEx data were constructed as described in (81).

AUTHOR CONTRIBUTIONS

MP conceived of the project, performed analysis, and wrote the paper. JQ helped refine the analyses and write the paper.

ACKNOWLEDGEMENTS

This work was supported by NIH grants K25 HG006031 (MP) and R01 HL111759 and R35 CA197449 (JQ).

Footnotes

Classification: Biological Sciences, Systems Biology

REFERENCES

1.↵
Padi M & Quackenbush J (2015) Integrating transcriptional and protein interaction networks to prioritize condition-specific master regulators. BMC Syst Biol 9:80.
OpenUrl
2.↵
Giaever G, et al. (2002) Functional profiling of the Saccharomyces cerevisiae genome. Nature 418(6896):387–391.
OpenUrl CrossRef PubMed Web of Science
3.↵
Locke AE, et al. (2015) Genetic studies of body mass index yield new insights for obesity biology. Nature 518(7538):197–206.
OpenUrl CrossRef PubMed
4.
Wood AR, et al. (2014) Defining the role of common variation in the genomic and biological architecture of adult human height. Nat Genet 46(11): 1173–1186.
OpenUrl CrossRef PubMed
5.↵
Fuchsberger C, et al. (2016) The genetic architecture of type 2 diabetes. Nature 536(7614):41–47.
OpenUrl CrossRef PubMed
6.↵
Jeong H, Mason SP, Barabasi AL, & Oltvai ZN (2001) Lethality and centrality in protein networks. Nature 411(6833):41–42.
OpenUrl CrossRef PubMed Web of Science
7.↵
Hartwell LH, Hopfield JJ, Leibler S, & Murray AW (1999) From molecular to modular cell biology. Nature 402(6761 Suppl):C47–52.
OpenUrl CrossRef PubMed Web of Science
8.
Menche J, et al. (2015) Disease networks. Uncovering disease-disease relationships through the incomplete interactome. Science 347(6224):1257601.
OpenUrl Abstract/FREE Full Text
9.↵
Platig J, Castaldi PJ, DeMeo D, & Quackenbush J (2016) Bipartite Community Structure of eQTLs. PLoS computational biology 12(9):e1005033.
OpenUrl
10.↵
Anonymous (2012) An integrated encyclopedia of DNA elements in the human genome. Nature 489(7414):57–74.
OpenUrl CrossRef PubMed Web of Science
11.
Marbach D, et al. (2012) Wisdom of crowds for robust gene network inference. Nat Methods 9(8):796–804.
OpenUrl CrossRef PubMed Web of Science
12.↵
Marbach D, et al. (2016) Tissue-specific regulatory circuits reveal variable modular perturbations across complex diseases. Nat Methods 13(4):366–370.
OpenUrl CrossRef PubMed
13.↵
Ideker T & Krogan NJ (2012) Differential network biology. Mol Syst Biol 8:565.
OpenUrl Abstract/FREE Full Text
14.↵
Gambardella G, et al. (2013) Differential network analysis for the identification of condition-specific pathway activity and regulation. Bioinformatics 29(14):1776–1785.
OpenUrl CrossRef PubMed Web of Science
15.↵
Watson M (2006) CoXpress: differential co-expression in gene expression data. BMC Bioinformatics 7:509.
OpenUrl CrossRef PubMed
16.↵
Van Landeghem S, Van Parys T, Dubois M, Inze D, & Van de Peer Y (2016) Diffany: an ontology-driven framework to infer, visualise and analyse differential molecular networks. BMC Bioinformatics 17:18.
OpenUrl
17.
Gill R, Datta S, & Datta S (2010) A statistical framework for differential network analysis from microarray data. BMC Bioinformatics 11:95.
OpenUrl CrossRef PubMed
18.↵
Danon L, Diaz-Guilera A, Duch J, & Arenas A (2005) Comparing community structure identification. Journal of Statistical Mechanics: Theory and Experiment 9:P09008.
OpenUrl
19.↵
Tesson BM, Breitling R, & Jansen RC (2010) DiffCoEx: a simple and sensitive method to find differentially coexpressed gene modules. BMC Bioinformatics 11:497.
OpenUrl CrossRef PubMed
20.↵
Amar D, Safer H, & Shamir R (2013) Dissection of regulatory networks that are altered in disease via differential co-expression. PLoS computational biology 9(3):e1002955.
OpenUrl CrossRef
21.↵
Valcarcel B, et al. (2014) Genome metabolome integrated network analysis to uncover connections between genetic variants and complex traits: an application to obesity. J R Soc Interface 11(94):20130908.
OpenUrl CrossRef PubMed
22.↵
Mitra K, Carvunis AR, Ramesh SK, & Ideker T (2013) Integrative approaches for finding modular structure in biological networks. Nat Rev Genet 14(10):719–732.
OpenUrl CrossRef PubMed
23.↵
Ideker T, Ozier O, Schwikowski B, & Siegel AF (2002) Discovering regulatory and signalling circuits in molecular interaction networks. Bioinformatics 18 Suppl 1:S233–240.
OpenUrl CrossRef PubMed
24.↵
Fortunato S & Barthelemy M (2007) Resolution limit in community detection. Proc Natl Acad Sci U S A 104(1):36–41.
OpenUrl Abstract/FREE Full Text
25.↵
Gerstein MB, et al. (2012) Architecture of the human regulatory network derived from ENCODE data. Nature 489(7414):91–100.
OpenUrl CrossRef PubMed Web of Science
26.↵
Mucha PJ, Richardson T, Macon K, Porter MA, & Onnela JP (2010) Community structure in time-dependent, multiscale, and multiplex networks. Science 328(5980):876–878.
OpenUrl Abstract/FREE Full Text
27.↵
Newman ME & Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E Stat Nonlin Soft Matter Phys 69(2 Pt 2):026113.
OpenUrl CrossRef PubMed
28.
Raghavan UN, Albert R, & Kumara S (2007) Near linear time algorithm to detect community structures in large-scale networks. Phys Rev E Stat Nonlin Soft Matter Phys 76(3 Pt 2):036106.
OpenUrl CrossRef PubMed
29.↵
Rosvall M & Bergstrom CT (2008) Maps of random walks on complex networks reveal community structure. Proc Natl Acad Sci U S A 105(4):1118–1123.
OpenUrl Abstract/FREE Full Text
30.↵
V. B, Guillaume J-L, Lambiotte R, & Lefebvre E (2008) Fast unfolding of communities in large networks. Journal of StatisticalMechanics:P10008.
31.↵
Esmailian P & Jalili M (2015) Community Detection in Signed Networks: the Role of Negative ties in Different Scales. Scientific reports 5:14339.
OpenUrl
32.↵
Traag VA & Bruggeman J (2009) Community detection in networks with positive and negative links. Phys Rev E Stat Nonlin Soft Matter Phys 80(3 Pt 2):036115.
OpenUrl CrossRef PubMed
33.↵
Glass K, Huttenhower C, Quackenbush J, & Yuan GC (2013) Passing messages between biological networks to refine predicted interactions. PLoS One 8(5):e64832.
OpenUrl CrossRef PubMed
34.↵
Bentink S, et al. (2012) Angiogenic mRNA and microRNA gene expression signature predicts a novel subtype of serous ovarian cancer. PLoS ONE 7(2):e30269.
OpenUrl CrossRef PubMed
35.↵
Glass K, Quackenbush J, Spentzos D, Haibe-Kains B, & Yuan GC (2015) A network model for angiogenesis in ovarian cancer. BMC Bioinformatics 16:115.
OpenUrl CrossRef PubMed
36.↵
Cassidy A, Huang T, Rice MS, Rimm EB, & Tworoger SS (2014) Intake of dietary flavonoids and risk of epithelial ovarian cancer. Am J Clin Nutr 100(5):1344–1351.
OpenUrl Abstract/FREE Full Text
37.
Gates MA, et al. (2009) Flavonoid intake and ovarian cancer risk in a population-based case-control study. Int J Cancer 124(8):1918–1925.
OpenUrl CrossRef PubMed Web of Science
38.↵
Hua X, et al. (2016) Association among Dietary Flavonoids, Flavonoid Subclasses and Ovarian Cancer Risk: A Meta-Analysis. PLoS One 11(3):e0151134.
OpenUrl CrossRef PubMed
39.↵
Shen H, et al. (2013) Epigenetic analysis leads to identification of HNF1B as a subtype-specific susceptibility gene for ovarian cancer. Nature communications 4:1628.
OpenUrl
40.↵
Bourguignon LY, Gilad E, Rothman K, & Peyrollier K (2005) Hyaluronan-CD44 interaction with IQGAP1 promotes Cdc42 and ERK signaling, leading to actin binding, Elk-1/estrogen receptor transcriptional activation, and ovarian cancer progression. J Biol Chem 280(12): 11961–11972.
OpenUrl Abstract/FREE Full Text
41.
Dong P, et al. (2006) Overexpression and diffuse expression pattern of IQGAP1 at invasion fronts are independent prognostic parameters in ovarian carcinomas. Cancer letters 243(1): 120–127.
OpenUrl CrossRef PubMed Web of Science
42.↵
Dong PX, et al. (2008) Silencing of IQGAP1 by shRNA inhibits the invasion of ovarian carcinoma HO-8910PM cells in vitro. J Exp Clin Cancer Res 27:77.
OpenUrl CrossRef PubMed
43.↵
Brennan DJ, et al. (2009) The transcription factor Sox11 is a prognostic factor for improved recurrence-free survival in epithelial ovarian cancer. Eur J Cancer 45(8):1510–1517.
OpenUrl CrossRef PubMed Web of Science
44.↵
Sernbo S, et al. (2011) The tumour suppressor SOX11 is associated with improved survival among high grade epithelial ovarian cancers and is regulated by reversible promoter methylation. BMC Cancer 11:405.
OpenUrl CrossRef PubMed
45.↵
Tania M, Khan MA, & Song Y (2010) Association of lipid metabolism with ovarian cancer. Current oncology (Toronto, Ont.) 17(5):6–11.
OpenUrl
46.↵
Hanahan D & Weinberg RA (2011) Hallmarks of cancer: the next generation. Cell 144(5):646–674.
OpenUrl CrossRef PubMed Web of Science
47.↵
Cohen CA, Shea AA, Heffron CL, Schmelz EM, & Roberts PC (2016) Interleukin-12 Immunomodulation Delays the Onset of Lethal Peritoneal Disease of Ovarian Cancer. J Interferon Cytokine Res 36(1):62–73.
OpenUrl
48.
Coward J, et al. (2011) Interleukin-6 as a therapeutic target in human ovarian cancer. Clin Cancer Res 17(18):6083–6096.
OpenUrl Abstract/FREE Full Text
49.↵
Isobe A, et al. (2015) Interleukin 6 receptor is an independent prognostic factor and a potential therapeutic target of ovarian cancer. PLoS One 10(2):e0118080.
OpenUrl CrossRef PubMed
50.↵
Wall L, Burke F, Barton C, Smyth J, & Balkwill F (2003) IFN-gamma induces apoptosis in ovarian cancer cells in vivo and in vitro. Clin Cancer Res 9(7):2487–2496.
OpenUrl Abstract/FREE Full Text
51.↵
Welander CE (1987) Use of interferon in the treatment of ovarian cancer as a single agent and in combination with cytotoxic drugs. Cancer 59(3 Suppl):617–619.
OpenUrl CrossRef PubMed
52.↵
Alvero AB (2010) Recent insights into the role of NF-kappaB in ovarian carcinogenesis. Genome Med 2(8):56.
OpenUrl CrossRef PubMed
53.↵
Huang Y, et al. (2011) FSH inhibits ovarian cancer cell apoptosis by up-regulating survivin and down-regulating PDCD6 and DR5. Endocr Relat Cancer 18(1): 13–26.
OpenUrl Abstract/FREE Full Text
54.
Park SH, et al. (2012) PDCD6 additively cooperates with anti-cancer drugs through activation of NF-kappaB pathways. Cellular signalling 24(3):726–733.
OpenUrl PubMed
55.↵
Su D, et al. (2012) PDCD6 is an independent predictor of progression free survival in epithelial ovarian cancer. Journal of translational medicine 10:31.
OpenUrl
56.↵
Lin Y, Xu T, Tian G, & Cui M (2014) Cysteine-rich, angiogenic inducer, 61 expression in patients with ovarian epithelial carcinoma. The Journal of international medical research 42(2):300–306.
OpenUrl CrossRef PubMed
57.↵
Shen H, et al. (2014) CYR61 overexpression associated with the development and poor prognosis of ovarian carcinoma. Med Oncol 31(8): 117.
OpenUrl
58.↵
Bartel F, et al. (2012) Inverse expression of cystein-rich 61 (Cyr61/CCN1) and connective tissue growth factor (CTGF/CCN2) in borderline tumors and carcinomas of the ovary. Int J Gynecol Pathol 31(5):405–415.
OpenUrl PubMed
59.↵
Abubaker K, et al. (2014) Inhibition of the JAK2/STAT3 pathway in ovarian cancer results in the loss of cancer stem cell-like characteristics and a reduced tumor burden. BMC Cancer 14:317.
OpenUrl CrossRef PubMed
60.↵
Guo J, Schally AV, Zarandi M, Varga J, & Leung PC (2010) Antiproliferative effect of growth hormone-releasing hormone (GHRH) antagonist on ovarian cancer cells through the EGFR-Akt pathway. Reprod Biol Endocrinol 8:54.
OpenUrl CrossRef PubMed
61.
Klukovits A, et al. (2012) Novel antagonists of growth hormone-releasing hormone inhibit growth and vascularization of human experimental ovarian cancers. Cancer 118(3):670–680.
OpenUrl CrossRef PubMed Web of Science
62.↵
Papadia A, et al. (2011) Growth hormone-releasing hormone antagonists inhibit growth of human ovarian cancer. Hormone and metabolic research = Hormon-und Stoffwechselforschung = Hormones et metabolisme 43(11):816–820.
OpenUrl
63.↵
Rozenblatt-Rosen O, et al. (2012) Interpreting cancer genomes using systematic host network perturbations by tumour virus proteins. Nature 487(7408):491–495.
OpenUrl CrossRef PubMed Web of Science
64.↵
Griffiths DA, et al. (2013) Merkel cell polyomavirus small T antigen targets the NEMO adaptor protein to disrupt inflammatory signaling. Journal of virology 87(24):13853–13867.
OpenUrl Abstract/FREE Full Text
65.↵
Hiscott J, Kwon H, & Genin P (2001) Hostile takeovers: viral appropriation of the NF-kappaB pathway. J Clin Invest 107(2):143–151.
OpenUrl CrossRef PubMed Web of Science
66.↵
Auburn H, Zuckerman M, & Smith M (2016) Analysis of Epstein-Barr virus and cellular gene expression during the early phases of EBV lytic induction. Journal of medical microbiology.
67.
Hasan UA, et al. (2007) TLR9 expression and function is abolished by the cervical cancer-associated human papillomavirus type 16. J Immunol 178(5):3186–3197.
OpenUrl Abstract/FREE Full Text
68.
Shahzad N, et al. (2013) The T antigen locus of Merkel cell polyomavirus downregulates human Toll-like receptor 9 expression. Journal of virology 87(23):13009–13019.
OpenUrl Abstract/FREE Full Text
69.
Zauner L & Nadal D (2012) Understanding TLR9 action in Epstein-Barr virus infection. Front Biosci (Landmark Ed) 17:1219–1231.
OpenUrl
70.
Assetta B, De Cecco M, O’Hara B, & Atwood WJ (2016) JC Polyomavirus Infection of Primary Human Renal Epithelial Cells Is Controlled by a Type I IFN-Induced Response. MBio 7(4).
71.↵
Michaud F, et al. (2010) Epstein-Barr virus interferes with the amplification of IFNalpha secretion by activating suppressor of cytokine signaling 3 in primary human monocytes. PLoS One 5(7):e11908.
OpenUrl PubMed
72.↵
Turunen A & Syrjanen S (2014) Extracellular calcium regulates keratinocyte proliferation and HPV 16 E6 RNA expression in vitro. APMIS 122(9):781–789.
OpenUrl PubMed
73.↵
Chami M, Oules B, & Paterlini-Brechot P (2006) Cytobiological consequences of calcium-signaling alterations induced by human viral proteins. Biochim Biophys Acta 1763(11): 1344–1362.
OpenUrl CrossRef PubMed
74.↵
Hernando H, et al. (2014) Epstein-Barr virus-mediated transformation of B cells induces global chromatin changes independent to the acquisition of proliferation. Nucleic Acids Res 42(1):249–263.
OpenUrl CrossRef PubMed
75.
Jiang Y, et al. (2015) Repression of Hox genes by LMP1 in nasopharyngeal carcinoma and modulation of glycolytic pathway genes by HoxC8. Oncogene 34(50):6079–6091.
OpenUrl CrossRef PubMed
76.↵
McLaughlin-Drubin ME, Crum CP, & Munger K (2011) Human papillomavirus E7 oncoprotein induces KDM6A and KDM6B histone demethylase expression and causes epigenetic reprogramming. Proc Natl AcadSci USA 108(5):2130–2135.
OpenUrl Abstract/FREE Full Text
77.↵
Lamouille S, Xu J, & Derynck R (2014) Molecular mechanisms of epithelial-mesenchymal transition. Nature reviews 15(3):178–196.
OpenUrl
78.↵
Zhu D, Ye M, & Zhang W (2015) E6/E7 oncoproteins of high risk HPV-16 upregulate MT1-MMP, MMP-2 and MMP-9 and promote the migration of cervical cancer cells. International journal of clinical and experimental pathology 8(5):4981–4989.
OpenUrl
79.↵
Niller HH, Szenthe K, & Minarovits J (2014) Epstein-Barr virus-host cell interactions: an epigenetic dialog? Frontiers in genetics 5:367.
OpenUrl
80.↵
Ferrari R, et al. (2008) Epigenetic reprogramming by adenovirus e1a. Science 321(5892): 1086–1088.
OpenUrl Abstract/FREE Full Text
81.↵
Chen C-Y, et al. (2016) Sexual dimorphism in gene expression and regulatory networks across human tissues. bioRxiv.
82.↵
He L, et al. (2010) Up-regulated expression of type II very low density lipoprotein receptor correlates with cancer metastasis and has a potential link to beta-catenin in different cancers. BMC Cancer 10:601.
OpenUrl CrossRef PubMed
83.
Meaburn KJ, Gudla PR, Khan S, Lockett SJ, & Misteli T (2009) Disease-specific gene repositioning in breast cancer. J Cell Biol 187(6):801–812.
OpenUrl Abstract/FREE Full Text
84.
Turner N & Grose R (2010) Fibroblast growth factor signalling: from development to cancer. Nat Rev Cancer 10(2):116–129.
OpenUrl CrossRef PubMed Web of Science
85.↵
Webb DJ, Nguyen DH, Sankovic M, & Gonias SL (1999) The very low density lipoprotein receptor regulates urokinase receptor catabolism and breast cancer cell motility in vitro. J Biol Chem 274(11):7412–7420.
OpenUrl Abstract/FREE Full Text
86.↵
Krstic M, et al. (2016) The transcriptional regulator TBX3 promotes progression from non-invasive to invasive breast cancer. BMC Cancer 16(1):671.
OpenUrl
87.
Okada T, et al. (2015) The Rho GTPase Rnd1 suppresses mammary tumorigenesis and EMT by restraining Ras-MAPK signalling. Nat Cell Biol 17(1):81–94.
OpenUrl PubMed
88.
Song Y, et al. (2015) GATA6 is overexpressed in breast cancer and promotes breast cancer cell epithelial-mesenchymal transition by upregulating slug expression. Exp Mol Pathol 99(3):617–627.
OpenUrl
89.↵
Yarosh W, et al. (2008) TBX3 is overexpressed in breast cancer and represses p14 ARF by interacting with histone deacetylases. Cancer Res 68(3):693–699.
OpenUrl Abstract/FREE Full Text
90.↵
Forsman CL, et al. (2013) BMP-binding protein twisted gastrulation is required in mammary gland epithelium for normal ductal elongation and myoepithelial compartmentalization. Developmental biology 373(1):95–106.
OpenUrl CrossRef PubMed
91.
Hartwell KA, et al. (2006) The Spemann organizer gene, Goosecoid, promotes tumor metastasis. Proc Natl Acad Sci U S A 103(50):18969–18974.
OpenUrl Abstract/FREE Full Text
92.↵
Puvirajesinghe TM, et al. (2016) Identification of p62/SQSTM1 as a component of non-canonical Wnt VANGL2-JNK signalling in breast cancer. Nature communications 7:10318.
OpenUrl
93.↵
Li Y, et al. (2011) Genetic variation of ESR1 and its co-activator PPARGC1B is synergistic in augmenting the risk of estrogen receptor-positive breast cancer. Breast Cancer Res 13(1):R10.
OpenUrl CrossRef PubMed
94.↵
Wirtenberger M, et al. (2006) Associations of genetic variants in the estrogen receptor coactivators PPARGC1A, PPARGC1B and EP300 with familial breast cancer. Carcinogenesis 27(11):2201–2208.
OpenUrl CrossRef PubMed Web of Science
95.↵
Jerjees DA, et al. (2014) ERK1/2 is related to oestrogen receptor and predicts outcome in hormone-treated breast cancer. Breast cancer research and treatment 147(1):25–37.
OpenUrl
96.↵
Keshamouni VG, Mattingly RR, & Reddy KB (2002) Mechanism of 17-beta-estradiol-induced Erk1/2 activation in breast cancer cells. A role for HER2 AND PKC-delta. J Biol Chem 277(25):22558–22565.
OpenUrl Abstract/FREE Full Text
97.↵
McCormack O, et al. (2008) Growth arrest-specific gene 6 expression in human breast cancer. Br J Cancer 98(6):1141–1146.
OpenUrl CrossRef PubMed Web of Science
98.
Mo R, Tony Zhu Y, Zhang Z, Rao SM, & Zhu YJ (2007) GAS6 is an estrogen-inducible gene in mammary epithelial cells. Biochem Biophys Res Commun 353(1):189–194.
OpenUrl CrossRef PubMed Web of Science
99.↵
Wang C, et al. (2016) Gas6/Axl Axis Contributes to Chemoresistance and Metastasis in Breast Cancer through Akt/GSK-3beta/beta-catenin Signaling. Theranostics 6(8):1205–1219.
OpenUrl
100.↵
Soria G & Ben-Baruch A (2008) The inflammatory chemokines CCL2 and CCL5 in breast cancer. Cancer letters 267(2):271–285.
OpenUrl CrossRef PubMed Web of Science
101.↵
Svensson S, et al. (2015) CCL2 and CCL5 Are Novel Therapeutic Targets for Estrogen-Dependent Breast Cancer. Clin Cancer Res 21(16):3794–3805.
OpenUrl Abstract/FREE Full Text
102.↵
Bolos V, et al. (2013) Notch activation stimulates migration of breast cancer cells and promotes tumor growth. Breast Cancer Res 15(4):R54.
OpenUrl CrossRef PubMed
103.
Fan LC, Jeng YM, Lu YT, & Lien HC (2016) SPOCK1 Is a Novel Transforming Growth Factor-beta-Induced Myoepithelial Marker That Enhances Invasion and Correlates with Poor Prognosis in Breast Cancer. PLoS One 11(9):e0162933.
OpenUrl
104.
Simmons MJ, Serra R, Hermance N, & Kelliher MA (2012) NOTCH1 inhibition in vivo results in mammary tumor regression and reduced mammary tumorsphere-forming activity in vitro. Breast Cancer Res 14(5):R126.
OpenUrl CrossRef PubMed
105.↵
Wang J, Fu L, Gu F, & Ma Y (2011) Notch1 is involved in migration and invasion of human breast cancer cells. Oncology reports 26(5):1295–1303.
OpenUrl PubMed
106.↵
Spirin V & Mirny LA (2003) Protein complexes and functional modules in molecular networks. Proc Natl Acad Sci U S A 100(21):12123–12128.
OpenUrl Abstract/FREE Full Text
107.↵
Ravasz E, Somera AL, Mongru DA, Oltvai ZN, & Barabasi AL (2002) Hierarchical organization of modularity in metabolic networks. Science 297(5586):1551–1555.
OpenUrl Abstract/FREE Full Text
108.↵
Sahni N, et al. (2015) Widespread macromolecular interaction perturbations in human genetic disorders. Cell 161(3):647–660.
OpenUrl CrossRef PubMed

View the discussion thread.

Posted May 25, 2017.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Systems Biology

Subject Areas

All Articles

Animal Behavior and Cognition (5215)
Biochemistry (11753)
Bioengineering (8752)
Bioinformatics (29201)
Biophysics (14974)
Cancer Biology (12100)
Cell Biology (17413)
Clinical Trials (138)
Developmental Biology (9422)
Ecology (14182)
Epidemiology (2067)
Evolutionary Biology (18309)
Genetics (12245)
Genomics (16804)
Immunology (11869)
Microbiology (28098)
Molecular Biology (11596)
Neuroscience (60975)
Paleontology (451)
Pathology (1871)
Pharmacology and Toxicology (3238)
Physiology (4959)
Plant Biology (10427)
Scientific Communication and Education (1683)
Synthetic Biology (2886)
Systems Biology (7340)
Zoology (1651)

[1] 1.↵
Padi M & Quackenbush J (2015) Integrating transcriptional and protein interaction networks to prioritize condition-specific master regulators. BMC Syst Biol 9:80.
OpenUrl

[2] 2.↵
Giaever G, et al. (2002) Functional profiling of the Saccharomyces cerevisiae genome. Nature 418(6896):387–391.
OpenUrl CrossRef PubMed Web of Science

[3] 3.↵
Locke AE, et al. (2015) Genetic studies of body mass index yield new insights for obesity biology. Nature 518(7538):197–206.
OpenUrl CrossRef PubMed

[4] 4.
Wood AR, et al. (2014) Defining the role of common variation in the genomic and biological architecture of adult human height. Nat Genet 46(11): 1173–1186.
OpenUrl CrossRef PubMed

[5] 5.↵
Fuchsberger C, et al. (2016) The genetic architecture of type 2 diabetes. Nature 536(7614):41–47.
OpenUrl CrossRef PubMed

[6] 6.↵
Jeong H, Mason SP, Barabasi AL, & Oltvai ZN (2001) Lethality and centrality in protein networks. Nature 411(6833):41–42.
OpenUrl CrossRef PubMed Web of Science

[7] 7.↵
Hartwell LH, Hopfield JJ, Leibler S, & Murray AW (1999) From molecular to modular cell biology. Nature 402(6761 Suppl):C47–52.
OpenUrl CrossRef PubMed Web of Science

[8] 8.
Menche J, et al. (2015) Disease networks. Uncovering disease-disease relationships through the incomplete interactome. Science 347(6224):1257601.
OpenUrl Abstract/FREE Full Text

[9] 9.↵
Platig J, Castaldi PJ, DeMeo D, & Quackenbush J (2016) Bipartite Community Structure of eQTLs. PLoS computational biology 12(9):e1005033.
OpenUrl

[10] 10.↵
Anonymous (2012) An integrated encyclopedia of DNA elements in the human genome. Nature 489(7414):57–74.
OpenUrl CrossRef PubMed Web of Science

[11] 11.
Marbach D, et al. (2012) Wisdom of crowds for robust gene network inference. Nat Methods 9(8):796–804.
OpenUrl CrossRef PubMed Web of Science

[12] 12.↵
Marbach D, et al. (2016) Tissue-specific regulatory circuits reveal variable modular perturbations across complex diseases. Nat Methods 13(4):366–370.
OpenUrl CrossRef PubMed

[13] 13.↵
Ideker T & Krogan NJ (2012) Differential network biology. Mol Syst Biol 8:565.
OpenUrl Abstract/FREE Full Text

[14] 14.↵
Gambardella G, et al. (2013) Differential network analysis for the identification of condition-specific pathway activity and regulation. Bioinformatics 29(14):1776–1785.
OpenUrl CrossRef PubMed Web of Science

[15] 15.↵
Watson M (2006) CoXpress: differential co-expression in gene expression data. BMC Bioinformatics 7:509.
OpenUrl CrossRef PubMed

[16] 16.↵
Van Landeghem S, Van Parys T, Dubois M, Inze D, & Van de Peer Y (2016) Diffany: an ontology-driven framework to infer, visualise and analyse differential molecular networks. BMC Bioinformatics 17:18.
OpenUrl

[17] 17.
Gill R, Datta S, & Datta S (2010) A statistical framework for differential network analysis from microarray data. BMC Bioinformatics 11:95.
OpenUrl CrossRef PubMed

[18] 18.↵
Danon L, Diaz-Guilera A, Duch J, & Arenas A (2005) Comparing community structure identification. Journal of Statistical Mechanics: Theory and Experiment 9:P09008.
OpenUrl

[19] 19.↵
Tesson BM, Breitling R, & Jansen RC (2010) DiffCoEx: a simple and sensitive method to find differentially coexpressed gene modules. BMC Bioinformatics 11:497.
OpenUrl CrossRef PubMed

[20] 20.↵
Amar D, Safer H, & Shamir R (2013) Dissection of regulatory networks that are altered in disease via differential co-expression. PLoS computational biology 9(3):e1002955.
OpenUrl CrossRef

[21] 21.↵
Valcarcel B, et al. (2014) Genome metabolome integrated network analysis to uncover connections between genetic variants and complex traits: an application to obesity. J R Soc Interface 11(94):20130908.
OpenUrl CrossRef PubMed

[22] 22.↵
Mitra K, Carvunis AR, Ramesh SK, & Ideker T (2013) Integrative approaches for finding modular structure in biological networks. Nat Rev Genet 14(10):719–732.
OpenUrl CrossRef PubMed

[23] 23.↵
Ideker T, Ozier O, Schwikowski B, & Siegel AF (2002) Discovering regulatory and signalling circuits in molecular interaction networks. Bioinformatics 18 Suppl 1:S233–240.
OpenUrl CrossRef PubMed

[24] 24.↵
Fortunato S & Barthelemy M (2007) Resolution limit in community detection. Proc Natl Acad Sci U S A 104(1):36–41.
OpenUrl Abstract/FREE Full Text

[25] 25.↵
Gerstein MB, et al. (2012) Architecture of the human regulatory network derived from ENCODE data. Nature 489(7414):91–100.
OpenUrl CrossRef PubMed Web of Science

[26] 26.↵
Mucha PJ, Richardson T, Macon K, Porter MA, & Onnela JP (2010) Community structure in time-dependent, multiscale, and multiplex networks. Science 328(5980):876–878.
OpenUrl Abstract/FREE Full Text

[27] 27.↵
Newman ME & Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E Stat Nonlin Soft Matter Phys 69(2 Pt 2):026113.
OpenUrl CrossRef PubMed

[28] 28.
Raghavan UN, Albert R, & Kumara S (2007) Near linear time algorithm to detect community structures in large-scale networks. Phys Rev E Stat Nonlin Soft Matter Phys 76(3 Pt 2):036106.
OpenUrl CrossRef PubMed

[29] 29.↵
Rosvall M & Bergstrom CT (2008) Maps of random walks on complex networks reveal community structure. Proc Natl Acad Sci U S A 105(4):1118–1123.
OpenUrl Abstract/FREE Full Text

[30] 30.↵
V. B, Guillaume J-L, Lambiotte R, & Lefebvre E (2008) Fast unfolding of communities in large networks. Journal of StatisticalMechanics:P10008.

[31] 31.↵
Esmailian P & Jalili M (2015) Community Detection in Signed Networks: the Role of Negative ties in Different Scales. Scientific reports 5:14339.
OpenUrl

[32] 32.↵
Traag VA & Bruggeman J (2009) Community detection in networks with positive and negative links. Phys Rev E Stat Nonlin Soft Matter Phys 80(3 Pt 2):036115.
OpenUrl CrossRef PubMed

[33] 33.↵
Glass K, Huttenhower C, Quackenbush J, & Yuan GC (2013) Passing messages between biological networks to refine predicted interactions. PLoS One 8(5):e64832.
OpenUrl CrossRef PubMed

[34] 34.↵
Bentink S, et al. (2012) Angiogenic mRNA and microRNA gene expression signature predicts a novel subtype of serous ovarian cancer. PLoS ONE 7(2):e30269.
OpenUrl CrossRef PubMed

[35] 35.↵
Glass K, Quackenbush J, Spentzos D, Haibe-Kains B, & Yuan GC (2015) A network model for angiogenesis in ovarian cancer. BMC Bioinformatics 16:115.
OpenUrl CrossRef PubMed

[36] 36.↵
Cassidy A, Huang T, Rice MS, Rimm EB, & Tworoger SS (2014) Intake of dietary flavonoids and risk of epithelial ovarian cancer. Am J Clin Nutr 100(5):1344–1351.
OpenUrl Abstract/FREE Full Text

[37] 37.
Gates MA, et al. (2009) Flavonoid intake and ovarian cancer risk in a population-based case-control study. Int J Cancer 124(8):1918–1925.
OpenUrl CrossRef PubMed Web of Science

[38] 38.↵
Hua X, et al. (2016) Association among Dietary Flavonoids, Flavonoid Subclasses and Ovarian Cancer Risk: A Meta-Analysis. PLoS One 11(3):e0151134.
OpenUrl CrossRef PubMed

[39] 39.↵
Shen H, et al. (2013) Epigenetic analysis leads to identification of HNF1B as a subtype-specific susceptibility gene for ovarian cancer. Nature communications 4:1628.
OpenUrl

[40] 40.↵
Bourguignon LY, Gilad E, Rothman K, & Peyrollier K (2005) Hyaluronan-CD44 interaction with IQGAP1 promotes Cdc42 and ERK signaling, leading to actin binding, Elk-1/estrogen receptor transcriptional activation, and ovarian cancer progression. J Biol Chem 280(12): 11961–11972.
OpenUrl Abstract/FREE Full Text

[41] 41.
Dong P, et al. (2006) Overexpression and diffuse expression pattern of IQGAP1 at invasion fronts are independent prognostic parameters in ovarian carcinomas. Cancer letters 243(1): 120–127.
OpenUrl CrossRef PubMed Web of Science

[42] 42.↵
Dong PX, et al. (2008) Silencing of IQGAP1 by shRNA inhibits the invasion of ovarian carcinoma HO-8910PM cells in vitro. J Exp Clin Cancer Res 27:77.
OpenUrl CrossRef PubMed

[43] 43.↵
Brennan DJ, et al. (2009) The transcription factor Sox11 is a prognostic factor for improved recurrence-free survival in epithelial ovarian cancer. Eur J Cancer 45(8):1510–1517.
OpenUrl CrossRef PubMed Web of Science

[44] 44.↵
Sernbo S, et al. (2011) The tumour suppressor SOX11 is associated with improved survival among high grade epithelial ovarian cancers and is regulated by reversible promoter methylation. BMC Cancer 11:405.
OpenUrl CrossRef PubMed

[45] 45.↵
Tania M, Khan MA, & Song Y (2010) Association of lipid metabolism with ovarian cancer. Current oncology (Toronto, Ont.) 17(5):6–11.
OpenUrl

[46] 46.↵
Hanahan D & Weinberg RA (2011) Hallmarks of cancer: the next generation. Cell 144(5):646–674.
OpenUrl CrossRef PubMed Web of Science

[47] 47.↵
Cohen CA, Shea AA, Heffron CL, Schmelz EM, & Roberts PC (2016) Interleukin-12 Immunomodulation Delays the Onset of Lethal Peritoneal Disease of Ovarian Cancer. J Interferon Cytokine Res 36(1):62–73.
OpenUrl

[48] 48.
Coward J, et al. (2011) Interleukin-6 as a therapeutic target in human ovarian cancer. Clin Cancer Res 17(18):6083–6096.
OpenUrl Abstract/FREE Full Text

[49] 49.↵
Isobe A, et al. (2015) Interleukin 6 receptor is an independent prognostic factor and a potential therapeutic target of ovarian cancer. PLoS One 10(2):e0118080.
OpenUrl CrossRef PubMed

[50] 50.↵
Wall L, Burke F, Barton C, Smyth J, & Balkwill F (2003) IFN-gamma induces apoptosis in ovarian cancer cells in vivo and in vitro. Clin Cancer Res 9(7):2487–2496.
OpenUrl Abstract/FREE Full Text

[51] 51.↵
Welander CE (1987) Use of interferon in the treatment of ovarian cancer as a single agent and in combination with cytotoxic drugs. Cancer 59(3 Suppl):617–619.
OpenUrl CrossRef PubMed

[52] 52.↵
Alvero AB (2010) Recent insights into the role of NF-kappaB in ovarian carcinogenesis. Genome Med 2(8):56.
OpenUrl CrossRef PubMed

[53] 53.↵
Huang Y, et al. (2011) FSH inhibits ovarian cancer cell apoptosis by up-regulating survivin and down-regulating PDCD6 and DR5. Endocr Relat Cancer 18(1): 13–26.
OpenUrl Abstract/FREE Full Text

[54] 54.
Park SH, et al. (2012) PDCD6 additively cooperates with anti-cancer drugs through activation of NF-kappaB pathways. Cellular signalling 24(3):726–733.
OpenUrl PubMed

[55] 55.↵
Su D, et al. (2012) PDCD6 is an independent predictor of progression free survival in epithelial ovarian cancer. Journal of translational medicine 10:31.
OpenUrl

[56] 56.↵
Lin Y, Xu T, Tian G, & Cui M (2014) Cysteine-rich, angiogenic inducer, 61 expression in patients with ovarian epithelial carcinoma. The Journal of international medical research 42(2):300–306.
OpenUrl CrossRef PubMed

[57] 57.↵
Shen H, et al. (2014) CYR61 overexpression associated with the development and poor prognosis of ovarian carcinoma. Med Oncol 31(8): 117.
OpenUrl

[58] 58.↵
Bartel F, et al. (2012) Inverse expression of cystein-rich 61 (Cyr61/CCN1) and connective tissue growth factor (CTGF/CCN2) in borderline tumors and carcinomas of the ovary. Int J Gynecol Pathol 31(5):405–415.
OpenUrl PubMed

[59] 59.↵
Abubaker K, et al. (2014) Inhibition of the JAK2/STAT3 pathway in ovarian cancer results in the loss of cancer stem cell-like characteristics and a reduced tumor burden. BMC Cancer 14:317.
OpenUrl CrossRef PubMed

[60] 60.↵
Guo J, Schally AV, Zarandi M, Varga J, & Leung PC (2010) Antiproliferative effect of growth hormone-releasing hormone (GHRH) antagonist on ovarian cancer cells through the EGFR-Akt pathway. Reprod Biol Endocrinol 8:54.
OpenUrl CrossRef PubMed

[61] 61.
Klukovits A, et al. (2012) Novel antagonists of growth hormone-releasing hormone inhibit growth and vascularization of human experimental ovarian cancers. Cancer 118(3):670–680.
OpenUrl CrossRef PubMed Web of Science

[62] 62.↵
Papadia A, et al. (2011) Growth hormone-releasing hormone antagonists inhibit growth of human ovarian cancer. Hormone and metabolic research = Hormon-und Stoffwechselforschung = Hormones et metabolisme 43(11):816–820.
OpenUrl

[63] 63.↵
Rozenblatt-Rosen O, et al. (2012) Interpreting cancer genomes using systematic host network perturbations by tumour virus proteins. Nature 487(7408):491–495.
OpenUrl CrossRef PubMed Web of Science

[64] 64.↵
Griffiths DA, et al. (2013) Merkel cell polyomavirus small T antigen targets the NEMO adaptor protein to disrupt inflammatory signaling. Journal of virology 87(24):13853–13867.
OpenUrl Abstract/FREE Full Text

[65] 65.↵
Hiscott J, Kwon H, & Genin P (2001) Hostile takeovers: viral appropriation of the NF-kappaB pathway. J Clin Invest 107(2):143–151.
OpenUrl CrossRef PubMed Web of Science

[66] 66.↵
Auburn H, Zuckerman M, & Smith M (2016) Analysis of Epstein-Barr virus and cellular gene expression during the early phases of EBV lytic induction. Journal of medical microbiology.

[67] 67.
Hasan UA, et al. (2007) TLR9 expression and function is abolished by the cervical cancer-associated human papillomavirus type 16. J Immunol 178(5):3186–3197.
OpenUrl Abstract/FREE Full Text

[68] 68.
Shahzad N, et al. (2013) The T antigen locus of Merkel cell polyomavirus downregulates human Toll-like receptor 9 expression. Journal of virology 87(23):13009–13019.
OpenUrl Abstract/FREE Full Text

[69] 69.
Zauner L & Nadal D (2012) Understanding TLR9 action in Epstein-Barr virus infection. Front Biosci (Landmark Ed) 17:1219–1231.
OpenUrl

[70] 70.
Assetta B, De Cecco M, O’Hara B, & Atwood WJ (2016) JC Polyomavirus Infection of Primary Human Renal Epithelial Cells Is Controlled by a Type I IFN-Induced Response. MBio 7(4).

[71] 71.↵
Michaud F, et al. (2010) Epstein-Barr virus interferes with the amplification of IFNalpha secretion by activating suppressor of cytokine signaling 3 in primary human monocytes. PLoS One 5(7):e11908.
OpenUrl PubMed

[72] 72.↵
Turunen A & Syrjanen S (2014) Extracellular calcium regulates keratinocyte proliferation and HPV 16 E6 RNA expression in vitro. APMIS 122(9):781–789.
OpenUrl PubMed

[73] 73.↵
Chami M, Oules B, & Paterlini-Brechot P (2006) Cytobiological consequences of calcium-signaling alterations induced by human viral proteins. Biochim Biophys Acta 1763(11): 1344–1362.
OpenUrl CrossRef PubMed

[74] 74.↵
Hernando H, et al. (2014) Epstein-Barr virus-mediated transformation of B cells induces global chromatin changes independent to the acquisition of proliferation. Nucleic Acids Res 42(1):249–263.
OpenUrl CrossRef PubMed

[75] 75.
Jiang Y, et al. (2015) Repression of Hox genes by LMP1 in nasopharyngeal carcinoma and modulation of glycolytic pathway genes by HoxC8. Oncogene 34(50):6079–6091.
OpenUrl CrossRef PubMed

[76] 76.↵
McLaughlin-Drubin ME, Crum CP, & Munger K (2011) Human papillomavirus E7 oncoprotein induces KDM6A and KDM6B histone demethylase expression and causes epigenetic reprogramming. Proc Natl AcadSci USA 108(5):2130–2135.
OpenUrl Abstract/FREE Full Text

[77] 77.↵
Lamouille S, Xu J, & Derynck R (2014) Molecular mechanisms of epithelial-mesenchymal transition. Nature reviews 15(3):178–196.
OpenUrl

[78] 78.↵
Zhu D, Ye M, & Zhang W (2015) E6/E7 oncoproteins of high risk HPV-16 upregulate MT1-MMP, MMP-2 and MMP-9 and promote the migration of cervical cancer cells. International journal of clinical and experimental pathology 8(5):4981–4989.
OpenUrl

[79] 79.↵
Niller HH, Szenthe K, & Minarovits J (2014) Epstein-Barr virus-host cell interactions: an epigenetic dialog? Frontiers in genetics 5:367.
OpenUrl

[80] 80.↵
Ferrari R, et al. (2008) Epigenetic reprogramming by adenovirus e1a. Science 321(5892): 1086–1088.
OpenUrl Abstract/FREE Full Text

[81] 81.↵
Chen C-Y, et al. (2016) Sexual dimorphism in gene expression and regulatory networks across human tissues. bioRxiv.

[82] 82.↵
He L, et al. (2010) Up-regulated expression of type II very low density lipoprotein receptor correlates with cancer metastasis and has a potential link to beta-catenin in different cancers. BMC Cancer 10:601.
OpenUrl CrossRef PubMed

[83] 83.
Meaburn KJ, Gudla PR, Khan S, Lockett SJ, & Misteli T (2009) Disease-specific gene repositioning in breast cancer. J Cell Biol 187(6):801–812.
OpenUrl Abstract/FREE Full Text

[84] 84.
Turner N & Grose R (2010) Fibroblast growth factor signalling: from development to cancer. Nat Rev Cancer 10(2):116–129.
OpenUrl CrossRef PubMed Web of Science

[85] 85.↵
Webb DJ, Nguyen DH, Sankovic M, & Gonias SL (1999) The very low density lipoprotein receptor regulates urokinase receptor catabolism and breast cancer cell motility in vitro. J Biol Chem 274(11):7412–7420.
OpenUrl Abstract/FREE Full Text

[86] 86.↵
Krstic M, et al. (2016) The transcriptional regulator TBX3 promotes progression from non-invasive to invasive breast cancer. BMC Cancer 16(1):671.
OpenUrl

[87] 87.
Okada T, et al. (2015) The Rho GTPase Rnd1 suppresses mammary tumorigenesis and EMT by restraining Ras-MAPK signalling. Nat Cell Biol 17(1):81–94.
OpenUrl PubMed

[88] 88.
Song Y, et al. (2015) GATA6 is overexpressed in breast cancer and promotes breast cancer cell epithelial-mesenchymal transition by upregulating slug expression. Exp Mol Pathol 99(3):617–627.
OpenUrl

[89] 89.↵
Yarosh W, et al. (2008) TBX3 is overexpressed in breast cancer and represses p14 ARF by interacting with histone deacetylases. Cancer Res 68(3):693–699.
OpenUrl Abstract/FREE Full Text

[90] 90.↵
Forsman CL, et al. (2013) BMP-binding protein twisted gastrulation is required in mammary gland epithelium for normal ductal elongation and myoepithelial compartmentalization. Developmental biology 373(1):95–106.
OpenUrl CrossRef PubMed

[91] 91.
Hartwell KA, et al. (2006) The Spemann organizer gene, Goosecoid, promotes tumor metastasis. Proc Natl Acad Sci U S A 103(50):18969–18974.
OpenUrl Abstract/FREE Full Text

[92] 92.↵
Puvirajesinghe TM, et al. (2016) Identification of p62/SQSTM1 as a component of non-canonical Wnt VANGL2-JNK signalling in breast cancer. Nature communications 7:10318.
OpenUrl

[93] 93.↵
Li Y, et al. (2011) Genetic variation of ESR1 and its co-activator PPARGC1B is synergistic in augmenting the risk of estrogen receptor-positive breast cancer. Breast Cancer Res 13(1):R10.
OpenUrl CrossRef PubMed

[94] 94.↵
Wirtenberger M, et al. (2006) Associations of genetic variants in the estrogen receptor coactivators PPARGC1A, PPARGC1B and EP300 with familial breast cancer. Carcinogenesis 27(11):2201–2208.
OpenUrl CrossRef PubMed Web of Science

[95] 95.↵
Jerjees DA, et al. (2014) ERK1/2 is related to oestrogen receptor and predicts outcome in hormone-treated breast cancer. Breast cancer research and treatment 147(1):25–37.
OpenUrl

[96] 96.↵
Keshamouni VG, Mattingly RR, & Reddy KB (2002) Mechanism of 17-beta-estradiol-induced Erk1/2 activation in breast cancer cells. A role for HER2 AND PKC-delta. J Biol Chem 277(25):22558–22565.
OpenUrl Abstract/FREE Full Text

[97] 97.↵
McCormack O, et al. (2008) Growth arrest-specific gene 6 expression in human breast cancer. Br J Cancer 98(6):1141–1146.
OpenUrl CrossRef PubMed Web of Science

[98] 98.
Mo R, Tony Zhu Y, Zhang Z, Rao SM, & Zhu YJ (2007) GAS6 is an estrogen-inducible gene in mammary epithelial cells. Biochem Biophys Res Commun 353(1):189–194.
OpenUrl CrossRef PubMed Web of Science

[99] 99.↵
Wang C, et al. (2016) Gas6/Axl Axis Contributes to Chemoresistance and Metastasis in Breast Cancer through Akt/GSK-3beta/beta-catenin Signaling. Theranostics 6(8):1205–1219.
OpenUrl

[100] 100.↵
Soria G & Ben-Baruch A (2008) The inflammatory chemokines CCL2 and CCL5 in breast cancer. Cancer letters 267(2):271–285.
OpenUrl CrossRef PubMed Web of Science

[101] 101.↵
Svensson S, et al. (2015) CCL2 and CCL5 Are Novel Therapeutic Targets for Estrogen-Dependent Breast Cancer. Clin Cancer Res 21(16):3794–3805.
OpenUrl Abstract/FREE Full Text

[102] 102.↵
Bolos V, et al. (2013) Notch activation stimulates migration of breast cancer cells and promotes tumor growth. Breast Cancer Res 15(4):R54.
OpenUrl CrossRef PubMed

[103] 103.
Fan LC, Jeng YM, Lu YT, & Lien HC (2016) SPOCK1 Is a Novel Transforming Growth Factor-beta-Induced Myoepithelial Marker That Enhances Invasion and Correlates with Poor Prognosis in Breast Cancer. PLoS One 11(9):e0162933.
OpenUrl

[104] 104.
Simmons MJ, Serra R, Hermance N, & Kelliher MA (2012) NOTCH1 inhibition in vivo results in mammary tumor regression and reduced mammary tumorsphere-forming activity in vitro. Breast Cancer Res 14(5):R126.
OpenUrl CrossRef PubMed

[105] 105.↵
Wang J, Fu L, Gu F, & Ma Y (2011) Notch1 is involved in migration and invasion of human breast cancer cells. Oncology reports 26(5):1295–1303.
OpenUrl PubMed

[106] 106.↵
Spirin V & Mirny LA (2003) Protein complexes and functional modules in molecular networks. Proc Natl Acad Sci U S A 100(21):12123–12128.
OpenUrl Abstract/FREE Full Text

[107] 107.↵
Ravasz E, Somera AL, Mongru DA, Oltvai ZN, & Barabasi AL (2002) Hierarchical organization of modularity in metabolic networks. Science 297(5586):1551–1555.
OpenUrl Abstract/FREE Full Text

[108] 108.↵
Sahni N, et al. (2015) Widespread macromolecular interaction perturbations in human genetic disorders. Cell 161(3):647–660.
OpenUrl CrossRef PubMed