GSKB: A gene set database for pathway analysis in mouse

Liming Lai; Jason Hennessey; Valerie Bares; Eun Woo Son; Yuguang Ban; Wei Wang; Jianli Qi; Gaixin Jiang; Arthur Liberzon; Steven Xijin Ge

doi:10.1101/082511

ABSTRACT

Interpretation of high-throughput genomics data based on biological pathways constitutes a constant challenge, partly because of the lack of supporting pathway database. In this study, we created a functional genomics knowledgebase in mouse, which includes 33,261 pathways and gene sets compiled from 40 sources such as Gene Ontology, KEGG, GeneSetDB, PANTHER, microRNA and transcription factor target genes, etc. In addition, we also manually collected and curated 8,747 lists of differentially expressed genes from 2,526 published gene expression studies to enable the detection of similarity to previously reported gene expression signatures. These two types of data constitute a Gene Set Knowledgebase (GSKB), which can be readily used by various pathway analysis software such as gene set enrichment analysis (GSEA). Using our knowledgebase, we were able to detect the correct microRNA (miR-29) pathway that was suppressed using antisense oligonucleotides and confirmed its role in inhibiting fibrogenesis, which might involve upregulation of transcription factor SMAD3. The knowledgebase can be queried as a source of published gene lists for further meta-analysis. Through meta-analysis of 56 published gene lists related to retina cells, we revealed two fundamentally different types of gene expression changes. One is related to stress and inflammatory response blamed for causing blindness in many diseases; the other associated with visual perception by normal retina cells. GSKB is available online at http://ge-lab.org/gs/, and also as a Bioconductor package (gskb, https://bioconductor.org/packages/gskb/). This database enables in-depth interpretation of mouse genomics data both in terms of known pathways and the context of thousands of published expression signatures.

INTRODUCTION

Pathway analysis is a key step in analyzing high-throughput genomics data. The goal of pathway analysis is to determine if coherent change in gene expression occurs among a set of genes related to a molecular pathway or biological function. Many methods have been developed to achieve this goal (see review in [1]). While gene set enrichment analysis (GSEA) [2], one of the most popular programs, is based on non-parametric Kolmogorov-Smirnov statistic, several parametric algorithms have been developed [3, 4]. The foundation for all of these algorithms is a database of curated pathways and functional categories. The Molecular Signatures Database (MSigDB) [5] is a collection of gene lists initially developed for GSEA [2]. While its main focus is human, MSigDB also includes a small number of gene sets for mouse and rat. In addition to existing pathway databases, MSigDB also includes lists of differentially expressed genes manually collected from published gene expression studies related to genetic and chemical perturbations. Inclusion of these gene sets enables detection of the co-regulation of genes similar to those reported in the literature. The MSigDB has been widely used and greatly facilitate pathway analysis in human genomics studies.

As much of the work has been focused on gene sets in human, there is an urgent need of comprehensive pathway databases for other model organisms. Some researchers have to convert human genes into mouse orthologs in order to use MSigDB [6]. Other previous efforts include Genetrail [7], which constructs its own database by extracting information from several sources such as Gene Ontology [8], KEGG [9] etc. GeneSetDB is a larger collection of gene sets for several species based on 26 public databases [10]. Pathway and gene-set enrichment database(PAGED), which covers 20 species, derives information from many different sources including GeneSetDB and other disease-gene association data [11]. We recently created a pathway database for Arabidopsis [12].

In this study, we sought to develop a pathway knowledgebase for mouse, an important model organism for the study of many human diseases. We compiled gene sets from a large number of existing annotation databases as well as thousands of primary publications, which covers a wide spectrum of genetic, genomic and biological information. These gene sets forms a foundation for in-depth interpretation of mouse genomics data, supporting the use of mouse as a model for understanding human biology and diseases. We also demonstrate the use of this knowledgebase to generate testable hypotheses through several examples.

METHODS

Since most gene expression studies deposit gene expression data and publication information in public repositories, we first search for publications in Gene Expression Omnibus (GEO, www.ncbi.nlm.nih.gov/geo/) and ArrayExpress (www.ebi.ac.uk/arrayexpress/). We focused on mouse related expression studies in this study. Based on the information, the full text of the papers and their supplementary materials were retrieved. Then gene lists were compiled and curated after reading the papers and their supplementary materials. Similar to MSigDB [5] and AraPath [12], an unique identifier was assigned to each gene list. The names often start with the last names of the first authors and end with “_up”, or “_down” to indicate whether the genes are up-regulated or down-regulated, respectively. A brief one-sentence description was also given to each gene list. For example, gene list VENEZIA_FETAL-LIVER_UP represents genes highly expressed in fetal liver. A long description for the gene lists are abstracts of the paper from PubMed. We retrieved and processed all papers linked to GEO and ArrayExpress that we can found at the time of the study. We tried to collect all lists of differentially expressed genes reported in the literature. Larger lists (>3000 genes) are excluded. Finally, all gene lists were merged into an Excel spreadsheet to be further processed. One key step is the conversion of various gene IDs from different sources to NCBI gene symbols and Mouse Genome Informatics (MGI) IDs. The conversion was made based on the mouse genes information at NCBI, including platform definition files at GEO and other gene information files such as gene2accession.gz, Mm.data.gz, Mm.gb_cid_lid, and All_data.gene_info.gz (ftp://ftp.ncbi.nih.gov). Conversion to MGI IDs is carried out using information from MGI web site (www.informatics.jax.org). A Perl program was created to convert these gene IDs.

Gene lists from existing sources were manually imported in July 2013. Our web site (http://gskb.ge-lab.org) was assembled using a combination of the Python programming language, Django web framework, MySQL, JavaScript, and html. More detailed information about GSKB is included in supplementary file 3 which follows the BioDBcore guidelines [13]. This web site also includes data for other species such as the Arabidopsis gene sets [12], as well as other species that are still in the process.

Gene expression datasets were downloaded from GEO with accession numbers GSE40261and GSE27035. The raw Affymetrix .CEL files were processed using robust multiarray analysis (RMA) algorithm [14] as implemented in Bioconductor [15]. “Present” or “Absent” calls were calculated using the Affymetrix MAS5 algorithm based on Wilcoxon Rank Sum test. Probe-sets with “Absent” calls across all samples were deemed not significantly above background and removed from further analysis. Probe-set IDs were mapped to official gene symbol based on the mapping in Bioconductor package mouse4302.db. If multiple probe-sets were mapped to the same gene, we retained the one with the largest standard deviation. The processed gene expression data is available as supplementary file 4. GSEA version 2.0 was used with default parameters to analyze GSE40261. We also used another method, parametric analysis of gene set enrichment (PAGE) [3], implemented in the PGSEA package [16]. Gene expression data was mean-centered before using PGSEA, which calculates a t-statistic for each gene-set in every sample to measure whether the genes are collectively up- or down-regulated. The t-statistics were used in an analysis of variance (ANOVA) to test whether there is significant difference between sample groups. Gene sets are ranked by the difference in average t-statistic.

RESULTS AND DISCUSSION

GSKB is a comprehensive knowledgebase for pathway analysis in mouse. It complements MSigDB, which contains a small number of mouse gene lists. Unlike GeneSetDB [10], GSKB includes a large number of lists of differentially expressed genes. The data can be downloaded and used by various software packages for pathway analysis. Our web site also includes an interface for keyword search. The keywords can be a topic-related (“stem cell”), or a gene symbol (“SOX2”), which will be compared against all the descriptions including the abstracts as well as gene lists. In addition, we also provide a similarity search page, where users can upload a gene list, and our web server will compare user’s gene lists with all the gene lists in the database and return those that have statistically significant overlaps.

Construction of the knowledgebase

As a first step, we gathered annotation information from 40 existing databases for mouse-related gene sets (Table 1). These gene sets are divided into 8 categories, namely, Co-expression, Gene Ontology, Curated pathways, Metabolic Pathways, Transcription Factor (TF) and microRNA target genes, location (cytogenetics band), and others. We used information in GeneSetDB [10] for some of the databases. Detailed information on these 40 sources and the citations is available in supplementary file 1.

View this table:

Table 1. Sources for gene sets in GSKB

The gene lists from literature were retrieved manually from individual gene expression studies through a process similar to the one used to create AraPath, a similar resource for Arabidopsis[12]. As most expression studies upload raw data to repositories like GEO and ArrayExpress, we used the meta-data in these databases to search for publications. We scanned all datasets we can found and retrieved 4,313 potentially useful papers reporting gene expression studies in mouse. These papers were individually read by curators to identify lists of differentially expressed genes in various conditions. We compiled a total of 8,747 lists of differently expressed genes from 2,518 of papers (See supplementary information 2 for citations). Each gene list was annotated with a unique name, brief description, and publication information, similar to the protocol used in MSigDB and Arapath [12]. These gene lists constitute a large collection of published expression signatures that form a foundation for interpret new gene lists and expression profiles.

These 8,747 gene lists collected from literature include a total of 29,876 unique gene symbols. Interestingly, the distribution of the sizes of published gene lists approximates a normal distribution on logarithm scale (Fig. 1). Although the median size is 51, there are many gene lists containing a few hundreds of genes. The most frequently appeared genes in these lists are shown in Table 2. It includes genes related to cell cycle (CCND2, CDKN1A), and immune response (SOCS3, FOS). Many were intensively-studied, as suggested by the number of related PubMed hits when using gene symbol in keyword searches.

Fig. 1.

Size distribution of the number of differentially expressed genes reported in the literature

View this table:

Table 2.

Top 10 most frequent genes in 8,747 published lists of differentially expressed genes.

Using the database

The database can be downloaded at our web site (http://ge-lab.org/gs/), and used with various pathway analysis software. The web site can also be searched using keywords representing either gene names or pathway names. Using the “Find related genesets” page, users can also upload their own lists of genes and search the database for gene sets with significant overlap. Finally, we also created a Bioconductor package “gskb” (https://bioconductor.org/packages/gskb/) that provide access to this data.

In-depth analysis of the expression profile of miR-29 silencing and induction

To test if known pathways could be detected, we re-analyzed a gene expression dataset from Hand et al. [17], which is available in GEO database with accession number GSE40261. In this experiment, mice were injected of antisense oligonucleotides against miR-29a or a scrambled sequence as control. The antisense oligonucleotides should suppress the expression of miR-29a, thereby affect the expression of the set of genes inhibited by this microRNA in liver tissue. We downloaded and processed the raw Affymetrix files and used different pathway analysis software to detect significantly altered pathways (See Methods for details). We first focused on 1869 predicted microRNA target gene sets. Table 3 shows the top gene sets using GSEA. It is clear that the most significantly affected pathways are those related to miR-29a, b, and c. We also used another method, parametric analysis of gene set enrichment (PAGE) [3], as implemented in the PGSEA package [16]. The results are shown in Fig. 2. The top 6 pathways are all miR-29 target gene sets. Therefore, the expected molecular pathway was detected using different software.

View this table:

Table 3.

Significant gene sets from pathway analysis using GSEA. ES: Enrichment Score, NES: Normalized Enrichment Score, P-val: Nominal P values, FDR: False discovery rate, FWER: family-wise error rate.

Fig. 2.

Significant gene sets from PGSEA analysis. Red indicates higher expression of genes targeted by certain microRNA according to prediction, while blue means lower expression.

To gain further insight into the downstream molecular pathways, we analyze this expression data using other gene sets in GSKB. Using the PGSEA software, we identified significantly altered pathways in many different types of gene sets. Table 4 shows some of the top ranked gene sets by category. In the co-expression category, we detected that the change in expression profile when miR-29c was suppressed are similar to several reported expression signatures. The top 5 signatures are related to various perturbations of liver cells such as overexpression of TCFAP2C [18], or treatment with rosiglitazone [19], a compound that can lower glucose levels. The third most significantly gene set (Nones_Mdr1A_Curcumin-Diet_Fibrogenesis_Dn) includes genes involved in fibrogenesis down-regulated in the colon of multidrug resistance gene-deficient (mdr1a-/-) mice fed with curcumin diet [20]. Fibrogenesis is a process regulating the deposition of extracellular matrix proteins. This is in fact a reoccurring theme in multiple significant gene sets across categories. For example, two significant Gene Ontology terms are “Extracellular Matrix Structural Constituent”, and “Collagen Fibril Organization”. Other related gene sets are “ECM-Receptor Interaction”, “Integrin”, and “Beta Integrin Cell Surface Interactions”. These results strongly suggest that lower expression of miR-29 caused by antisense oligonucleotides leads to the up-regulation of extracellular matrix proteins and fibrogenesis in hepatic cells. This agrees with the well-established role of miR-29 in fibrosis in liver [21], and several other tissues/cells such as heart [22], stellate cells [23] and HK-2 cells (human kidney cell line) [24] etc. All members of the miR-29 family were found to be downregulated in murine livers treated with carbon tetrachloride to induce hepatic fibrogenesis [21]. Roderburg et al. [21] also noted abnormally low expression of miR-29 in patients with advanced liver fibrosis and liver cirrhosis. Thus GSKB enables us to identify downstream molecular pathways regulated by the miR-29 family.

View this table:

Table 4.

Top gene sets in all categories from PGSEA analysis of the anti-miR-29 treated liver cells.

Table 4 also shows transcription factors that might be involved. Treatment with anti-miR-29 oligonucleotides is associated with reduced expression of Srebf1 (sterol regulatory element binding transcription factor 1) target genes. Srebf1 is regulator of lipid homeostatsis [25] and directly binds to sterol regulatory element-1 (SRE1) flanking LDL receptor genes and other related genes. This suggests that lower expression of miR-29 might hinder the normal function of hepatic cells in metabolism. Therefore high expression of miR-29 expression in hepatic cells is required for lipid metabolism and that Srebf1 might be downstream of miR-29. There does not appear to be any existing evidence linking miR-29 with Srebf1. This is a directly testable hypothesis for further experimental studies regarding the function of miR-29.

Table 4 also indicates that anti-miR-29 treatment lead to increased expression of Smad3 (SMAD family member 3) target genes. It is well-established that Smad3 is a key player in TGFβ-mediated fibrosis, tumor suppression and metastasis [26]. Qin et al. provided evidence that Smad3-mediated suppression of miR-29 expression by TGFβ1 is achieved by direct binding to the promoter of miR-29 [27], and overexpression of miR-29b inhibits the collagen I and III and prevents renal fibrosis. In our analysis, suppression of miR-29 leads to the upregulation of Smad3 target genes, suggesting that Smad3 and miR-29 might form a negative regulation loop. Indeed, Xiao et al. [28] provided some evidence that gene transfer of miR-29 was able to block bleomycin-induced pulmonary fibrosis by suppressing the expression of TGFβ-1and inhibiting Smad3 phosphorylation.

To further confirm these findings, we analyzed another expression data set (GSE27035), in which fetal astrocytes were transfected with miR-29 [29]. We obtained similar results (data not shown) regarding the extracellular matrix related gene sets. For transcriptional factors, E2F family members are highly significant, which is not observed in hepatic cells in the previous dataset. But Smad3 targets genes are downregulated, suggesting that the universal regulation of extracellular matrix genes by miR-29 might be related to Smad3.

Identifying transcription factors from gene expression data

In another example, we used our knowledgebase to detect transcription factors (TFs) responsible for tissue-specific gene expression. We analyzed a large gene expression dataset consisting of 3 or 4 biological replicates for each of the 24 mouse tissues [30] (NCBI accession number GSE24207). We used a subset of 373 predicted TF target gene sets in our analysis using PGSEA. Fig. 2 lists top 30 most significant TFs associated with various tissues. Many highly significant TFs are known to be involved in different organs. The most significant are the hepatocyte nuclear factors (HNF4A, HNF1A, and FOXA2/HNF3B) that are highly expressed in the liver and are supported by many studies to be involved in liver development [31]. The target genes of SPI1 (Spleen focus forming virus proviral integration oncogene, also listed as SFPI1) are highly expressed in spleen and bone marrow, in agreement with the fact that SPI1 is an ETS-domain transcription factor involved in myeloid and B-lymphoid cell development [32]. Another very highly significant TF, Steroidogenic Factor-1 (NR5A1), is found to be responsible for adrenal gland-specific expression profile. This nuclear receptor is known to play an important role in adrenal development and function and mutations in this protein are associated with adrenal hypolasia [33]. PPARG (peroxisome proliferator activated receptor gamma) is essential for the differentiation of adipose tissue [34]. In our result, its target genes were found to be significant highly expressed in the adipose tissue. Most of the TFs in Fig. 2 are supported by previous studies in the literature.

Meta-analysis of published gene lists yields insight into blindness and visual perception

The comprehensive dataset can also serve as an information source on published expression signatures. For example, using a keyword “retina” to conduct a search at our web site, we retrieved 56 published signatures from 26 genome-wide expression studies of the retina cells. Detailed information on each of gene lists, including links to PubMed, is available for further examination.

Meta-analysis of a set of retrieved signatures can provide insights into the relationship among multiple previously published expression profiles. We conducted an all-versus-all overlapping analysis of these 56 gene lists. Following the method developed previously [35], we generated a network where nodes correspond to gene lists and edges represent significant overlaps between them (Fig.4). We found 52 significant overlaps (FDR < 0.0001) among 25 gene lists. Interestingly, the overlaps define two groups with high similarity within each group and very little in between. Group A on the left side of Fig. 4 includes gene lists that are upregulated in ageing, bright-light-damaged retina, or other injury, as well as hypoxia treatment from 3 independent studies[36–38]. Retinal hypoxia is believed to be the mechanism of blinding underlying several diseases [39] and have been subjected to studies using animal models. Our results suggest that there is significant similarity in gene expression response across multiple studies. We identified the most frequently shared genes across these 10 gene lists in group A. Table 5 lists 29 genes that are shared by 3 or more gene lists in group A gene lists. These 29 genes are significantly enriched in genes related to inflammatory response (P value < 1.80E-05), and immune response (P value < 5.30E-03). This is in agreement with previous finding regarding the activation of inflammatory response upon hypoxia treatment [39]. As shown in Table 5, the most commonly upregulated genes in group A is the glial fibrillary acidic protein (GFAP), which was initially discovered as an indicator of stress in astrocytes in the brain, but its activation in the radial glia (Müller cells) is also known to signal stress in the retina [40]. Based on the consensus of 8 genome-wide expression studies, our new gene lists in Table 5 thus could serve as marker for stress response in the retina caused by hypoxia or other injuries.

Fig. 3.

Significantly altered gene sets in normal tissues. Red represents higher expression of a set of genes regulated by a transcription factor.

Fig. 4.

Overlapping analysis of 46 published lists of differentially expressed genes related to the retina. Each note represents a gene list. Edges represent significant overlaps. Edge labels are the number of genes shared and the width indicates statistical significance. Highlighted in yellow are lists related to hypoxia.

View this table:

Table 5.

Genes frequently appeared in gene set cluster A. Genes in bold are related to inflammatory response. Freq.: frequency, the number of gene sets with the gene.

On the other hand, the 15 gene lists in group B on the right side of Fig. 4 have different biological theme. Three of the gene lists (ZHANG_RETINAL-EXPLANT_RB1_DN, COTTET_RPE65_RETINA_DN, DEL-TORO_EC-DLL4_RETINAS_DN) are genes downregulated in retina with mutated genes (Rb1[41], RPE65[42], and DLL4[43]), compared with normal retina. Another gene list includes genes downregualted in hypoxic retina [36]. This seems to suggest that gene lists in this group might include genes specifically required for normal photoreceptive function of the retina cells. This is confirmed by examining the frequently appearing genes in this group. Among the 38 genes (Table 6) that are shared by 3 or more lists in this group, half of them are related to visual perception according to GO, which is extremely significant (P < 1.2E-28). The most frequently appearing gene ARR3 (arrestin 3, retinal) are predicted to play an important role in retina-specific signal transduction with possible binding to photoactivated-phosphorylated opsins, including OPN1SW (opsin 1) that are shared by 7 of the 15 gene lists (Table 6). Based on multiple studies, table 6 serves as a reliable list of retina-specific genes important for photoreception.

View this table:

Table 6.

Genes frequently appeared in gene sets cluster B. Genes in bold are related to visual perception according to GO.

Overall, through meta-analysis of 56 retina-related gene lists, we revealed two fundamentally different types of gene expression changes in the retina. One is related to stress response and inflammatory response that are blamed for blindness in many diseases; the other is a set of genes that are required by visual perception by the retina cells. Our database could be used to conduct similar meta-analysis using various search keywords.

CONCLUSION

We have created a comprehensive gene set database for pathway analysis in mouse. We also demonstrated that this database could be used with different pathway analysis software to gain insights into genome-wide expression profiles. For further improvement of knowledgebase, we will update the database from existing sources, and also continue to improve the accuracy of the existing curation, and search for additional published gene lists.

FUNDING

This work was supported in part by the National Institutes of Health (R01GM083226 and R01CA121941). This material is based upon work supported partially by the National Science Foundation/EPSCoR Award No. IIA-1355423 and by the state of South Dakota.

ACKNOWLEDGEMENTS

The authors thank Drs. Jill Mesirov and Arthur Liberzon for advices on gene list collection, and the reviewers for many constructive suggestions

References

1.↵
Huang da W, Sherman BT, Lempicki RA: Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 2009, 37:1–13.
OpenUrl CrossRef PubMed Web of Science
2.↵
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 2005, 102:15545–15550.
OpenUrl Abstract/FREE Full Text
3.↵
Kim SY, Volsky DJ: PAGE: parametric analysis of gene set enrichment. BMC Bioinformatics 2005, 6:144.
OpenUrl CrossRef PubMed
4.↵
Irizarry RA, Wang C, Zhou Y, Speed TP: Gene set enrichment analysis made simple. Stat Methods Med Res 2009, 18:565–575.
OpenUrl CrossRef PubMed Web of Science
5.↵
Liberzon A, Subramanian A, Pinchback R, Thorvaldsdottir H, Tamayo P, Mesirov JP: Molecular signatures database (MSigDB) 3.0. Bioinformatics 2011, 27:1739–1740.
OpenUrl CrossRef PubMed Web of Science
6.↵
Ellis J, Goodswen S, Kennedy PJ, Bush S: The core mouse response to infection by neospora caninum defined by gene set enrichment analyses. Bioinform Biol Insights 2012, 6:187–202.
OpenUrl PubMed
7.↵
Backes C, Keller A, Kuentzer J, Kneissl B, Comtesse N, Elnakady YA, Muller R, Meese E, Lenhof HP: GeneTrail--advanced gene set enrichment analysis. Nucleic Acids Res 2007, 35:W186–192.
OpenUrl CrossRef PubMed Web of Science
8.↵
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000, 25:25–29.
OpenUrl CrossRef PubMed Web of Science
9.↵
Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, Kawashima S, Katayama T, Araki M, Hirakawa M: From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res 2006, 34:D354–357.
OpenUrl CrossRef PubMed Web of Science
10.↵
Araki H, Knapp C, Tsai P, Print C: GeneSetDB: A comprehensive meta-database, statistical and visualisation framework for gene set analysis. FEBS Open Bio 2012, 2:76–82.
OpenUrl CrossRef PubMed Web of Science
11.↵
Huang H, Wu X, Sonachalam M, Mandape SN, Pandey R, MacDorman KF, Wan P, Chen JY: PAGED: a pathway and gene-set enrichment database to enable molecular phenotype discoveries. BMC Bioinformatics 2012, 13 Suppl 15:S2.
OpenUrl CrossRef PubMed
12.↵
Lai L, Liberzon A, Hennessey J, Jiang G, Qi J, Mesirov JP, Ge SX: AraPath: a knowledgebase for pathway analysis in Arabidopsis. Bioinformatics 2012, 28:2291–2292.
OpenUrl CrossRef PubMed Web of Science
13.↵
Gaudet P, Bairoch A, Field D, Sansone SA, Taylor C, Attwood TK, Bateman A, Blake JA, Bult CJ, Cherry JM, et al: Towards BioDBcore: a community-defined information specification for biological databases. Database (Oxford) 2011, 2011:baq027.
OpenUrl PubMed
14.↵
Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, Speed TP: Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 2003, 4:249–264.
OpenUrl CrossRef PubMed Web of Science
15.↵
Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, et al: Bioconductor: open software development for computational biology and bioinformatics. Genome Biol 2004, 5:R80.
OpenUrl CrossRef PubMed
16.↵
Furge K, Dykema K: PGSEA: Parametric Gene Set Enrichment Analysis. R package version 1360 2012.
17.↵
Hand NJ, Horner AM, Master ZR, Boateng LA, LeGuen C, Uvaydova M, Friedman JR: MicroRNA profiling identifies miR-29 as a regulator of disease-associated pathways in experimental biliary atresia. J Pediatr Gastroenterol Nutr 2012, 54:186–192.
OpenUrl CrossRef PubMed
18.↵
Holl D, Kuckenberg P, Woynecki T, Egert A, Becker A, Huss S, Stabenow D, Zimmer A, Knolle P, Tolba R, et al: Transgenic overexpression of Tcfap2c/AP-2gamma results in liver failure and intestinal dysplasia. PLoS One 2011, 6:e22034.
OpenUrl PubMed
19.↵
Wang P, Renes J, Bouwman F, Bunschoten A, Mariman E, Keijer J: Absence of an adipogenic effect of rosiglitazone on mature 3T3-L1 adipocytes: increase of lipid catabolism and reduction of adipokine expression. Diabetologia 2007, 50:654–665.
OpenUrl CrossRef PubMed Web of Science
20.↵
Nones K, Dommels YE, Martell S, Butts C, McNabb WC, Park ZA, Zhu S, Hedderley D, Barnett MP, Roy NC: The effects of dietary curcumin and rutin on colonic inflammation and gene expression in multidrug resistance gene-deficient (mdr1a-/-) mice, a model of inflammatory bowel diseases. Br J Nutr 2009, 101:169–181.
OpenUrl CrossRef PubMed
21.↵
Roderburg C, Urban GW, Bettermann K, Vucur M, Zimmermann H, Schmidt S, Janssen J, Koppe C, Knolle P, Castoldi M, et al: Micro-RNA profiling reveals a role for miR-29 in human and murine liver fibrosis. Hepatology 2011, 53:209–218.
OpenUrl CrossRef PubMed Web of Science
22.↵
van Rooij E, Sutherland LB, Thatcher JE, DiMaio JM, Naseem RH, Marshall WS, Hill JA, Olson EN: Dysregulation of microRNAs after myocardial infarction reveals a role of miR-29 in cardiac fibrosis. Proc Natl Acad Sci U S A 2008, 105:13027–13032.
OpenUrl Abstract/FREE Full Text
23.↵
Ogawa T, Iizuka M, Sekiya Y, Yoshizato K, Ikeda K, Kawada N: Suppression of type I collagen production by microRNA-29b in cultured human stellate cells. Biochem Biophys Res Commun 2010, 391:316–321.
OpenUrl CrossRef PubMed Web of Science
24.↵
Du B, Ma LM, Huang MB, Zhou H, Huang HL, Shao P, Chen YQ, Qu LH: High glucose down-regulates miR-29a to increase collagen IV production in HK-2 cells. FEBS Lett 2010, 584:811–816.
OpenUrl CrossRef PubMed Web of Science
25.↵
Yokoyama C, Wang X, Briggs MR, Admon A, Wu J, Hua X, Goldstein JL, Brown MS: SREBP-1, a basic-helix-loop-helix-leucine zipper protein that controls transcription of the low density lipoprotein receptor gene. Cell 1993, 75:187–197.
OpenUrl CrossRef PubMed Web of Science
26.↵
Roberts AB, Tian F, Byfield SD, Stuelten C, Ooshima A, Saika S, Flanders KC: Smad3 is key to TGF-beta-mediated epithelial-to-mesenchymal transition, fibrosis, tumor suppression and metastasis. Cytokine Growth Factor Rev 2006, 17:19–27.
OpenUrl CrossRef PubMed Web of Science
27.↵
Qin W, Chung AC, Huang XR, Meng XM, Hui DS, Yu CM, Sung JJ, Lan HY: TGF-beta/Smad3 signaling promotes renal fibrosis by inhibiting miR-29. J Am Soc Nephrol 2011, 22:1462–1474.
OpenUrl Abstract/FREE Full Text
28.↵
Xiao J, Meng XM, Huang XR, Chung AC, Feng YL, Hui DS, Yu CM, Sung JJ, Lan HY: miR-29 inhibits bleomycin-induced pulmonary fibrosis in mice. Mol Ther 2012, 20:1251–1260.
OpenUrl CrossRef PubMed
29.↵
Tanabe H, Kohno N, Yamaguchi M: Global profiling of gene expression in mouse astrocyte in response to the potential longevity determinant miR-29. Mem Fac Agr Kinki Univ 2012, 45:1–16.
OpenUrl
30.↵
Thorrez L, Laudadio I, Van Deun K, Quintens R, Hendrickx N, Granvik M, Lemaire K, Schraenen A, Van Lommel L, Lehnert S, et al: Tissue-specific disallowance of housekeeping genes: the other face of cell differentiation. Genome Res 2011, 21:95–105.
OpenUrl Abstract/FREE Full Text
31.↵
Parviz F, Matullo C, Garrison WD, Savatski L, Adamson JW, Ning G, Kaestner KH, Rossi JM, Zaret KS, Duncan SA: Hepatocyte nuclear factor 4alpha controls the development of a hepatic epithelium and liver morphogenesis. Nat Genet 2003, 34:292–296.
OpenUrl CrossRef PubMed Web of Science
32.↵
Moreau-Gachelin F, Tavitian A, Tambourin P: Spi-1 is a putative oncogene in virally induced murine erythroleukaemias. Nature 1988, 331:277–280.
OpenUrl CrossRef PubMed
33.↵
Phelan JK, McCabe ER: Mutations in NR0B1 (DAX1) and NR5A1 (SF1) responsible for adrenal hypoplasia congenita. Hum Mutat 2001, 18:472–487.
OpenUrl CrossRef PubMed Web of Science
34.↵
Rosen ED, Sarraf P, Troy AE, Bradwin G, Moore K, Milstone DS, Spiegelman BM, Mortensen RM: PPAR gamma is required for the differentiation of adipose tissue in vivo and in vitro. Mol Cell 1999, 4:611–617.
OpenUrl CrossRef PubMed Web of Science
35.↵
Ge SX: Large-scale analysis of expression signatures reveals hidden links among diverse cellular processes. BMC Syst Biol 2011, 5:87.
OpenUrl CrossRef PubMed
36.↵
Ishikawa K, Yoshida S, Kadota K, Nakamura T, Niiro H, Arakawa S, Yoshida A, Akashi K, Ishibashi T: Gene expression profile of hyperoxic and hypoxic retinas in a mouse model of oxygen-induced retinopathy. Invest Ophthalmol Vis Sci 2010, 51:4307–4319.
OpenUrl Abstract/FREE Full Text
37.
Rattner A, Nathans J: The genomic response to retinal disease and injury: evidence for endothelin signaling from photoreceptors to glia. J Neurosci 2005, 25:4540–4549.
OpenUrl Abstract/FREE Full Text
38.↵
Zhu Y, Natoli R, Valter K, Stone J: Differential gene expression in mouse retina related to regional differences in vulnerability to hyperoxia. Mol Vis 2010, 16:740–755.
OpenUrl PubMed
39.↵
Kaur C, Foulds WS, Ling EA: Hypoxia-ischemia and retinal ganglion cell damage. Clin Ophthalmol 2008, 2:879–889.
OpenUrl PubMed
40.↵
Lewis GP, Fisher SK: Up-regulation of glial fibrillary acidic protein in response to retinal injury: its potential role in glial remodeling and a comparison to vimentin expression. Int Rev Cytol 2003, 230:263–290.
OpenUrl CrossRef PubMed Web of Science
41.↵
Zhang J, Gray J, Wu L, Leone G, Rowan S, Cepko CL, Zhu X, Craft CM, Dyer MA: Rb regulates proliferation and rod photoreceptor development in the mouse retina. Nat Genet 2004, 36:351–360.
OpenUrl CrossRef PubMed Web of Science
42.↵
Cottet S, Michaut L, Boisset G, Schlecht U, Gehring W, Schorderet DF: Biological characterization of gene response in Rpe65-/- mouse model of Leber’s congenital amaurosis during progression of the disease. FASEB J 2006, 20:2036–2049.
OpenUrl CrossRef PubMed Web of Science
43.↵
del Toro R, Prahst C, Mathivet T, Siegfried G, Kaminker JS, Larrivee B, Breant C, Duarte A, Takakura N, Fukamizu A, et al: Identification and functional analysis of endothelial tip cell-enriched genes. Blood 2010, 116:4025–4033.
OpenUrl Abstract/FREE Full Text
44.
Newman JC, Weiner AM: L2L: a simple tool for discovering the hidden significance in microarray expression data. Genome Biol 2005, 6:R81.
OpenUrl CrossRef PubMed
45.
Higgins ME, Claremont M, Major JE, Sander C, Lash AE: CancerGenes: a gene selection resource for cancer genome projects. Nucleic Acids Res 2007, 35:D721–726.
OpenUrl CrossRef PubMed Web of Science
46.
Culhane AC, Schroder MS, Sultana R, Picard SC, Martinelli EN, Kelly C, Haibe-Kains B, Kapushesky M, St Pierre AA, Flahive W, et al: GeneSigDB: a manually curated database and resource for analysis of gene expression signatures. Nucleic Acids Res 2012, 40:D1060–1066.
OpenUrl CrossRef PubMed Web of Science
47.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al: Gene Ontology: tool for the unification of biology. Nature Genetics 2000, 25:25–29.
OpenUrl CrossRef PubMed Web of Science
48.
Mi H, Lazareva-Ulitsky B, Loo R, Kejariwal A, Vandergriff J, Rabkin S, Guo N, Muruganujan A, Doremieux O, Campbell MJ, et al: The PANTHER database of protein families, subfamilies, functions and pathways. Nucleic Acids Res 2005, 33:D284–288.
OpenUrl CrossRef PubMed Web of Science
49.
Kelder T, van Iersel MP, Hanspers K, Kutmon M, Conklin BR, Evelo CT, Pico AR: WikiPathways: building research communities on biological pathways. Nucleic Acids Res 2012, 40:D1301–1307.
OpenUrl CrossRef PubMed Web of Science
50.
Yamamoto S, Sakai N, Nakamura H, Fukagawa H, Fukuda K, Takagi T: INOH: ontology-based highly structured database of signal transduction pathways. Database (Oxford) 2011, 2011:bar052.
OpenUrl
51.
Kandasamy K, Mohan SS, Raju R, Keerthikumar S, Kumar GS, Venugopal AK, Telikicherla D, Navarro JD, Mathivanan S, Pecquet C, et al: NetPath: a public resource of curated signal transduction pathways. Genome Biol 2010, 11:R3.
OpenUrl CrossRef PubMed
52.
Jupe S, Akkerman JW, Soranzo N, Ouwehand WH: Reactome - a curated knowledgebase of biological pathways: megakaryocytes and platelets. J Thromb Haemost 2012.
53.
Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, Kawashima S, Katayama T, Araki M, Hirakawa M: From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Research 2006, 34:D354–D357.
OpenUrl CrossRef PubMed Web of Science
54.
Ma H, Sorokin A, Mazein A, Selkov A, Selkov E, Demin O, Goryanin I: The Edinburgh human metabolic network reconstruction and its functional analysis. Mol Syst Biol 2007, 3:135.
OpenUrl Abstract/FREE Full Text
55.
Evsikov AV, Dolan ME, Genrich MP, Patek E, Bult CJ: MouseCyc: a curated biochemical pathways database for the laboratory mouse. Genome Biol 2009, 10:R84.
OpenUrl CrossRef PubMed
56.
Mattingly CJ, Rosenstein MC, Colby GT, Forrest JN, Jr.., Boyer JL: The Comparative Toxicogenomics Database (CTD): a resource for comparative toxicological studies. J Exp Zool A Comp Exp Biol 2006, 305:689–692.
OpenUrl PubMed
57.
Kuhn M, Campillos M, Letunic I, Jensen LJ, Bork P: A side effect resource to capture phenotypic effects of drugs. Mol Syst Biol 2010, 6:343.
OpenUrl Abstract/FREE Full Text
58.
Gunther S, Kuhn M, Dunkel M, Campillos M, Senger C, Petsalaki E, Ahmed J, Urdiales EG, Gewiess A, Jensen LJ, et al: SuperTarget and Matador: resources for exploring drug-target relationships. Nucleic Acids Res 2008, 36:D919–922.
OpenUrl CrossRef PubMed Web of Science
59.
Wishart DS, Knox C, Guo AC, Cheng D, Shrivastava S, Tzur D, Gautam B, Hassanali M: DrugBank: a knowledgebase for drugs, drug actions and drug targets. Nucleic Acids Res 2008, 36:D901–906.
OpenUrl CrossRef PubMed Web of Science
60.
Frolkis A, Knox C, Lim E, Jewison T, Law V, Hau DD, Liu P, Gautam B, Ly S, Guo AC, et al: SMPDB: The Small Molecule Pathway Database. Nucleic Acids Res 2010, 38:D480–487.
OpenUrl CrossRef PubMed Web of Science
61.
Wang X: miRDB: a microRNA target prediction and functional annotation database with a wiki interface. RNA 2008, 14:1012–1017.
OpenUrl Abstract/FREE Full Text
62.
Betel D, Wilson M, Gabow A, Marks DS, Sander C: The microRNA.org resource: targets and expression. Nucleic Acids Res 2008, 36:D149–153.
OpenUrl CrossRef PubMed Web of Science
63.
Grimson A, Farh KK, Johnston WK, Garrett-Engele P, Lim LP, Bartel DP: MicroRNA targeting specificity in mammals: determinants beyond seed pairing. Mol Cell 2007, 27:91–105.
OpenUrl CrossRef PubMed Web of Science
64.
Vergoulis T, Vlachos IS, Alexiou P, Georgakilas G, Maragkakis M, Reczko M, Gerangelos S, Koziris N, Dalamagas T, Hatzigeorgiou AG: TarBase 6.0: capturing the exponential growth of miRNA targets with experimental support. Nucleic Acids Res 2012, 40:D222–229.
OpenUrl CrossRef PubMed Web of Science
65.
Hsu SD, Lin FM, Wu WY, Liang C, Huang WC, Chan WL, Tsai WT, Chen GZ, Lee CJ, Chiu CM, et al: miRTarBase: a database curates experimentally validated microRNA-target interactions. Nucleic Acids Res 2011, 39:D163–169.
OpenUrl CrossRef PubMed Web of Science
66.
Krek A, Grun D, Poy MN, Wolf R, Rosenberg L, Epstein EJ, MacMenamin P, da Piedade I, Gunsalus KC, Stoffel M, Rajewsky N: Combinatorial microRNA target predictions. Nat Genet 2005, 37:495–500.
OpenUrl CrossRef PubMed Web of Science
67.
Essaghir A, Toffalini F, Knoops L, Kallin A, van Helden J, Demoulin JB: Transcription factor regulation can be accurately predicted from the presence of target gene signatures in microarray gene expression data. Nucleic Acids Res 2010, 38:e120.
OpenUrl CrossRef PubMed
68.
Zhao F, Xuan Z, Liu L, Zhang MQ: TRED: a Transcriptional Regulatory Element Database and a platform for in silico gene regulation studies. Nucleic Acids Res 2005, 33:D103–107.
OpenUrl CrossRef PubMed Web of Science
69.
Friard O, Re A, Taverna D, De Bortoli M, Cora D: CircuitsDB: a database of mixed microRNA/transcription factor feed-forward regulatory circuits in human and mouse. BMC Bioinformatics 2010, 11:435.
OpenUrl CrossRef PubMed
70.
Wingender E, Chen X, Hehl R, Karas H, Liebich I, Matys V, Meinhardt T, Pruss M, Reuter I, Schacherer F: TRANSFAC: an integrated system for gene expression regulation. Nucleic Acids Res 2000, 28:316–319.
OpenUrl CrossRef PubMed Web of Science
71.
Robinson PN, Mundlos S: The human phenotype ontology. Clin Genet 2010, 77:525–534.
OpenUrl CrossRef PubMed Web of Science
72.
Kuhn M, von Mering C, Campillos M, Jensen LJ, Bork P: STITCH: interaction networks of chemicals and proteins. Nucleic Acids Res 2008, 36:D684–688.
OpenUrl CrossRef PubMed Web of Science
73.
Smith CL, Eppig JT: The Mammalian Phenotype Ontology as a unifying standard for experimental and high-throughput phenotyping data. Mamm Genome 2012, 23:653–668.
OpenUrl CrossRef PubMed Web of Science
74.
Lim E, Pon A, Djoumbou Y, Knox C, Shrivastava S, Guo AC, Neveu V, Wishart DS: T3DB: a comprehensively annotated database of common toxins and their targets. Nucleic Acids Res 2010, 38:D781–786.
OpenUrl CrossRef PubMed Web of Science
75.
Schaefer CF, Anthony K, Krupa S, Buchoff J, Day M, Hannay T, Buetow KH: PID: the Pathway Interaction Database. Nucleic Acids Res 2009, 37:D674–679.
OpenUrl CrossRef PubMed Web of Science
76.
He X, Chang S, Zhang J, Zhao Q, Xiang H, Kusonmano K, Yang L, Sun ZS, Yang H, Wang J: MethyCancer: the database of human DNA methylation and cancer. Nucleic Acids Res 2008, 36:D836–841.
OpenUrl CrossRef PubMed Web of Science
77.
Lauss M, Visne I, Weinhaeusel A, Vierlinger K, Noehammer C, Kriegner A: MethCancerDB--aberrant DNA methylation in human cancer. Br J Cancer 2008, 98:816–817.
OpenUrl CrossRef PubMed Web of Science

View the discussion thread.

Posted October 24, 2016.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Bioinformatics

Subject Areas

All Articles

Animal Behavior and Cognition (5204)
Biochemistry (11725)
Bioengineering (8728)
Bioinformatics (29135)
Biophysics (14940)
Cancer Biology (12052)
Cell Biology (17363)
Clinical Trials (138)
Developmental Biology (9408)
Ecology (14148)
Epidemiology (2067)
Evolutionary Biology (18273)
Genetics (12223)
Genomics (16773)
Immunology (11844)
Microbiology (28027)
Molecular Biology (11564)
Neuroscience (60843)
Paleontology (451)
Pathology (1864)
Pharmacology and Toxicology (3232)
Physiology (4940)
Plant Biology (10405)
Scientific Communication and Education (1681)
Synthetic Biology (2878)
Systems Biology (7335)
Zoology (1642)

[1] 1.↵
Huang da W, Sherman BT, Lempicki RA: Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res 2009, 37:1–13.
OpenUrl CrossRef PubMed Web of Science

[2] 2.↵
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 2005, 102:15545–15550.
OpenUrl Abstract/FREE Full Text

[3] 3.↵
Kim SY, Volsky DJ: PAGE: parametric analysis of gene set enrichment. BMC Bioinformatics 2005, 6:144.
OpenUrl CrossRef PubMed

[4] 4.↵
Irizarry RA, Wang C, Zhou Y, Speed TP: Gene set enrichment analysis made simple. Stat Methods Med Res 2009, 18:565–575.
OpenUrl CrossRef PubMed Web of Science

[5] 5.↵
Liberzon A, Subramanian A, Pinchback R, Thorvaldsdottir H, Tamayo P, Mesirov JP: Molecular signatures database (MSigDB) 3.0. Bioinformatics 2011, 27:1739–1740.
OpenUrl CrossRef PubMed Web of Science

[6] 6.↵
Ellis J, Goodswen S, Kennedy PJ, Bush S: The core mouse response to infection by neospora caninum defined by gene set enrichment analyses. Bioinform Biol Insights 2012, 6:187–202.
OpenUrl PubMed

[7] 7.↵
Backes C, Keller A, Kuentzer J, Kneissl B, Comtesse N, Elnakady YA, Muller R, Meese E, Lenhof HP: GeneTrail--advanced gene set enrichment analysis. Nucleic Acids Res 2007, 35:W186–192.
OpenUrl CrossRef PubMed Web of Science

[8] 8.↵
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 2000, 25:25–29.
OpenUrl CrossRef PubMed Web of Science

[9] 9.↵
Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, Kawashima S, Katayama T, Araki M, Hirakawa M: From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res 2006, 34:D354–357.
OpenUrl CrossRef PubMed Web of Science

[10] 10.↵
Araki H, Knapp C, Tsai P, Print C: GeneSetDB: A comprehensive meta-database, statistical and visualisation framework for gene set analysis. FEBS Open Bio 2012, 2:76–82.
OpenUrl CrossRef PubMed Web of Science

[11] 11.↵
Huang H, Wu X, Sonachalam M, Mandape SN, Pandey R, MacDorman KF, Wan P, Chen JY: PAGED: a pathway and gene-set enrichment database to enable molecular phenotype discoveries. BMC Bioinformatics 2012, 13 Suppl 15:S2.
OpenUrl CrossRef PubMed

[12] 12.↵
Lai L, Liberzon A, Hennessey J, Jiang G, Qi J, Mesirov JP, Ge SX: AraPath: a knowledgebase for pathway analysis in Arabidopsis. Bioinformatics 2012, 28:2291–2292.
OpenUrl CrossRef PubMed Web of Science

[13] 13.↵
Gaudet P, Bairoch A, Field D, Sansone SA, Taylor C, Attwood TK, Bateman A, Blake JA, Bult CJ, Cherry JM, et al: Towards BioDBcore: a community-defined information specification for biological databases. Database (Oxford) 2011, 2011:baq027.
OpenUrl PubMed

[14] 14.↵
Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, Speed TP: Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 2003, 4:249–264.
OpenUrl CrossRef PubMed Web of Science

[15] 15.↵
Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, et al: Bioconductor: open software development for computational biology and bioinformatics. Genome Biol 2004, 5:R80.
OpenUrl CrossRef PubMed

[16] 16.↵
Furge K, Dykema K: PGSEA: Parametric Gene Set Enrichment Analysis. R package version 1360 2012.

[17] 17.↵
Hand NJ, Horner AM, Master ZR, Boateng LA, LeGuen C, Uvaydova M, Friedman JR: MicroRNA profiling identifies miR-29 as a regulator of disease-associated pathways in experimental biliary atresia. J Pediatr Gastroenterol Nutr 2012, 54:186–192.
OpenUrl CrossRef PubMed

[18] 18.↵
Holl D, Kuckenberg P, Woynecki T, Egert A, Becker A, Huss S, Stabenow D, Zimmer A, Knolle P, Tolba R, et al: Transgenic overexpression of Tcfap2c/AP-2gamma results in liver failure and intestinal dysplasia. PLoS One 2011, 6:e22034.
OpenUrl PubMed

[19] 19.↵
Wang P, Renes J, Bouwman F, Bunschoten A, Mariman E, Keijer J: Absence of an adipogenic effect of rosiglitazone on mature 3T3-L1 adipocytes: increase of lipid catabolism and reduction of adipokine expression. Diabetologia 2007, 50:654–665.
OpenUrl CrossRef PubMed Web of Science

[20] 20.↵
Nones K, Dommels YE, Martell S, Butts C, McNabb WC, Park ZA, Zhu S, Hedderley D, Barnett MP, Roy NC: The effects of dietary curcumin and rutin on colonic inflammation and gene expression in multidrug resistance gene-deficient (mdr1a-/-) mice, a model of inflammatory bowel diseases. Br J Nutr 2009, 101:169–181.
OpenUrl CrossRef PubMed

[21] 21.↵
Roderburg C, Urban GW, Bettermann K, Vucur M, Zimmermann H, Schmidt S, Janssen J, Koppe C, Knolle P, Castoldi M, et al: Micro-RNA profiling reveals a role for miR-29 in human and murine liver fibrosis. Hepatology 2011, 53:209–218.
OpenUrl CrossRef PubMed Web of Science

[22] 22.↵
van Rooij E, Sutherland LB, Thatcher JE, DiMaio JM, Naseem RH, Marshall WS, Hill JA, Olson EN: Dysregulation of microRNAs after myocardial infarction reveals a role of miR-29 in cardiac fibrosis. Proc Natl Acad Sci U S A 2008, 105:13027–13032.
OpenUrl Abstract/FREE Full Text

[23] 23.↵
Ogawa T, Iizuka M, Sekiya Y, Yoshizato K, Ikeda K, Kawada N: Suppression of type I collagen production by microRNA-29b in cultured human stellate cells. Biochem Biophys Res Commun 2010, 391:316–321.
OpenUrl CrossRef PubMed Web of Science

[24] 24.↵
Du B, Ma LM, Huang MB, Zhou H, Huang HL, Shao P, Chen YQ, Qu LH: High glucose down-regulates miR-29a to increase collagen IV production in HK-2 cells. FEBS Lett 2010, 584:811–816.
OpenUrl CrossRef PubMed Web of Science

[25] 25.↵
Yokoyama C, Wang X, Briggs MR, Admon A, Wu J, Hua X, Goldstein JL, Brown MS: SREBP-1, a basic-helix-loop-helix-leucine zipper protein that controls transcription of the low density lipoprotein receptor gene. Cell 1993, 75:187–197.
OpenUrl CrossRef PubMed Web of Science

[26] 26.↵
Roberts AB, Tian F, Byfield SD, Stuelten C, Ooshima A, Saika S, Flanders KC: Smad3 is key to TGF-beta-mediated epithelial-to-mesenchymal transition, fibrosis, tumor suppression and metastasis. Cytokine Growth Factor Rev 2006, 17:19–27.
OpenUrl CrossRef PubMed Web of Science

[27] 27.↵
Qin W, Chung AC, Huang XR, Meng XM, Hui DS, Yu CM, Sung JJ, Lan HY: TGF-beta/Smad3 signaling promotes renal fibrosis by inhibiting miR-29. J Am Soc Nephrol 2011, 22:1462–1474.
OpenUrl Abstract/FREE Full Text

[28] 28.↵
Xiao J, Meng XM, Huang XR, Chung AC, Feng YL, Hui DS, Yu CM, Sung JJ, Lan HY: miR-29 inhibits bleomycin-induced pulmonary fibrosis in mice. Mol Ther 2012, 20:1251–1260.
OpenUrl CrossRef PubMed

[29] 29.↵
Tanabe H, Kohno N, Yamaguchi M: Global profiling of gene expression in mouse astrocyte in response to the potential longevity determinant miR-29. Mem Fac Agr Kinki Univ 2012, 45:1–16.
OpenUrl

[30] 30.↵
Thorrez L, Laudadio I, Van Deun K, Quintens R, Hendrickx N, Granvik M, Lemaire K, Schraenen A, Van Lommel L, Lehnert S, et al: Tissue-specific disallowance of housekeeping genes: the other face of cell differentiation. Genome Res 2011, 21:95–105.
OpenUrl Abstract/FREE Full Text

[31] 31.↵
Parviz F, Matullo C, Garrison WD, Savatski L, Adamson JW, Ning G, Kaestner KH, Rossi JM, Zaret KS, Duncan SA: Hepatocyte nuclear factor 4alpha controls the development of a hepatic epithelium and liver morphogenesis. Nat Genet 2003, 34:292–296.
OpenUrl CrossRef PubMed Web of Science

[32] 32.↵
Moreau-Gachelin F, Tavitian A, Tambourin P: Spi-1 is a putative oncogene in virally induced murine erythroleukaemias. Nature 1988, 331:277–280.
OpenUrl CrossRef PubMed

[33] 33.↵
Phelan JK, McCabe ER: Mutations in NR0B1 (DAX1) and NR5A1 (SF1) responsible for adrenal hypoplasia congenita. Hum Mutat 2001, 18:472–487.
OpenUrl CrossRef PubMed Web of Science

[34] 34.↵
Rosen ED, Sarraf P, Troy AE, Bradwin G, Moore K, Milstone DS, Spiegelman BM, Mortensen RM: PPAR gamma is required for the differentiation of adipose tissue in vivo and in vitro. Mol Cell 1999, 4:611–617.
OpenUrl CrossRef PubMed Web of Science

[35] 35.↵
Ge SX: Large-scale analysis of expression signatures reveals hidden links among diverse cellular processes. BMC Syst Biol 2011, 5:87.
OpenUrl CrossRef PubMed

[36] 36.↵
Ishikawa K, Yoshida S, Kadota K, Nakamura T, Niiro H, Arakawa S, Yoshida A, Akashi K, Ishibashi T: Gene expression profile of hyperoxic and hypoxic retinas in a mouse model of oxygen-induced retinopathy. Invest Ophthalmol Vis Sci 2010, 51:4307–4319.
OpenUrl Abstract/FREE Full Text

[37] 37.
Rattner A, Nathans J: The genomic response to retinal disease and injury: evidence for endothelin signaling from photoreceptors to glia. J Neurosci 2005, 25:4540–4549.
OpenUrl Abstract/FREE Full Text

[38] 38.↵
Zhu Y, Natoli R, Valter K, Stone J: Differential gene expression in mouse retina related to regional differences in vulnerability to hyperoxia. Mol Vis 2010, 16:740–755.
OpenUrl PubMed

[39] 39.↵
Kaur C, Foulds WS, Ling EA: Hypoxia-ischemia and retinal ganglion cell damage. Clin Ophthalmol 2008, 2:879–889.
OpenUrl PubMed

[40] 40.↵
Lewis GP, Fisher SK: Up-regulation of glial fibrillary acidic protein in response to retinal injury: its potential role in glial remodeling and a comparison to vimentin expression. Int Rev Cytol 2003, 230:263–290.
OpenUrl CrossRef PubMed Web of Science

[41] 41.↵
Zhang J, Gray J, Wu L, Leone G, Rowan S, Cepko CL, Zhu X, Craft CM, Dyer MA: Rb regulates proliferation and rod photoreceptor development in the mouse retina. Nat Genet 2004, 36:351–360.
OpenUrl CrossRef PubMed Web of Science

[42] 42.↵
Cottet S, Michaut L, Boisset G, Schlecht U, Gehring W, Schorderet DF: Biological characterization of gene response in Rpe65-/- mouse model of Leber’s congenital amaurosis during progression of the disease. FASEB J 2006, 20:2036–2049.
OpenUrl CrossRef PubMed Web of Science

[43] 43.↵
del Toro R, Prahst C, Mathivet T, Siegfried G, Kaminker JS, Larrivee B, Breant C, Duarte A, Takakura N, Fukamizu A, et al: Identification and functional analysis of endothelial tip cell-enriched genes. Blood 2010, 116:4025–4033.
OpenUrl Abstract/FREE Full Text

[44] 44.
Newman JC, Weiner AM: L2L: a simple tool for discovering the hidden significance in microarray expression data. Genome Biol 2005, 6:R81.
OpenUrl CrossRef PubMed

[45] 45.
Higgins ME, Claremont M, Major JE, Sander C, Lash AE: CancerGenes: a gene selection resource for cancer genome projects. Nucleic Acids Res 2007, 35:D721–726.
OpenUrl CrossRef PubMed Web of Science

[46] 46.
Culhane AC, Schroder MS, Sultana R, Picard SC, Martinelli EN, Kelly C, Haibe-Kains B, Kapushesky M, St Pierre AA, Flahive W, et al: GeneSigDB: a manually curated database and resource for analysis of gene expression signatures. Nucleic Acids Res 2012, 40:D1060–1066.
OpenUrl CrossRef PubMed Web of Science

[47] 47.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al: Gene Ontology: tool for the unification of biology. Nature Genetics 2000, 25:25–29.
OpenUrl CrossRef PubMed Web of Science

[48] 48.
Mi H, Lazareva-Ulitsky B, Loo R, Kejariwal A, Vandergriff J, Rabkin S, Guo N, Muruganujan A, Doremieux O, Campbell MJ, et al: The PANTHER database of protein families, subfamilies, functions and pathways. Nucleic Acids Res 2005, 33:D284–288.
OpenUrl CrossRef PubMed Web of Science

[49] 49.
Kelder T, van Iersel MP, Hanspers K, Kutmon M, Conklin BR, Evelo CT, Pico AR: WikiPathways: building research communities on biological pathways. Nucleic Acids Res 2012, 40:D1301–1307.
OpenUrl CrossRef PubMed Web of Science

[50] 50.
Yamamoto S, Sakai N, Nakamura H, Fukagawa H, Fukuda K, Takagi T: INOH: ontology-based highly structured database of signal transduction pathways. Database (Oxford) 2011, 2011:bar052.
OpenUrl

[51] 51.
Kandasamy K, Mohan SS, Raju R, Keerthikumar S, Kumar GS, Venugopal AK, Telikicherla D, Navarro JD, Mathivanan S, Pecquet C, et al: NetPath: a public resource of curated signal transduction pathways. Genome Biol 2010, 11:R3.
OpenUrl CrossRef PubMed

[52] 52.
Jupe S, Akkerman JW, Soranzo N, Ouwehand WH: Reactome - a curated knowledgebase of biological pathways: megakaryocytes and platelets. J Thromb Haemost 2012.

[53] 53.
Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, Kawashima S, Katayama T, Araki M, Hirakawa M: From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Research 2006, 34:D354–D357.
OpenUrl CrossRef PubMed Web of Science

[54] 54.
Ma H, Sorokin A, Mazein A, Selkov A, Selkov E, Demin O, Goryanin I: The Edinburgh human metabolic network reconstruction and its functional analysis. Mol Syst Biol 2007, 3:135.
OpenUrl Abstract/FREE Full Text

[55] 55.
Evsikov AV, Dolan ME, Genrich MP, Patek E, Bult CJ: MouseCyc: a curated biochemical pathways database for the laboratory mouse. Genome Biol 2009, 10:R84.
OpenUrl CrossRef PubMed

[56] 56.
Mattingly CJ, Rosenstein MC, Colby GT, Forrest JN, Jr.., Boyer JL: The Comparative Toxicogenomics Database (CTD): a resource for comparative toxicological studies. J Exp Zool A Comp Exp Biol 2006, 305:689–692.
OpenUrl PubMed

[57] 57.
Kuhn M, Campillos M, Letunic I, Jensen LJ, Bork P: A side effect resource to capture phenotypic effects of drugs. Mol Syst Biol 2010, 6:343.
OpenUrl Abstract/FREE Full Text

[58] 58.
Gunther S, Kuhn M, Dunkel M, Campillos M, Senger C, Petsalaki E, Ahmed J, Urdiales EG, Gewiess A, Jensen LJ, et al: SuperTarget and Matador: resources for exploring drug-target relationships. Nucleic Acids Res 2008, 36:D919–922.
OpenUrl CrossRef PubMed Web of Science

[59] 59.
Wishart DS, Knox C, Guo AC, Cheng D, Shrivastava S, Tzur D, Gautam B, Hassanali M: DrugBank: a knowledgebase for drugs, drug actions and drug targets. Nucleic Acids Res 2008, 36:D901–906.
OpenUrl CrossRef PubMed Web of Science

[60] 60.
Frolkis A, Knox C, Lim E, Jewison T, Law V, Hau DD, Liu P, Gautam B, Ly S, Guo AC, et al: SMPDB: The Small Molecule Pathway Database. Nucleic Acids Res 2010, 38:D480–487.
OpenUrl CrossRef PubMed Web of Science

[61] 61.
Wang X: miRDB: a microRNA target prediction and functional annotation database with a wiki interface. RNA 2008, 14:1012–1017.
OpenUrl Abstract/FREE Full Text

[62] 62.
Betel D, Wilson M, Gabow A, Marks DS, Sander C: The microRNA.org resource: targets and expression. Nucleic Acids Res 2008, 36:D149–153.
OpenUrl CrossRef PubMed Web of Science

[63] 63.
Grimson A, Farh KK, Johnston WK, Garrett-Engele P, Lim LP, Bartel DP: MicroRNA targeting specificity in mammals: determinants beyond seed pairing. Mol Cell 2007, 27:91–105.
OpenUrl CrossRef PubMed Web of Science

[64] 64.
Vergoulis T, Vlachos IS, Alexiou P, Georgakilas G, Maragkakis M, Reczko M, Gerangelos S, Koziris N, Dalamagas T, Hatzigeorgiou AG: TarBase 6.0: capturing the exponential growth of miRNA targets with experimental support. Nucleic Acids Res 2012, 40:D222–229.
OpenUrl CrossRef PubMed Web of Science

[65] 65.
Hsu SD, Lin FM, Wu WY, Liang C, Huang WC, Chan WL, Tsai WT, Chen GZ, Lee CJ, Chiu CM, et al: miRTarBase: a database curates experimentally validated microRNA-target interactions. Nucleic Acids Res 2011, 39:D163–169.
OpenUrl CrossRef PubMed Web of Science

[66] 66.
Krek A, Grun D, Poy MN, Wolf R, Rosenberg L, Epstein EJ, MacMenamin P, da Piedade I, Gunsalus KC, Stoffel M, Rajewsky N: Combinatorial microRNA target predictions. Nat Genet 2005, 37:495–500.
OpenUrl CrossRef PubMed Web of Science

[67] 67.
Essaghir A, Toffalini F, Knoops L, Kallin A, van Helden J, Demoulin JB: Transcription factor regulation can be accurately predicted from the presence of target gene signatures in microarray gene expression data. Nucleic Acids Res 2010, 38:e120.
OpenUrl CrossRef PubMed

[68] 68.
Zhao F, Xuan Z, Liu L, Zhang MQ: TRED: a Transcriptional Regulatory Element Database and a platform for in silico gene regulation studies. Nucleic Acids Res 2005, 33:D103–107.
OpenUrl CrossRef PubMed Web of Science

[69] 69.
Friard O, Re A, Taverna D, De Bortoli M, Cora D: CircuitsDB: a database of mixed microRNA/transcription factor feed-forward regulatory circuits in human and mouse. BMC Bioinformatics 2010, 11:435.
OpenUrl CrossRef PubMed

[70] 70.
Wingender E, Chen X, Hehl R, Karas H, Liebich I, Matys V, Meinhardt T, Pruss M, Reuter I, Schacherer F: TRANSFAC: an integrated system for gene expression regulation. Nucleic Acids Res 2000, 28:316–319.
OpenUrl CrossRef PubMed Web of Science

[71] 71.
Robinson PN, Mundlos S: The human phenotype ontology. Clin Genet 2010, 77:525–534.
OpenUrl CrossRef PubMed Web of Science

[72] 72.
Kuhn M, von Mering C, Campillos M, Jensen LJ, Bork P: STITCH: interaction networks of chemicals and proteins. Nucleic Acids Res 2008, 36:D684–688.
OpenUrl CrossRef PubMed Web of Science

[73] 73.
Smith CL, Eppig JT: The Mammalian Phenotype Ontology as a unifying standard for experimental and high-throughput phenotyping data. Mamm Genome 2012, 23:653–668.
OpenUrl CrossRef PubMed Web of Science

[74] 74.
Lim E, Pon A, Djoumbou Y, Knox C, Shrivastava S, Guo AC, Neveu V, Wishart DS: T3DB: a comprehensively annotated database of common toxins and their targets. Nucleic Acids Res 2010, 38:D781–786.
OpenUrl CrossRef PubMed Web of Science

[75] 75.
Schaefer CF, Anthony K, Krupa S, Buchoff J, Day M, Hannay T, Buetow KH: PID: the Pathway Interaction Database. Nucleic Acids Res 2009, 37:D674–679.
OpenUrl CrossRef PubMed Web of Science

[76] 76.
He X, Chang S, Zhang J, Zhao Q, Xiang H, Kusonmano K, Yang L, Sun ZS, Yang H, Wang J: MethyCancer: the database of human DNA methylation and cancer. Nucleic Acids Res 2008, 36:D836–841.
OpenUrl CrossRef PubMed Web of Science

[77] 77.
Lauss M, Visne I, Weinhaeusel A, Vierlinger K, Noehammer C, Kriegner A: MethCancerDB--aberrant DNA methylation in human cancer. Br J Cancer 2008, 98:816–817.
OpenUrl CrossRef PubMed Web of Science

GSKB: A gene set database for pathway analysis in mouse

ABSTRACT

INTRODUCTION

METHODS

RESULTS AND DISCUSSION

Construction of the knowledgebase

Using the database

In-depth analysis of the expression profile of miR-29 silencing and induction

Identifying transcription factors from gene expression data

Meta-analysis of published gene lists yields insight into blindness and visual perception

CONCLUSION

FUNDING

ACKNOWLEDGEMENTS

References

Citation Manager Formats

Subject Area