GAGE: generally applicable gene set enrichment for pathway analysis
- PMID: 19473525
- PMCID: PMC2696452
- DOI: 10.1186/1471-2105-10-161
Abstract
Background: Gene set analysis (GSA) is a widely used strategy for gene expression data analysis based on pathway knowledge. GSA focuses on sets of related genes and has established major advantages over individual gene analyses, including greater robustness, sensitivity and biological relevance. However, previous GSA methods are of limited use because they cannot handle datasets with different sample sizes or experimental designs.
Results: To address these limitations, we present a new GSA method called Generally Applicable Gene-set Enrichment (GAGE). We successfully apply GAGE to multiple microarray datasets with different sample sizes, experimental designs and profiling techniques. GAGE shows significantly better results than two other commonly used GSA methods, GSEA and PAGE. We demonstrate this improvement in three aspects: (1) consistency across repeated studies/experiments; (2) sensitivity and specificity; (3) biological relevance of the regulatory mechanisms inferred. GAGE reveals novel and relevant regulatory mechanisms from both published and previously unpublished microarray studies. From two published lung cancer data sets, GAGE derived a more cohesive and predictive mechanistic scheme underlying lung cancer progression and metastasis. For a previously unpublished BMP6 study, GAGE predicted novel regulatory mechanisms for BMP6 induced osteoblast differentiation, including the canonical BMP-TGF beta signaling, JAK-STAT signaling, Wnt signaling, and estrogen signaling pathways, all of which are supported by the experimental literature.
Conclusion: GAGE is generally applicable to gene expression datasets with different sample sizes and experimental designs. GAGE consistently outperformed the two most frequently used GSA methods and inferred statistically and biologically more relevant regulatory pathways. The GAGE method is implemented in R in the "gage" package, available under the GNU GPL from http://sysbio.engin.umich.edu/~luow/downloads.php.
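To make the general idea of gene set analysis concrete, the following is a minimal Python sketch of one common style of gene-set test: a two-sample (Welch) t statistic comparing the log fold changes of genes in a set against the remaining genes. It is an illustration only, not the GAGE procedure or the "gage" package API, neither of which is specified in this abstract. The function name `gene_set_t_stat`, the gene identifiers and the toy fold-change values are all hypothetical.

```python
import math
from statistics import mean, variance


def gene_set_t_stat(fold_changes, gene_set):
    """Welch t statistic: log fold changes of set genes vs. all other genes.

    fold_changes: dict mapping gene id -> log fold change (hypothetical input).
    gene_set: set of gene ids belonging to one pathway.
    """
    in_set = [fc for g, fc in fold_changes.items() if g in gene_set]
    background = [fc for g, fc in fold_changes.items() if g not in gene_set]
    if len(in_set) < 2 or len(background) < 2:
        raise ValueError("need at least two genes in the set and in the background")
    # Standard error uses separate variances for the set and the background.
    se = math.sqrt(variance(in_set) / len(in_set)
                   + variance(background) / len(background))
    return (mean(in_set) - mean(background)) / se


# Toy data (made up): 100 genes with log fold changes cycling around zero.
fold_changes = {f"g{i}": [-0.5, 0.0, 0.5][i % 3] for i in range(100)}

# Hypothetical up-regulated pathway: genes g0..g9 get shifted fold changes.
shifted_set = {f"g{i}" for i in range(10)}
for i, gene in enumerate(sorted(shifted_set)):
    fold_changes[gene] = [1.0, 1.5, 2.0][i % 3]

# Hypothetical unperturbed pathway: genes g10..g19 keep their original values.
null_set = {f"g{i}" for i in range(10, 20)}

t_shifted = gene_set_t_stat(fold_changes, shifted_set)
t_null = gene_set_t_stat(fold_changes, null_set)
print(f"shifted set t = {t_shifted:.2f}, null set t = {t_null:.2f}")
```

In this toy setup the shifted set yields a large positive statistic while the unperturbed set stays near zero. A real analysis would convert such statistics to p-values and correct for testing many gene sets.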