Gene set enrichment analysis: performance evaluation and usage guidelines
UMass Chan Affiliations
Program in Bioinformatics and Integrative BiologyDepartment of Biochemistry and Molecular Pharmacology
Document Type
Journal ArticlePublication Date
2012-05-01Keywords
AlgorithmsComputational Biology
Databases, Genetic
Gene Expression
Guidelines as Topic
Humans
Bioinformatics
Computational Biology
Molecular Biology
Systems Biology
Metadata
Show full item recordAbstract
A central goal of biology is understanding and describing the molecular basis of plasticity: the sets of genes that are combinatorially selected by exogenous and endogenous environmental changes, and the relations among the genes. The most viable current approach to this problem consists of determining whether sets of genes are connected by some common theme, e.g. genes from the same pathway are overrepresented among those whose differential expression in response to a perturbation is most pronounced. There are many approaches to this problem, and the results they produce show a fair amount of dispersion, but they all fall within a common framework consisting of a few basic components. We critically review these components, suggest best practices for carrying out each step, and propose a voting method for meeting the challenge of assessing different methods on a large number of experimental data sets in the absence of a gold standard.Source
Brief Bioinform. 2012 May;13(3):281-91. doi: 10.1093/bib/bbr049. Epub 2011 Sep 7. Link to article on publisher's site
DOI
10.1093/bib/bbr049Permanent Link to this Item
http://hdl.handle.net/20.500.14038/25880PubMed ID
21900207Related Resources
ae974a485f413a2113503eed53cd6c53
10.1093/bib/bbr049