Show simple item record

dc.contributor.authorHoldorf, Amy D.
dc.contributor.authorHiggins, Daniel P.
dc.contributor.authorHart, Anne C.
dc.contributor.authorBoag, Peter R.
dc.contributor.authorPazour, Gregory J.
dc.contributor.authorWalhout, Albertha J M
dc.contributor.authorWalker, Amy K
dc.date2022-08-11T08:08:24.000
dc.date.accessioned2022-08-23T15:53:46Z
dc.date.available2022-08-23T15:53:46Z
dc.date.issued2019-11-15
dc.date.submitted2019-11-27
dc.identifier.citation<p>bioRxiv 844928; doi: https://doi.org/10.1101/844928. <a href="https://doi.org/10.1101/844928" target="_blank">Link to preprint on bioRxiv service.</a></p>
dc.identifier.doi10.1101/844928
dc.identifier.urihttp://hdl.handle.net/20.500.14038/29418
dc.description.abstractThe emergence of large gene expression datasets has revealed the need for improved tools to identify enriched gene categories and visualize enrichment patterns. While Gene Ontogeny (GO) provides a valuable tool for gene set enrichment analysis, it has several limitations. First, it is difficult to graphically compare multiple GO analyses. Second, genes from some model systems are not well represented. For example, around 30% of Caenorhabditis elegans genes are missing from analysis in commonly used databases. To allow categorization and visualization of enriched C. elegans gene sets in different types of genome-scale data, we developed WormCat, a web-based tool that uses a near-complete annotation of the C. elegans genome to identify co-expressed gene sets and scaled heat map for enrichment visualization. We tested the performance of WormCat using a variety of published transcriptomic datasets and show that it reproduces major categories identified by GO. Importantly, we also found previously unidentified categories that are informative for interpreting phenotypes or predicting biological function. For example, we analyzed published RNA-seq data from C. elegans treated with combinations of lifespan-extending drugs where one combination paradoxically shortened lifespan. Using WormCat, we identified sterol metabolism as a category that was not enriched in the single or double combinations but emerged in a triple combination along with the lifespan shortening. Thus, WormCat identified a gene set with potential phenotypic relevance that was not uncovered with previous GO analysis. In conclusion, WormCat provides a powerful tool for the analysis and visualization of gene set enrichment in different types of C. elegans datasets.
dc.language.isoen_US
dc.relation<p>Now published in <em>Genetics</em>, <a href="https://doi.org/10.1534/genetics.119.302919" target="_blank" title="10.1534/genetics.119.302919">10.1534/genetics.119.302919</a></p>
dc.rightsThe copyright holder for this preprint (which was not peer-reviewed) is the author/funder. It is made available under a CC-BY-NC 4.0 International license.
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/
dc.subjectBioinformatics
dc.subjectWormCat
dc.subjectCaenorhabditis elegans
dc.subjectdatasets
dc.subjectBiochemical Phenomena, Metabolism, and Nutrition
dc.subjectBioinformatics
dc.subjectGenetic Phenomena
dc.subjectGenetics and Genomics
dc.subjectNucleic Acids, Nucleotides, and Nucleosides
dc.titleWormCat: an online tool for annotation and visualization of Caenorhabditis elegans genome-scale data [preprint]
dc.typePreprint
dc.source.journaltitlebioRxiv
dc.identifier.legacyfulltexthttps://escholarship.umassmed.edu/cgi/viewcontent.cgi?article=2660&amp;context=faculty_pubs&amp;unstamped=1
dc.identifier.legacycoverpagehttps://escholarship.umassmed.edu/faculty_pubs/1646
dc.identifier.contextkey15882227
refterms.dateFOA2022-08-23T15:53:46Z
html.description.abstract<p>The emergence of large gene expression datasets has revealed the need for improved tools to identify enriched gene categories and visualize enrichment patterns. While Gene Ontogeny (GO) provides a valuable tool for gene set enrichment analysis, it has several limitations. First, it is difficult to graphically compare multiple GO analyses. Second, genes from some model systems are not well represented. For example, around 30% of Caenorhabditis elegans genes are missing from analysis in commonly used databases. To allow categorization and visualization of enriched C. elegans gene sets in different types of genome-scale data, we developed WormCat, a web-based tool that uses a near-complete annotation of the C. elegans genome to identify co-expressed gene sets and scaled heat map for enrichment visualization. We tested the performance of WormCat using a variety of published transcriptomic datasets and show that it reproduces major categories identified by GO. Importantly, we also found previously unidentified categories that are informative for interpreting phenotypes or predicting biological function. For example, we analyzed published RNA-seq data from C. elegans treated with combinations of lifespan-extending drugs where one combination paradoxically shortened lifespan. Using WormCat, we identified sterol metabolism as a category that was not enriched in the single or double combinations but emerged in a triple combination along with the lifespan shortening. Thus, WormCat identified a gene set with potential phenotypic relevance that was not uncovered with previous GO analysis. In conclusion, WormCat provides a powerful tool for the analysis and visualization of gene set enrichment in different types of C. elegans datasets.</p>
dc.identifier.submissionpathfaculty_pubs/1646
dc.contributor.departmentProgram in Molecular Medicine
dc.contributor.departmentProgram in Systems Biology


Files in this item

Thumbnail
Name:
844928.full.pdf
Size:
3.562Mb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record

The copyright holder for this preprint (which was not peer-reviewed) is the author/funder. It is made available under a CC-BY-NC 4.0 International license.
Except where otherwise noted, this item's license is described as The copyright holder for this preprint (which was not peer-reviewed) is the author/funder. It is made available under a CC-BY-NC 4.0 International license.