Loading...
Thumbnail Image
Publication

WormCat 2.0 defines characteristics and conservation of poorly annotated genes in Caenorhabditis elegans [preprint]

Higgins, Daniel P.
Weisman, Caroline M.
Lui, Dominique
D’Agostino, Frank A.
Walker, Amy K
Embargo Expiration Date
Link to Full Text
Abstract

Genome-wide measurement of mRNA or protein levels provides broad data sets for biological discovery. However, subsequent computational methods are essential for uncovering the functional implications of the data as well as intuitively visualizing the findings. Current computational tools are biased toward well-described pathways, limiting their utility for novel discovery. Recently, we developed an annotation and category enrichment tool for Caenorhabditis elegans genomic data, WormCat, that provides an intuitive visualization output. Unlike GO, which excludes genes with no annotation information, WormCat 2.0 retains these genes as a special UNASSIGNED category. Here, we show that the UNASSIGNED gene category enrichment exhibits tissue-specific expression patterns and include genes with biological functions. Poorly annotated genes have previously been considered to lack homologs in closely related species. Instead, we find that around 3% of the UNASSIGNED genes have poorly characterized human orthologs. These human orthologs are themselves have little annotation information. A recently developed method that incorporates lineage relationships (abSENSE) indicates that failure of BLAST to detect homology explains the apparent lineage specificity for many UNASSIGNED genes, suggesting that a larger subset could be related to human genes. WormCat provides an annotation strategy that allows association of UNASSIGNED genes with specific phenotypes and known pathways. Our analysis indicates that the UNASSIGNED gene category contains candidates that merit further functional study which could yield insight into understudied areas of biology.

Source

bioRxiv 2021.11.11.467968; doi: https://doi.org/10.1101/2021.11.11.467968. Link to preprint on bioRxiv.

Year of Medical School at Time of Visit
Sponsors
Dates of Travel
DOI
10.1101/2021.11.11.467968
PubMed ID
Other Identifiers
Notes

This article is a preprint. Preprints are preliminary reports of work that have not been certified by peer review.

Funding and Acknowledgements
Corresponding Author
Related Resources

Now published in Genetics doi: 10.1093/genetics/iyac085

Related Resources
Repository Citation
Rights
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-ND 4.0 International license.