Analysis of 6.4 million SARS-CoV-2 genomes identifies mutations associated with fitness [preprint]
dc.contributor.author | Obermeyer, Fritz | |
dc.contributor.author | Jankowiak, Martin | |
dc.contributor.author | Barkas, Nikolaos | |
dc.contributor.author | Schaffner, Stephen F | |
dc.contributor.author | Pyle, Jesse D. | |
dc.contributor.author | Yurkovetskiy, Lonya | |
dc.contributor.author | Bosso, Matteo | |
dc.contributor.author | Park, Daniel J. | |
dc.contributor.author | Babadi, Mehrtash | |
dc.contributor.author | MacInnis, Bronwyn L. | |
dc.contributor.author | Luban, Jeremy | |
dc.contributor.author | Sabeti, Pardis C. | |
dc.contributor.author | Lemieux, Jacob E. | |
dc.date | 2022-08-11T08:08:34.000 | |
dc.date.accessioned | 2022-08-23T15:59:43Z | |
dc.date.available | 2022-08-23T15:59:43Z | |
dc.date.issued | 2022-02-16 | |
dc.date.submitted | 2022-05-26 | |
dc.identifier.citation | <p>Obermeyer F, Jankowiak M, Barkas N, Schaffner SF, Pyle JD, Yurkovetskiy L, Bosso M, Park DJ, Babadi M, MacInnis BL, Luban J, Sabeti PC, Lemieux JE. Analysis of 6.4 million SARS-CoV-2 genomes identifies mutations associated with fitness. medRxiv [Preprint]. 2022 Feb 16:2021.09.07.21263228. doi: 10.1101/2021.09.07.21263228. Update in: Science. 2022 May 24;:abm1208. PMID: 35194619; PMCID: PMC8863165. <a href="https://doi.org/10.1101/2021.09.07.21263228" target="_blank" title="view preprint in medrxiv">Link to preprint on medRxiv.</a></p> | |
dc.identifier.doi | 10.1101/2021.09.07.21263228 | |
dc.identifier.pmid | 35194619 | |
dc.identifier.uri | http://hdl.handle.net/20.500.14038/30735 | |
dc.description | <p>This article is a preprint. Preprints are preliminary reports of work that have not been certified by peer review.</p> | |
dc.description.abstract | Repeated emergence of SARS-CoV-2 variants with increased fitness necessitates rapid detection and characterization of new lineages. To address this need, we developed PyR0, a hierarchical Bayesian multinomial logistic regression model that infers relative prevalence of all viral lineages across geographic regions, detects lineages increasing in prevalence, and identifies mutations relevant to fitness. Applying PyR0 to all publicly available SARS-CoV-2 genomes, we identify numerous substitutions that increase fitness, including previously identified spike mutations and many non-spike mutations within the nucleocapsid and nonstructural proteins. PyR0 forecasts growth of new lineages from their mutational profile, identifies viral lineages of concern as they emerge, and prioritizes mutations of biological and public health concern for functional characterization. | |
dc.language.iso | en_US | |
dc.relation | <p>Now published in <em>Science</em> doi: <a href="https://doi.org/10.1126/science.abm1208" target="_blank">10.1126/science.abm1208</a></p> <p><a href="https://pubmed.ncbi.nlm.nih.gov/35194619/" target="_blank" title="view preprint in PubMed">View preprint in PubMed</a></p> | |
dc.rights | The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-ND 4.0 International license. | |
dc.rights.uri | http://creativecommons.org/licenses/by-nd/4.0/ | |
dc.subject | Epidemiology | |
dc.subject | SARS-CoV-2 variants | |
dc.subject | mutations | |
dc.subject | viral lineages | |
dc.subject | fitness | |
dc.subject | Epidemiology | |
dc.subject | Genetics and Genomics | |
dc.subject | Immunology and Infectious Disease | |
dc.subject | Infectious Disease | |
dc.subject | Microbiology | |
dc.subject | Virus Diseases | |
dc.title | Analysis of 6.4 million SARS-CoV-2 genomes identifies mutations associated with fitness [preprint] | |
dc.type | Preprint | |
dc.source.journaltitle | medRxiv | |
dc.identifier.legacyfulltext | https://escholarship.umassmed.edu/cgi/viewcontent.cgi?article=3239&context=faculty_pubs&unstamped=1 | |
dc.identifier.legacycoverpage | https://escholarship.umassmed.edu/faculty_pubs/2206 | |
dc.identifier.contextkey | 29379126 | |
refterms.dateFOA | 2022-08-23T15:59:43Z | |
html.description.abstract | <p>Repeated emergence of SARS-CoV-2 variants with increased fitness necessitates rapid detection and characterization of new lineages. To address this need, we developed PyR<sub>0</sub>, a hierarchical Bayesian multinomial logistic regression model that infers relative prevalence of all viral lineages across geographic regions, detects lineages increasing in prevalence, and identifies mutations relevant to fitness. Applying PyR<sub>0</sub> to all publicly available SARS-CoV-2 genomes, we identify numerous substitutions that increase fitness, including previously identified spike mutations and many non-spike mutations within the nucleocapsid and nonstructural proteins. PyR<sub>0</sub> forecasts growth of new lineages from their mutational profile, identifies viral lineages of concern as they emerge, and prioritizes mutations of biological and public health concern for functional characterization.</p> | |
dc.identifier.submissionpath | faculty_pubs/2206 | |
dc.contributor.department | Program in Molecular Medicine |