Show simple item record

dc.contributor.advisorJeffrey Bailey
dc.contributor.authorHathaway, Nicholas J
dc.date2022-08-11T08:08:46.000
dc.date.accessioned2022-08-23T16:07:59Z
dc.date.available2022-08-23T16:07:59Z
dc.date.issued2018-03-19
dc.date.submitted2018-05-04
dc.identifier.doi10.13028/M2039K
dc.identifier.urihttp://hdl.handle.net/20.500.14038/32357
dc.description.abstractThe rapid development of DNA sequencing technologies has opened up new avenues of research, including the investigation of population structure within infectious diseases (both within patient and between populations). In order to take advantage of these advances in technologies and the generation of new types of data, novel bioinformatics tools are needed that won’t succumb to artifacts introduced by the data generation, and thus provide accurate and precise results. To achieve this goal I have create several tools. First, SeekDeep, a pipeline for analyzing targeted amplicon sequencing datasets from various technologies, is able to achieve 1-base resolution even at low frequencies and read depths allowing for accurate comparison between samples and the detection of important SNPs. Next, PathWeaver, a local haplotype assembler designed for complex infections and highly variable genomic regions with poor reference mapping. PathWeaver is able to create highly accurate haplotypes without generating chimeric assemblies. PathWeaver was used on the key protein in pregnancy-associated malaria Plasmodium falciparum VAR2CSA which revealed population sub-structuring within the key binding domain of the protein observed to be present globally along with confirming copy number variation. Finally, the program Carmen is able to utilize PathWeaver to augment the results from targeted amplicon approaches by reporting where and when local haplotypes have been found previously. These rigorously tested tools allow the analysis of local haplotype data from various technologies and approaches to provide accurate, precise and easily accessible results.
dc.language.isoen_US
dc.publisherUniversity of Massachusetts Medical School
dc.rightsCopyright is held by the author, with all rights reserved.
dc.subjectplasmodium
dc.subjectsequencing
dc.subjectinfectious diseases
dc.subjecttargeted amplicon
dc.subjectBioinformatics
dc.subjectComputational Biology
dc.subjectParasitic Diseases
dc.titleA suite of computational tools to interrogate sequence data with local haplotype analysis within complex ​Plasmodium​ infections and other microbial mixtures
dc.typeDoctoral Dissertation
dc.identifier.legacyfulltexthttps://escholarship.umassmed.edu/cgi/viewcontent.cgi?article=1976&context=gsbs_diss&unstamped=1
dc.identifier.legacycoverpagehttps://escholarship.umassmed.edu/gsbs_diss/970
dc.legacy.embargo2019-05-04T00:00:00-07:00
dc.identifier.contextkey12075262
refterms.dateFOA2022-08-25T04:29:56Z
html.description.abstract<p>The rapid development of DNA sequencing technologies has opened up new avenues of research, including the investigation of population structure within infectious diseases (both within patient and between populations). In order to take advantage of these advances in technologies and the generation of new types of data, novel bioinformatics tools are needed that won’t succumb to artifacts introduced by the data generation, and thus provide accurate and precise results. To achieve this goal I have create several tools.</p> <p>First, SeekDeep, a pipeline for analyzing targeted amplicon sequencing datasets from various technologies, is able to achieve 1-base resolution even at low frequencies and read depths allowing for accurate comparison between samples and the detection of important SNPs. Next, PathWeaver, a local haplotype assembler designed for complex infections and highly variable genomic regions with poor reference mapping. PathWeaver is able to create highly accurate haplotypes without generating chimeric assemblies. PathWeaver was used on the key protein in pregnancy-associated malaria Plasmodium falciparum VAR2CSA which revealed population sub-structuring within the key binding domain of the protein observed to be present globally along with confirming copy number variation. Finally, the program Carmen is able to utilize PathWeaver to augment the results from targeted amplicon approaches by reporting where and when local haplotypes have been found previously.</p> <p>These rigorously tested tools allow the analysis of local haplotype data from various technologies and approaches to provide accurate, precise and easily accessible results.</p>
dc.identifier.submissionpathgsbs_diss/970
dc.contributor.departmentDepartment of Bioinformatics and Integrative Biology
dc.identifier.orcid0000-0001-9639-2894


Files in this item

Thumbnail
Name:
Hathaway_Nicholas_reduced_size.pdf
Size:
3.940Mb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record