Show simple item record

dc.rights.license: CC-BY-NC-ND
dc.contributor.advisor: Feelders, A.J.
dc.contributor.advisor: Francioli, L.C.
dc.contributor.author: Cretu Stancu, M.
dc.date.accessioned: 2014-09-16T17:01:09Z
dc.date.available: 2014-09-16T17:01:09Z
dc.date.issued: 2014
dc.identifier.uri: https://studenttheses.uu.nl/handle/20.500.12932/18344
dc.description.abstract: We address the problem of accurately and efficiently identifying de-novo mutations in the human germline. More precisely, how can we detect de-novo point mutations on the sex chromosome in a robust yet sensible manner? What challenges arise from the quality of the data available for this chromosome? What is the pattern of de-novo events on this chromosome compared to the rest of our genome? The difficulty in devising a discovery method for such events lies in their rarity relative to the error rates of the technology used to read DNA. We discuss the relevance of this research in light of our increasing understanding of evolution and of our genetic code’s structure and function, as well as its practical application in finding genetic disease risk factors. We present the analysis methods and technologies most widely used in the field today, and describe each step that influences the design or performance of the model we implement. We present a straightforward yet efficient general model for de-novo mutation discovery and then show how the model must be adapted to correctly capture the particularities of this chromosome. Furthermore, we illustrate what information our model can explain and where we still need to apply domain knowledge to correct the output. Finally, we show how the model is integrated into the complex, modular analysis pipeline used in the community.
dc.description.sponsorship: Utrecht University
dc.language.iso: en
dc.title: A framework for de-novo mutations discovery in Next Generation Sequencing data
dc.type.content: Master Thesis
dc.rights.accessrights: Open Access
dc.subject.keywords: Bayesian model, next generation sequencing, large amounts of data, Java, machine learning
dc.subject.courseuu: Computing Science