Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorSteenbeek, Frank van
dc.contributor.authorRumpt, Marilijn van
dc.date.accessioned2024-12-01T00:01:39Z
dc.date.available2024-12-01T00:01:39Z
dc.date.issued2024
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/48197
dc.description.abstractCanine genetics play a pivotal role in unraveling the genetic basis of inherited diseases in both dogs and humans. Single-nucleotide polymorphisms (SNPs) are a robust tool for genetic research and with the development of commercial SNP arrays and increasing research activities the available amount of canine SNP data has grown tremendously. Combining and (re)using these data enhances sample sizes and, consequently, the power of genetic studies. However, the utility of multi-source SNP data depends on effective data harmonization and quality control (QC) procedures. This includes removal of poor-quality samples based on low sample call rates and excessive heterozygosity, and detection of duplicates, relationships between dogs, phenotyping errors or potential sample swaps by checking the dog’s identity based on sex, breed, and kinship. In total, data from approximately 19,000 dogs from 5 different platforms and Whole Genome Sequencing datasets were analyzed and merged. Recognizing the limitations of readily applying QC thresholds from human research to canine SNP data, this project aims to explore data preprocessing and QC steps essential for ensuring high-quality and accurately phenotyped canine SNP data, to establish a SNP reference database to advance genetic research in canines.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectThe growing amount of canine SNP data can be used for research on inherited diseases in dogs and humans. Combining and (re)using these data enhances sample sizes and the power of genetic studies. However, the utility of multi-source SNP data depends on effective data harmonization and quality control (QC) procedures. This project aims to explore data preprocessing and QC steps essential for ensuring high-quality and accurately phenotyped SNP data, to establish a SNP reference database.
dc.titleBuilding a Canine SNP Reference Database: Data Preprocessing and Quality Control Procedures
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordscanine genetics;dog;merging datasets;quality control;SNP database
dc.subject.courseuuBioinformatics and Biocomplexity
dc.thesis.id29539


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record