dc.rights.license | CC-BY-NC-ND | |
dc.contributor.advisor | Steenbeek, Frank van | |
dc.contributor.author | Rumpt, Marilijn van | |
dc.date.accessioned | 2024-12-01T00:01:39Z | |
dc.date.available | 2024-12-01T00:01:39Z | |
dc.date.issued | 2024 | |
dc.identifier.uri | https://studenttheses.uu.nl/handle/20.500.12932/48197 | |
dc.description.abstract | Canine genetics plays a pivotal role in unraveling the genetic basis of inherited diseases in both dogs and humans. Single-nucleotide polymorphisms (SNPs) are a robust tool for genetic research, and with the development of commercial SNP arrays and increasing research activity, the amount of available canine SNP data has grown tremendously. Combining and (re)using these data increases sample sizes and, consequently, the power of genetic studies. However, the utility of multi-source SNP data depends on effective data harmonization and quality control (QC) procedures. These include removing poor-quality samples based on low sample call rates and excessive heterozygosity, and detecting duplicates, relationships between dogs, phenotyping errors, and potential sample swaps by checking each dog’s identity based on sex, breed, and kinship. In total, data from approximately 19,000 dogs from 5 different platforms and Whole Genome Sequencing datasets were analyzed and merged. Recognizing the limitations of readily applying QC thresholds from human research to canine SNP data, this project aims to explore the data preprocessing and QC steps essential for ensuring high-quality and accurately phenotyped canine SNP data, in order to establish a SNP reference database that advances genetic research in canines. | |
dc.description.sponsorship | Utrecht University | |
dc.language.iso | EN | |
dc.subject | The growing amount of canine SNP data can be used for research on inherited diseases in dogs and humans. Combining and (re)using these data enhances sample sizes and the power of genetic studies. However, the utility of multi-source SNP data depends on effective data harmonization and quality control (QC) procedures. This project aims to explore data preprocessing and QC steps essential for ensuring high-quality and accurately phenotyped SNP data, to establish a SNP reference database. | |
dc.title | Building a Canine SNP Reference Database: Data Preprocessing and Quality Control Procedures | |
dc.type.content | Master Thesis | |
dc.rights.accessrights | Open Access | |
dc.subject.keywords | canine genetics;dog;merging datasets;quality control;SNP database | |
dc.subject.courseuu | Bioinformatics and Biocomplexity | |
dc.thesis.id | 29539 | |