Browsing by Subject "Applied Data Science"
Now showing items 1-20 of 118
-
45 Anomaly detection with similarity graphs and active learning Building and storing static and dynamic similarity graphs with the help of a vector database
(2022)Fraudulent transactions of credit cards are a major problem for financial institutions and continues to grow along digital transformation. A conventional view states that fraudulent transactions are anomalies. A novel view ... -
A comparison between Bayesian penalized regression priors: lasso and regularized horseshoe
(2021)A comparison is performed between Bayesian penalized regression priors: the lasso and regularized horseshoe using the statistical programming language R. This study aims to provide researchers with insights into the use ... -
A deep neural network for lake ice detection with Sentinel-1 data
(2022)Ice cover of lakes is an indicator of climate conditions and possible changes thereof. It is therefore identified as an essential climate variable, and tracking its worldwide timing, duration and extent is important. Due ... -
A publication ontology for the CBG library
(2022)A problem of the exponential growth of digital data collection and storage of today, is that this is not done in a standardised way, leading to inconsistencies among data sources even when the subject is the same. These ... -
A Web Crawler for Automated Document Retrieval in Health Policy
(2021)Document retrieval in Health Policy Research is labor-intensive and inefficient. To investigate the efficacy and transparency of health policy processes such as drug approval, reports are manually collected from the websites ... -
Agent-Based Modelling of Trans-Atlantic Bird Flights
(2022)Many North American bird species migrate towards South America to spend the colder seasons. Occasionally, some of the birds that were meant to migrate south arrive in Europe, far outside of their normal habitat. It is ... -
An Implementation and Assessment of Semantic Search Few-Shot Classification
(2021)This thesis compares multiple methods of classification following cosine-similarity calculation from semantic search with Sentence-BERT (SBERT), as well as various class representations in few-shot classification with ... -
Analysing the Social-Economic Impact of Wireless Mobile Services During and Before COVID-19 Using Topic Modelling and Sentiment Analysis on Tweets
(2021)Social media platforms can be used as a data source for measuring public opinion on various topics such as wireless mobile services. Twitter is a suitable platform that is able to map the sentiments. In this research the ... -
Analyzing gender bias in children’s television shows
(2022)In this research gender bias was analyzed in two of the most watched children’s television shows in the Netherlands, Sesamstraat and Het Klokhuis, due to the impressionability of the target audience of these shows. Automated ... -
Analyzing maize price elasticity in the US
(2022)The main goal of this paper is to explore the effect of demand and supply quantity on maize prices by analyzing price elasticity. To examine the price elasticity of supply and demand of maize, multiple linear regression ... -
Anomaly Detection Techniques as a Quality Evaluation of graphs
(2022)The goal of this project is the implementation of PyGQE, a software package that given a graph measures its quality by measuring the possible anomaly detections. The aim of this application is to help data scientists ... -
Anomaly Detection Techniques on relational data as Quality Evaluation of a dataset
(2022)An outlier is a point that deviates significantly from the pattern that has been formed from the majority of the data points. The presence of outliers can exacerbate statistical results which leads to misrepresented ... -
Area comparisons on municipality, neighbourhood, and borough level: Shiny App in R for open data about menities and health
(2022)Open government data can contribute to more transparency and a participatory governance. If the data is neighbourhood level data, it can support community-led actions and lead to changes in communities. However, open data ... -
Assessing Credibility in Online Sexual Health Information based on the level of Readability, Sensational Tone and the use of Dutch Swearing Words
(2022)Today, for most adolescents the use of social networking sites plays a major role in their lives. However, the great level of accessibility to these networking sites makes it very easy for anyone to publish information ... -
Automated summary scoring using a linguistic feature approach
(2021)Summary-writing tasks are often used to assess reading comprehension of students. Grading these types of tasks is time-consuming and teachers have difficulty being consistent when grading. The goal of this research is ... -
Automatic Grading of CITO Mathematics Tests
(2022)This thesis attempts to discover whether it is possible to do automatic grading of CITO mathematics tests using Optical Character Recognition (OCR) methods, among others. It is part of a cooperation between three students, ... -
Can a 𝙿𝚢𝚝𝚑𝚘𝚗 (package) do what 𝚖𝚒𝚌𝚎 can?
(2022)Missing data frequently complicate data analysis. Multiple imputation is a well known and robust technique for addressing missing data. In R, multiple imputation is commonly implemented through the mice package which ...