View Item 
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        JavaScript is disabled for your browser. Some features of this site may not work without it.

        Browse

        All of UU Student Theses RepositoryBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

        Machine-Learning-Based Dimensionality Assessment for Cognitive Diagnosis Models

        Thumbnail
        View/Open
        ADS thesis (7013590).pdf (896.8Kb)
        Publication date
        2025
        Author
        Neele, Rosalie
        Metadata
        Show full item record
        Summary
        This research aimed to select, tune, and interpret a supervised Machine Learning (ML) model in search of a model that could correctly predict the number of attributes, i.e., dimensionality assessment, in Cognitive Diagnosis Models (CDMs). These objectives were achieved by benchmarking various supervised ML algorithms, tuning the best-performing model, and applying interpretable ML in the form of counterfactual explanations. A large-scale simulated dataset of 607,579 observations and 946 predictors was used in this research. Feature selection was used to reduce the number of predictors to 142, after which the analysis was performed. An ensemble model combining random forest and XGBoost performed best among other supervised ML models, with a validation accuracy of 56.0%. Hyperparameter tuning using Model-Based Optimisation did not further increase the accuracy of the model. The final model evaluation on unseen test data achieved an accuracy of 56.1%. Generated counterfactuals revealed relevant predictors influencing model predictions and showed that, on average, 26 predictors needed to be altered to correct misclassifications. Despite limitations in model performance, the chosen model still provided meaningful improvement over the baseline of 11% and the counterfactuals offered insight into the complexity of dimensionality assessment in CDMs.
        URI
        https://studenttheses.uu.nl/handle/20.500.12932/49910
        Collections
        • Theses

        Related items

        Showing items related by title, author, creator and subject.

        • Modeling dual-task performance: do individualized models predict dual-task performance better than average models? 

          Cao, W. (2017)
          Understanding multitasking can be a complicated venture. The goal of this paper is to see whether using individual parameters for modeling dual-task will lead to better predictions of individual performance compared to ...
        • Modelling Wastewater Quantity and Quality in Mexico -- using an agent-based model 

          Chen, Y. (2021)
          Wastewater is a key element in regional and global water circles, and the discharge of a large quantity of untreated wastewater is posing serious threats to the environment and public health in Mexico. To have a thorough ...
        • Modelling offshore wind in the IMAGE/TIMER model 

          Gernaat, D.E.H.J. (2012)
          Current global energy consumption is expected to continue to grow as the global population is likely to increase towards 9 billion in 2050 while income levels per capita surge with 3-5% per year. Resource depletion, climate ...
        Utrecht university logo