Interactive Explanation of High-dimensional Data Projections

Thijssen, Julian

dc.rights.license	CC-BY-NC-ND
dc.contributor.advisor	Telea, Alex
dc.contributor.author	Thijssen, Julian
dc.date.accessioned	2022-07-19T00:01:37Z
dc.date.available	2022-07-19T00:01:37Z
dc.date.issued	2022
dc.identifier.uri	https://studenttheses.uu.nl/handle/20.500.12932/41799
dc.description.abstract	Companies, institutions and researchers around the world are collecting enormous sets of high-dimensional data at breakneck speed. However, our understanding of the collected data is not nearly keeping up. One of the main approaches to understanding these datasets has been to reduce the data to a low-dimensional representation, called a projection, that can subsequently be visualised. Seeing visible patterns in these projections indicates there are relationships between the dimensions of the high-dimensional data. However, it does not tell us anything about what those relationships are. Several efforts have previously been made to explain the patterns in the projection in terms of their original dimensions. However, they tend to fall short in adequately explaining them, or the techniques don't scale well to a higher number of dimensions. Therefore, this thesis aims to answer the question how to adequately explain these patterns in projections of high-dimensional data, while simultaneously scaling better than previous techniques in the number of data dimensions. We extend the variance-based explanations of previous work with a value-based explanation, that gives insight into, not only why the patterns are there, but what they represent. Furthermore, we introduce a user-driven exploration mechanism that provides significantly more detailed explanations of regions in the projection. In addition, these explanations are augmented by a number of tools that support their function. We integrate all of the above elements into a visualisation solution for exploring high-dimensional data projections. We assess the visualisation system using an evaluation study asking a mix of 23 experts and non-experts to analyze several datasets of increasing dimensionality (12, 31, 58) using the proposed solution, as well as their opinion on the usefulness of each of the elements of the visualisation solution. Participants rated each of the elements of the visualisation system highly in terms of their usefulness. In addition, with minimal training and by overwhelming majority, participants answered correctly to a series of twelve control questions meant to test whether they understood how to read the explanations generated by the visualisation system. On a series of nine more complex analysis questions, where participants had to use the system themselves, the majority gave answers that strongly aligned with our analysis. This indicates use of the system results in consistent insights about the data with only minor training or expertise required. Overall, the evaluation study indicates that our visualisation solution is capable of providing detailed and consistent explanations of patterns in data projections, even as the dimensionality of the data gets higher.
dc.description.sponsorship	Utrecht University
dc.language.iso	EN
dc.subject	A common tool in the analysis of high-dimensional data is projecting it to a low-dimensional projection. Unfortunately, such a projection says very little about the original data. Several explanatory mechanisms have been proposed to fix this, however none are quite satisfactory. We propose a novel interactive explanatory mechanism allowing for detailed analysis of high-dimensional projections.
dc.title	Interactive Explanation of High-dimensional Data Projections
dc.type.content	Master Thesis
dc.rights.accessrights	Open Access
dc.subject.keywords	high-dimensional, analysis, projections, explanation, interactive
dc.subject.courseuu	Game and Media Technology
dc.thesis.id	5809

Files in this item

Name:: Interactive_Explanation_of_Hig ...
Size:: 21.15Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Theses

Show simple item record

Interactive Explanation of High-dimensional Data Projections

Files in this item

This item appears in the following Collection(s)

Related items

Visual exploration of the generalizationof neural network projections ﻿

Evaluating and improving collaboration in ProRail's project alliances for large and complex infrastructure projects ﻿

Heroines! Examining the effect of counter-stereotypical role models on gender stereotypes in middle childhood through art content analysis and teachers’ perspectives ﻿

Visual exploration of the generalizationof neural network projections

Evaluating and improving collaboration in ProRail's project alliances for large and complex infrastructure projects

Heroines! Examining the effect of counter-stereotypical role models on gender stereotypes in middle childhood through art content analysis and teachers’ perspectives