Artificial intelligence for the predication of demographics from unsupervised eye tracking data
Summary
Background and motivation. Regular eye-tracking studies rarely report data from more than a
hundred participants. This is mostly due to the considerable effort involved in assessing data, given
that usually only one participant can be tested at a time and an experimenter needs to be present and
eye-tracking equipment is relatively expensive. In the present thesis, I present data obtained from an
eye-tracker that was available to the public at the NEMO science museum Amsterdam. As part of a
display at the museum, gaze data from more than a thousand participants was assessed. This is a
unique opportunity for improving the breadth and depth of behavioral studies while taking into
account the potential differences between different individuals. However, it is not known how much
the data quality is impacted by the limitations of the eye-tracker, untrained participants, and the
unsupervised experiment. Therefore, it is not clear what we can learn from this experiment and
whether similar installations may be a way forward towards assessing huge numbers or participants,
potentially revealing differences in gaze behavior. Furthermore, to maximize the impact of these
experiments new automated data analysis approaches are needed to classify gaze events and to
process such a large amount of data. While a large number of such algorithm exist, it is important to
investigate which one of them performs the best and how that choice affects the results of data
analysis and, thus, the conclusions. The other point is that the large amount of data might compensate
the potentially lower quality of data.
The aim of this graduation assignment is to use freely available coding tools to analyze a unique and
large dataset collected with unsupervised eye-tracking experiment. Further, the data will allow for
explorative analyses into the relationship of central gaze parameters and demographic variables. Data
analysis approaches are inspired by different research fields including image processing and machine
learning