dc.rights.license | CC-BY-NC-ND | |
dc.contributor.advisor | Akdag, Almila | |
dc.contributor.author | Loddo, Nicolò | |
dc.date.accessioned | 2024-11-15T01:03:19Z | |
dc.date.available | 2024-11-15T01:03:19Z | |
dc.date.issued | 2024 | |
dc.identifier.uri | https://studenttheses.uu.nl/handle/20.500.12932/48149 | |
dc.description.abstract | Modern Artificial Agents will increasingly leverage AI speech synthesis models to verbally communicate with their users. This study explores the integration of breathing patterns into synthesized speech and their potential to deepen empathy towards said agents, testing the hypothesis that the inclusion of breathing capabilities can significantly enhance the emotional connection in human-AI interaction.
Breathing patterns have not been unequivocally linked to human emotional states, but respiration has been consistently proven to be involved in emotions' appraisal and regulation, and literature suggests that an inestimable expressive potential may lie behind respiratory noises and their rhythm. Despite this, breathing is hardly involved in speech synthesis models, and literature on breathing agents is still limited.
We first perform a thorough evaluation of open-source and commercial Speech Synthesis models to understand the breathing synthesis capabilities of state-of-the-art architectures. We then proceed to assess the influence of breathing on the capacity of the voice to evoke empathy. The research methodologically diverges from traditional empathy studies by proposing to the subjects the resolution of an emotional dilemma within a cooperative game scenario, where they face a choice reflecting their empathic engagement with an AI partner.
The findings indicate that breathing in synthesized speech significantly enhances agents' perceived naturalness and users' empathy towards them. These insights underscore the importance of breathing in speech synthesis for AI design and call for its consideration in future models and interactive Artificial Agents. Ultimately, the study aims to contribute to the development of a more empathetic digital world through enhanced human-AI interaction. | |
dc.description.sponsorship | Utrecht University | |
dc.language.iso | EN | |
dc.subject | The thesis studies the impact of breathing instances in fostering empathy from users towards speaking non-embodied Virtual Agents. To do so, it evaluates the integration of breathing in State of The Art speech synthesis models, and then tests the breathing feature in a gamified experiment with human subjects. | |
dc.title | What if HAL breathed? Enhancing Empathy in Human-AI Interactions with Breathing Speech Synthesis | |
dc.type.content | Master Thesis | |
dc.rights.accessrights | Open Access | |
dc.subject.keywords | Human-Computer Interaction; Human-AI Interaction; Affective Computing; Speech Synthesis; Text-to-Speech; Breathing; Empathy; Respiration and Emotion; Virtual Agent | |
dc.subject.courseuu | Human-Computer Interaction | |
dc.thesis.id | 26875 | |