        Utrecht University Student Theses Repository

        Voice Quality: An empirical assessment & a computational model

        View/Open
        thesis.pdf (582.1Kb)
        draft_v4.8.pdf (582.1Kb)
        Publication date
        2016
        Author
        Androutsos, D.
        Summary
        Current objective assessments of speech signals show little correlation with the listener's perceived voice quality (VQ), that is, with their quality of experience. To address this gap in our knowledge of the voice, a survey was conducted among 102 listeners, who each provided Self-Assessment Manikin (SAM) ratings on 100 (i.e., 4 × 25) speech samples of two males and two females. These samples were either high quality or degraded by pink noise, impulse noise, packet loss, or bandwidth reduction. A repeated-measures analysis of variance (ANOVA) on the obtained SAM ratings, speaker gender, and signal quality revealed that the listeners preferred one female voice and that degradations influence the SAM ratings. The SAM ratings were also compared with the Perceptual Objective Listening Quality Assessment (POLQA) of the International Telecommunication Union Telecommunication Standardization Sector (ITU-T), which proved to handle the degradations excellently but was unable to assess VQ adequately. To resolve POLQA's weak spot, we developed initial computational models, founded solely on paralinguistic parameters. These models correctly predicted VQ in 87.84% (4 levels) and 70.58% (8 levels) of the cases. Unknown speakers' VQ was predicted correctly in 88.71% (4 levels) and 70.42% (8 levels) of the cases. The results of this empirical study emphasize that VQ is a complex, multidimensional construct, which is influenced by several types of common noise. Moreover, they show that ITU-T's POLQA can be provided with an add-on that enables it to predict VQ as well. As such, this study provides a major step towards understanding VQ and including it in ITU-T's standards.
        URI
        https://studenttheses.uu.nl/handle/20.500.12932/23939
        Collections
        • Theses