Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorAdriaans, F.W.
dc.contributor.authorLambooij, N.L.C.
dc.date.accessioned2017-09-06T17:02:21Z
dc.date.available2017-09-06T17:02:21Z
dc.date.issued2017
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/27440
dc.description.abstractIn speech recognition, Neural Networks are used to recognise the sequence of phonemes in an audio signal. These networks are trained on audio data pre-processed into some (type of) spectral vector. We present an alternative method that pre-processes speech utterances into visual representations, called spectrograms, and train a neural network suitable for image recognition to identify phonemes. The resulting network was able to classify 99.73% of a set of vowels containing samples of ‘iy’, ‘ah’ and ‘uw’ correctly, 91.87% of a set of vowels containing samples ‘iy’, ‘ih’ and ‘eh’, and 75.97% of the full dataset of twelve vowels. These results show that using image recognition in automatic speech recognition is worth further investigating.
dc.description.sponsorshipUtrecht University
dc.format.extent893431
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.titleApplying Image Recognition to Automatic Speech Recognition: Determining Suitability of Spectrograms for Training a Deep Neural Network for Speech Recognition
dc.type.contentBachelor Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsSpeech Recognition, Neural Network, Spectrogram, Image Recognition,
dc.subject.courseuuKunstmatige Intelligentie


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record