Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorNguyen, Dong
dc.contributor.authorMohamed, S.
dc.date.accessioned2020-09-03T18:00:13Z
dc.date.available2020-09-03T18:00:13Z
dc.date.issued2020
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/37415
dc.description.abstractIn this research, topic segmentation in texts (a.k.a. text segmentation) is used as a proxy for topic segmentation in videos. The main application is automatically providing a topic transition structure for videos, because it is difficult to quickly scan them and figure out where a new subject starts. Topic models are used to figure out the topic transition positions. The available data for this research is provided by the Netherlands Institute for Sound and Vision and consists of 25,600 transcripts and subtitles of the same Dutch news broadcasts. The research questions whether it is better to use automatic speech recognition transcripts or subtitles when segmenting a video based on topics.The subtitles and speech transcripts were compared for the same news broadcasts and both qualitative and quantitative differences between them were found. However, no significant difference was found between the performance of the text segmentation algorithm using subtitles and speech transcripts. The research presents the challenges and benefits of the developed text segmentation algorithm. The research can give insight into the realizability of the application of text segmentation to help structure videos, which can become a starting point for future research.
dc.description.sponsorshipUtrecht University
dc.format.extent541492
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.titleOn the effects of using speech transcripts and subtitles to detect topic shifts in news broadcasts
dc.type.contentBachelor Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordstopic segmentation, news broadcasts, ASR, subtitles
dc.subject.courseuuKunstmatige Intelligentie


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record