"What Has Been Said Cannot Be Taken Back": A Toxic Speech Detection Framework for TikTok using Whisper and Perspective API
Summary
["""On social media platforms, such as TikTok, toxic speech is a common problem. With a focus on
videos from the 2020 US presidential election, this study suggests a framework for spotting toxic
speech in TikTok videos. For the purpose of transcribing and analyzing spoken content in TikTok
videos, the framework combines a speech-to-text algorithm and a toxicity detection API.
The findings show that TikTok videos have varying amounts of toxic speech, with the majority of
texts scoring low for toxicity. With the help of BERTopic, semantic characteristics extraction,
dominant topics like Joe Biden's actions and discussions of race and politics are identified. Sentiment
analysis shows different emotional tones across topics. It is also shown that there may be a correlation
between some sentiments and higher levels of toxicity by looking at the relationship between toxicity
and sentiment. These findings provide insights into the characteristics of toxic speech in TikTok
videos. The results contribute to the development of strategies for content moderation and the
promotion of healthier online communities. Future research should address limitations and further
explore toxic speech on video-based social media platforms.""]
Collections
Related items
Showing items related by title, author, creator and subject.
-
Speech recognition at higher-than-normal speech and noise levels
Gelder, M.E. van (2010)Previous research has demonstrated reduced speech recognition of normal hearing listeners when speech is presented at higher-than-normal levels (e.g., above conversational speech levels), particularly in the presence of ... -
Measuring the performance of an automatic speech recognition system: The effect of speaker gender and speech register.
Leliveld, I. (2020)Speech recognition is an important part of artificial intelligence and has gotten a lot better over the years, but there is still room for improvement. Often the automatic speech recognition systems are not trained equally ... -
Речь Короля. 'The King's speech'. De gevolgen van de speech van Chroesjtsjov voor het Warschaupact.
Berger, L.E. (2021)Met het openen van de Oost-Europese archieven werd er een enorme hoeveelheid aan eerder onbekende bronnen over de politieke geschiedenis van het Oosten openbaar gemaakt. Met deze nieuwe kennis afkomstig van de binnenste ...