View Item 
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        •   Utrecht University Student Theses Repository Home
        • UU Theses Repository
        • Theses
        • View Item
        JavaScript is disabled for your browser. Some features of this site may not work without it.

        Browse

        All of UU Student Theses RepositoryBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

        Software Quality Assessment of Developer Community Reviews Using Sentiment Analysis Techniques

        Thumbnail
        View/Open
        381CFEBE-00FC-4560-9E3A-9FB035214B31.Final-thesis- Liang Feng-0804.pdf (2.495Mb)
        Publication date
        2025
        Author
        Feng, Liang Feng
        Metadata
        Show full item record
        Summary
        Software developers actively contribute their reviews of software components within various software communities, including platforms such as Stack Overflow. Analyzing these user reviews can provide valuable insight into the strengths and weaknesses of software components. The objective of this study is to explore implementation strategies for the application of sentiment analysis tools in specific domains, with a particular emphasis on the software engineering domain. As well as to investigate approaches for evaluating domain-specific sentiment analysis methods and techniques. The key components of this study include constructing a comprehensive software quality keyword list for effective mapping and implementing accurate sentiment analysis algorithms. In order to evaluate the effectiveness of the models, an evaluation dataset containing 4500 data was created by leveraging advanced pre-trained models to generate sentiment labels. Finally, a dataset comprising over 50,000 relevant reviews was extracted from Stack Overflow, G2, and TrustRadius. After conducting a comparative analysis of four state-of-the-art models in this domain, such as Senti4SD, TextCNN, SentiStrength-SE, and RoBERTa, it was observed that the RoBERTa model outperforms the others in various metrics. Therefore, the RoBERTa model was selected as the sentiment analysis model for this study. On the evaluation dataset, the RoBERTa model demonstrated a weighted average precision of 0.91 and a weighted average f-1 score of 0.78.
        URI
        https://studenttheses.uu.nl/handle/20.500.12932/50316
        Collections
        • Theses
        Utrecht university logo