Software Quality Assessment of Developer Community Reviews Using Sentiment Analysis Techniques

Feng, Liang Feng

View/Open

381CFEBE-00FC-4560-9E3A-9FB035214B31.Final-thesis- Liang Feng-0804.pdf (2.495Mb)

Publication date

2025

Author

Feng, Liang Feng

Metadata

Show full item record

Summary

Software developers actively contribute their reviews of software components within various software communities, including platforms such as Stack Overflow. Analyzing these user reviews can provide valuable insight into the strengths and weaknesses of software components. The objective of this study is to explore implementation strategies for the application of sentiment analysis tools in specific domains, with a particular emphasis on the software engineering domain. As well as to investigate approaches for evaluating domain-specific sentiment analysis methods and techniques. The key components of this study include constructing a comprehensive software quality keyword list for effective mapping and implementing accurate sentiment analysis algorithms. In order to evaluate the effectiveness of the models, an evaluation dataset containing 4500 data was created by leveraging advanced pre-trained models to generate sentiment labels. Finally, a dataset comprising over 50,000 relevant reviews was extracted from Stack Overflow, G2, and TrustRadius. After conducting a comparative analysis of four state-of-the-art models in this domain, such as Senti4SD, TextCNN, SentiStrength-SE, and RoBERTa, it was observed that the RoBERTa model outperforms the others in various metrics. Therefore, the RoBERTa model was selected as the sentiment analysis model for this study. On the evaluation dataset, the RoBERTa model demonstrated a weighted average precision of 0.91 and a weighted average f-1 score of 0.78.

URI

https://studenttheses.uu.nl/handle/20.500.12932/50316

Collections

Theses