Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorBagheri, Ayoub
dc.contributor.authorSantos Moitinho de Almeida, Lourenço
dc.date.accessioned2024-08-29T00:02:51Z
dc.date.available2024-08-29T00:02:51Z
dc.date.issued2024
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/47465
dc.description.abstractCross-cultural moral variation has become evi- dent throughout social media. Since the emer- gence of large language models (LLMs), the ethical implications of these discrepancies has grown in significance. In spite of all their ca- pabilities, these models are often criticized for their undesirable or even controversial output. Consequently, fields such as explainable (XAI) NLP have emerged in order to address the dilemma. Although moral variation has been examined in past research, the predominant methodology tends to focus on a broader per- spective that may overlook subtle differences. For these reasons, this study aims to fill the re- search gap by investigating cross-cultural moral variation with an emphasis on local explainabil- ity across four mono- and multi-lingual LLMs. Through language model probing, SHapley Ad- ditive exPlanations (SHAP) and an ethical val- ues dataset gathered from the World Values Survey (WVS), a fine-grained analysis was conducted. This study introduces the ’SHAP Logprob’ model that was built for token-level interpretations. Lastly, this study address the challenges and limitations of interpreting cross- cultural moral variation through SHAP.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectLarge Language Models as Mirrors of Societal Moral Standards
dc.titleLarge Language Models as Mirrors of Societal Moral Standards
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.keywordsLarge language models; data science; machine learning; explainable AI
dc.subject.courseuuApplied Data Science
dc.thesis.id38121


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record