Show simple item record

dc.rights.licenseCC-BY-NC-ND
dc.contributor.advisorNouwen, Rick
dc.contributor.authorDinten, Yvette van
dc.date.accessioned2025-04-18T00:00:52Z
dc.date.available2025-04-18T00:00:52Z
dc.date.issued2025
dc.identifier.urihttps://studenttheses.uu.nl/handle/20.500.12932/48860
dc.description.abstractWith millions of users across the globe, the popularity of Large Language Models gave rise to questions about how companies could utilize these models effectively. Despite great interest, a gap remains in effectively applying these to corporate use cases. That is why this research explores the efficient use of LLMs by companies that utilize non-English and non-homogeneous data. Feature selection algorithms were used to reduce the cost and time of using LLMs. Moreover, the data was translated to observe if this would affect the model’s performance. Datasets were used with the E5 and Flan models, in which the E5 model is a smaller LLM with fewer parameters. More specifically, the E5 model is encoder-only, while Flan is an encoder-decoder model. Encoding is the process of embedding the input into vectors, and decoding is the process of de-embedding these vectors back into natural language. Both models preferred feature selection algorithms that reduced the datasets to very few features. Results show that the feature selection method Forward Searching combined with the E5 model has the lowest time duration and produces the highest performance. Furthermore, translation is not required, as it gives little to no improvement in performance but does add an extra step for preprocessing that increases time and costs. In conclusion, the higher performance of the E5 model suggests that a complex model does not necessarily imply a more efficient solution for corporate use cases.
dc.description.sponsorshipUtrecht University
dc.language.isoEN
dc.subjectFoundation Models: Evaluating Alternatives, Benefits, and Challenges in Automating Claims Processing in the Insurance Industry
dc.titleFoundation Models: Evaluating Alternatives, Benefits, and Challenges in Automating Claims Processing in the Insurance Industry
dc.type.contentMaster Thesis
dc.rights.accessrightsOpen Access
dc.subject.courseuuArtificial Intelligence
dc.thesis.id45136


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record