Show simple item record

dc.rights.license: CC-BY-NC-ND
dc.contributor.advisor: Paperno, Denis
dc.contributor.author: Dona, Loïs
dc.date.accessioned: 2023-08-08T00:01:24Z
dc.date.available: 2023-08-08T00:01:24Z
dc.date.issued: 2023
dc.identifier.uri: https://studenttheses.uu.nl/handle/20.500.12932/44523
dc.description.abstract: This work builds upon an analogy between human and artificial reasoning. Large Transformer-based language models have achieved state-of-the-art performance on many tasks, and recently have even been able to do so without task-specific fine-tuning, by relying on in-context or zero-shot learning. However, reasoning remains a difficult task for these models, especially in a zero-shot setting. In contrast, humans reason well without being explicitly trained for it the way models are, suggesting that properties of human reasoning might help boost model performance. We explore this intuition in two ways: (1) using human-like linguistic input for fine-tuning and (2) prompting models to "imagine", a technique that has been shown to help humans reason better. Our results show that this approach was fruitful for reasoning about fantastical scenarios, which is in line with previous research on humans, confirming that the analogy between human and artificial reasoning can be helpful. This research opens many doors for future work on zero-shot reasoning, including with smaller models, which is a desirable development towards human-like general intelligence.
dc.description.sponsorship: Utrecht University
dc.language.iso: EN
dc.subject: In this work I make an analogy between human reasoning and zero-shot reasoning in GPT-2 and GPT-3. This is done in two ways: (1) using human-like input to fine-tune GPT models and (2) using an adapted version of chain-of-thought prompting, inspired by psychological research on children's reasoning. Results show that these methods, especially the first, can be a promising avenue for using smaller language models in the context of zero-shot reasoning.
dc.title: Applying Characteristics of Human Reasoning to Zero-shot Reasoning in GPT Models
dc.type.content: Master Thesis
dc.rights.accessrights: Open Access
dc.subject.courseuu: Artificial Intelligence
dc.thesis.id: 21271

