LLM Math and Moral Reasoning

LLM Reasoning: A preliminary study on mathematical and moral reasoning

This project is a part of a preliminary study done using MMLU benchmark using Mistral 8x7B LLM

Data: Measuring Massive Multitask Language Understanding (Hendrycks et al., 2020) Task: Elementary Mathematics and Moral Scenarios Test data: A randomly generated sample of 100 instances from test data of MMLU moral scenarios and elementary mathematics

Results:

RQ1

Here the max_tokens parameter is set to 1 for LLM generation

Elementary Math

Prompt type	Accuracy	Invalid answers
Zero-shot	16%	71%
Random one-shot	35%	12%
Random three-shot	39%	13%
Dynamic one-shot	38%	10%
Dynamic three-shot	40%	13%

Moral Scenarios

Prompt type	Accuracy	Invalid answers
Zero-shot	29%	22%
Random one-shot	47%	1%
Random three-shot	48%	8%
Dynamic one-shot	40%	1%
Dynamic three-shot	53%	1%

RQ2

Here the max_tokens parameter is set to 256 for LLM generation

Elementary Math

Prompt type	Accuracy	Invalid answers
Zero-shot	57%	2%
Linguistic Chain-of-Thought	49%	5%
Chain-of-Thought	56%	1%

Moral Scenarios

Prompt type	Accuracy	Invalid answers
Zero-shot	29	0%
Linguistic Chain-of-Thought	28%	0%
Chain-of-Thought	26%	0%

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
RQ1_data		RQ1_data
RQ2_data		RQ2_data
RQ2_results		RQ2_results
README.md		README.md
RQ1.ipynb		RQ1.ipynb
RQ2_code.ipynb		RQ2_code.ipynb
project_report.pdf		project_report.pdf
research_overall_presentation.pdf		research_overall_presentation.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Math and Moral Reasoning

RQ1

Elementary Math

Moral Scenarios

RQ2

Elementary Math

Moral Scenarios

About

Releases

Packages

Contributors 2

Languages

dhwaniserai/LLM_math_and_moral_reasoning

Folders and files

Latest commit

History

Repository files navigation

LLM Math and Moral Reasoning

RQ1

Elementary Math

Moral Scenarios

RQ2

Elementary Math

Moral Scenarios

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages