gAImble
Persistent URL
Author(s)
Wolff, William
Date Issued
May 5, 2025
Abstract
Since the rise in popularity of large language models such as GPT, Gemini, and Claude, we have been seeing an advance towards a new age of technology. Despite what we now know about large language models, questions remain about how advanced they are. This project tests how well large language models can reason in a risk/reward scenario, with blackjack as the chosen scenario. The average player wins 42% of the time. Using basic blackjack strategy as a baseline for evaluation, we examine whether three large language models (GPT, Gemini, and Claude) can reason and perform more than 2% better or worse than the average player. For its play to count as reasoning, each large language model cannot make an illegal move more than 3% of the time. Furthermore, this project examines whether the models can reason in more than 20% of the total runs performed.
Major
Computer Science
First Reader(s)
Luman, Douglas J.
Other Reader(s)
Green, Morgan
Department
Computer and Information Science
Type of Publication
Senior Project Paper
File(s)
Name
SeniorThesis.pdf
Size
421.94 KB
Format
Adobe PDF
Checksum (MD5)
545457b2d22cdb43a83958068b532a00