1. Can ChatGPT Solve Undergraduate Exams from Warehousing Studies? An Investigation.
- Author
- Franke, Sven; Pott, Christoph; Rutinowski, Jérôme; Pauly, Markus; Reining, Christopher; Kirchheim, Alice
- Subjects
- LANGUAGE models, GENERATIVE pre-trained transformers, CHATGPT, EDUCATIONAL evaluation, WAREHOUSES
- Abstract
- The performance of Large Language Models, such as ChatGPT, generally increases with every new model release. In this study, we investigated to what degree different GPT models were able to solve the exams of three different undergraduate courses on warehousing. We contribute to the discussion of ChatGPT's existing logistics knowledge, particularly in the field of warehousing. Both the free version (GPT-4o mini) and the premium version (GPT-4o) completed three different warehousing exams using three different prompting techniques (with and without role assignments as logistics experts or students). The o1-preview model was also used (without a role assignment) for six runs. The tests were repeated three times. A total of 60 tests were conducted and compared with the in-class results of logistics students. The results show that the GPT models passed a total of 46 tests. The best run solved 93% of the exam correctly. Compared with the students from the respective semester, ChatGPT outperformed the students in one exam. In the other two exams, the students performed better on average than ChatGPT. [ABSTRACT FROM AUTHOR]
- Published
- 2025