Content area

Abstract

The automation of educational and instructional assessment plays a crucial role in enhancing the quality of teaching management. In physics education, calculation problems with intricate problem-solving ideas pose challenges to the intelligent grading of tests. This study explores the automatic grading of physics problems through a combination of large language models and prompt engineering. By comparing the performance of four prompt strategies (one-shot, few-shot, chain of thought, tree of thought) within two large model frameworks, namely ERNIEBot-4-turbo and GPT-4o. This study finds that the tree of thought prompt can better assess calculation problems with complex ideas (N = 100, ACC ≥ 0.9, kappa > 0.8) and reduce the performance gap between different models. This research provides valuable insights for the automation of assessments in physics education.

Details

1009240
Business indexing term
Company / organization
Title
Research on Intelligent Grading of Physics Problems Based on Large Language Models
Author
Wei, Yuhao 1 ; Zhang, Rui 2   VIAFID ORCID Logo  ; Zhang, Jianwei 2 ; Qi, Dizhi 2 ; Cui, Wenqian 2 

 Guohao College, Tongji University, Shanghai 200082, China; [email protected] 
 School of Physics Science and Engineering, Tongji University, Shanghai 200082, China; [email protected] (J.Z.); [email protected] (D.Q.); [email protected] (W.C.) 
Publication title
Volume
15
Issue
2
First page
116
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
22277102
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-01-21
Milestone dates
2024-11-03 (Received); 2025-01-15 (Accepted)
Publication history
 
 
   First posting date
21 Jan 2025
ProQuest document ID
3170873302
Document URL
https://www.proquest.com/scholarly-journals/research-on-intelligent-grading-physics-problems/docview/3170873302/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-02-25
Database
ProQuest One Academic