Full text

Turn on search term navigation

© 2024. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Background:Previous research applying large language models (LLMs) to medicine was focused on text-based information. Recently, multimodal variants of LLMs acquired the capability of recognizing images.

Objective:We aim to evaluate the image recognition capability of generative pretrained transformer (GPT)-4V, a recent multimodal LLM developed by OpenAI, in the medical field by testing how visual information affects its performance to answer questions in the 117th Japanese National Medical Licensing Examination.

Methods:We focused on 108 questions that had 1 or more images as part of a question and presented GPT-4V with the same questions under two conditions: (1) with both the question text and associated images and (2) with the question text only. We then compared the difference in accuracy between the 2 conditions using the exact McNemar test.

Results:Among the 108 questions with images, GPT-4V’s accuracy was 68% (73/108) when presented with images and 72% (78/108) when presented without images (P=.36). For the 2 question categories, clinical and general, the accuracies with and those without images were 71% (70/98) versus 78% (76/98; P=.21) and 30% (3/10) versus 20% (2/10; P≥.99), respectively.

Conclusions:The additional information from the images did not significantly improve the performance of GPT-4V in the Japanese National Medical Licensing Examination.

Details

Title
Capability of GPT-4V(ision) in the Japanese National Medical Licensing Examination: Evaluation Study
Author
Nakao, Takahiro  VIAFID ORCID Logo  ; Miki, Soichiro  VIAFID ORCID Logo  ; Nakamura, Yuta  VIAFID ORCID Logo  ; Kikuchi, Tomohiro  VIAFID ORCID Logo  ; Nomura, Yukihiro  VIAFID ORCID Logo  ; Hanaoka, Shouhei  VIAFID ORCID Logo  ; Yoshikawa, Takeharu  VIAFID ORCID Logo  ; Abe, Osamu  VIAFID ORCID Logo 
First page
e54393
Section
Artificial Intelligence (AI) in Medical Education
Publication year
2024
Publication date
2024
Publisher
JMIR Publications
e-ISSN
23693762
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2956705925
Copyright
© 2024. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.