[1]WANG Lu,DING Mufei,ZHOU He,et al.Developing and employing large language models in medicine[J].CAAI Transactions on Intelligent Systems,2025,20(6):1295-1303.[doi:10.11992/tis.202410020]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
20
Number of periods:
2025 6
Page number:
1295-1303
Column:
综述
Public date:
2025-11-05
- Title:
-
Developing and employing large language models in medicine
- Author(s):
-
WANG Lu1; 2; DING Mufei1; ZHOU He1; HE Qianqian1; SONG Jiangdian1
-
1. School of Health Management, China Medical University, Shenyang110122, China;
2. Shengjing Hospital of China Medical University, Shenyang 110004, China
-
- Keywords:
-
chatbot; artificial intelligence; large language models; ChatGPT; health care; clinical diagnosis; medical consultation; medical informatics
- CLC:
-
TP18; R319
- DOI:
-
10.11992/tis.202410020
- Abstract:
-
Since the introduction of ChatGPT (chat generative pre-trained Transformer) in November 2022, studies related to large language models (LLMs) for medical applications are increasing; however, a systematic review of this field is lacking. This review covered studies indexed in PubMed, Google Scholar, arXiv, bioXiv, and medRxiv up until June 31, 2024, and identified 129 medical LLMs. LLMs were evaluated in clinical contexts, including their responses to medical queries, performance comparison, and specialist evaluation. The results revealed that general-purpose LLMs, such as ChatGPT and GPT-4, demonstrate better accuracy in generating medical records, whereas disease-specific LLMs excel in niche areas but may lack comprehensiveness. Challenges include variability in responses, readability issues, and biases, with few studies on LLM trustworthiness from patient or insurance perspectives.