Liao, Yuxiang, Liu, Hantao ORCID: https://orcid.org/0000-0003-4544-3481 and Spasic, Irena ORCID: https://orcid.org/0000-0002-8132-3885 2023. Deep learning approaches to automatic radiology report generation: A systematic review. Informatics in Medicine Unlocked 39 , 101273. 10.1016/j.imu.2023.101273 |
Preview |
PDF
- Published Version
Available under License Creative Commons Attribution. Download (11MB) | Preview |
Abstract
Background: A radiology report communicates the imaging findings to the referring clinicians. The rising number of referrals has created a bottleneck in healthcare. Writing a report takes disproportionally more time than the imaging itself. Therefore, Automatic Radiology Report Generation (ARRG) has a great potential to unclog this bottleneck. Objectives: This study aims to provide a systematic review of Deep Learning (DL) approaches to ARRG. Specifically, it aims to answer the following research questions. What data have been used to train and evaluate DL approaches to ARRG? How are DL approaches to ARRG evaluated? How is DL used to generate the reports from radiology images? Materials and methods: We followed the PRISMA guidelines. We retrieved 1443 records from PubMed and Web of Science on November 3, 2021. Relevant studies were categorized and compared from multiple perspectives. The corresponding findings were reported narratively. Results: A total of 41 studies were included. We identified 14 radiology datasets. In terms of evaluation, we identified four commonly used natural language generation metrics, six clinical efficacy metrics, and other qualitative methods. We compared DL approaches with respect to the underlying neural network architecture, the method of text generation, problem representation, training strategy, interpretability, and intermediate processing. Discussion and conclusion: Data imbalance (normal versus abnormal cases) and the inner complexity of reports pose major difficulties in ARRG. More appropriate evaluation metrics are required as well as datasets on a much larger scale. Leveraging structured representation of radiology reports and pre-trained language models warrant further research. Keywords: Image processing, Natural language generation, Natural language processing, Deep learning, Neural network
Item Type: | Article |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Computer Science & Informatics |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science Q Science > QA Mathematics > QA76 Computer software |
Publisher: | Elsevier |
ISSN: | 2352-9148 |
Funders: | China Scholarship Council-Cardiff University Scholarship |
Date of First Compliant Deposit: | 22 May 2023 |
Date of Acceptance: | 12 May 2023 |
Last Modified: | 30 Jun 2023 06:49 |
URI: | https://orca.cardiff.ac.uk/id/eprint/159751 |
Actions (repository staff only)
Edit Item |