AUTOMATED CHEST X-RAY REPORT GENERATION USING ARTIFICIAL INTELLIGENCE BASED ATTENTION-ENHANCED GOOGLENET-LSTM ARCHITECTURE

Muhammad Faheem Paracha; Mudasir Mahmood; Muhammad Farhan

doi:10.15588/1607-3274-2026-2-9

Authors

Muhammad Faheem Paracha Gomal University, Dera Ismail Khan, Pakistan
Mudasir Mahmood Gomal University, Dera Ismail Khan, Pakistan
Muhammad Farhan Gomal University, Dera Ismail Khan, Pakistan

DOI:

https://doi.org/10.15588/1607-3274-2026-2-9

Keywords:

Convolution Neural Networks, Deep Learning, Chest X-ray, Radiology Report Generation, GoogleNet, LSTM, Attention Mechanism

Abstract

Context. Proper and effective diagnosis of chest diseases is vital in timely treatment and efficient clinical decision-making. Chest X-rays (CXRs) are commonly utilized in the detection of chest diseases because they are readily accessible and cost-effective. Nevertheless, radiograph interpretation is still a time-consuming, subjective, and error-prone process, especially in health care settings where resources are limited. Radiologists require automated systems capable of producing consistent diagnostic information that will facilitate quicker and standardized patient care.
Objective. The proposed research will develop and assess a deep learning model with the ability to produce valid and explainable diagnostic reports using chest radiographs. The main aim is to minimize human error, save time in the diagnostic process, and deliver uniform findings, which will guide clinicians to make sound decisions within a short period of time.
Method. The images are then extracted using a convolutional neural network called GoogleNet that extracts high-level visual features, which contain important structural and anatomical information. The features extracted are then fed to a Long Short-Term Memory network, which represents the sequential character of the report generation process by conditioning itself on relationships between words and phrases in diagnostic text. To enhance its accuracy and interpretability, an attention mechanism is added to allow the system to concentrate on the most clinically valuable parts of the image when producing every part of the report. The Indiana University Chest X-ray dataset was employed to train and evaluate the proposed system, while several experiments were carried out to evaluate the performance of the proposed system regarding its performance against the existing benchmark models.
Results. The GoogleNet-LSTM-Attention model was shown to be more effective at generating high-quality diagnostic reports. It has performed well compared to benchmark models on various natural language evaluation measures, such as BLEU, ROUGE, and CIDEr scores. These improvements indicate that the quality of clinical data and fluency of text generated are correlated and that CNN, RNN, and attention mechanisms are effective in medical image reporting.
Conclusions. The study presented shows that a combination of CNNs, LSTMs, and attention in a single architecture has the potential to transform the process of interpreting chest X-rays. The system proposed not only improves the precision of the diagnosis but also provides clinical assistance, as it allows for performing radiographic assessment rapidly, consistently, and interpretably. Such AI-driven systems can bring about the potential to reduce workloads, decrease diagnostic errors, and enhance patient outcomes in various healthcare facilities.

Author Biographies

Muhammad Faheem Paracha, Gomal University, Dera Ismail Khan

Post-graduate student of the Faculty of Computing

Mudasir Mahmood, Gomal University, Dera Ismail Khan

Assistant Professor, Faculty of Computing, Gomal University

Muhammad Farhan, Gomal University, Dera Ismail Khan

Lecture, Faculty of Computing

References

Wang X., Peng Y., Lu L. et al. ChestX-ray8: Hospitalscale Chest X-ray Database and Benchmarks on WeaklySupervised Classification and Localization of Common Thorax Diseases. IEEE Conf. Comput. Vis. Pattern Recogni, 2017, pp. 3462–3471.

Brady A. P. Measuring radiologist workload: How to do it, and why it matters. Eur. Radiol., 2011, Vol. 21, № 11, pp. 2315–2317.

Cowan I. A., MacDonald S. L. S., Floyd R. A. Measuring and managing radiologist workload: Measuring radiologist reporting times using data from a Radiology Information System. J. Med. Imaging Radiat. Oncol., 2013, Vol. 57, № 5, pp. 558–566.

Geel van K., Lameijer I., Wielaard S. et al. Chest X-ray evaluation training: impact of normal and abnormal image ratio and instructional sequence. Med. Educ., 2019, Vol. 53, № 2, pp. 153–164.

Islam M. S., Rahman M. M., Mahmud S. et al. Challenges issues and future recommendations of deep learning techniques for SARS-CoV-2 detection utilising X-ray and CT images: a comprehensive review. PeerJ Comput. Sci., 2024, Vol. 10.

Li Q., Zhang W., Zhang X. et al. A Survey on Text Classification: From Shallow to Deep Learning. ACM Trans. Intell. Syst. Technol. Association for Computing Machinery, 2020, Vol. 37, № 4.

Sajed S., Inayat M., Qadir A. et al. The effectiveness of deep learning vs. traditional methods for lung disease diagnosis using chest X-ray images: A systematic review. Appl. Soft Comput., 2020, Vol. 147.

Hwang E. J., Park S. Y., Kim J. H. et al. Deep learning for chest radiograph diagnosis in the emergency department. Radiology, 2019, Vol. 293, № 3, pp. 573–580.

Rajpurkar P., Irvin J., Zhu K.et al. Deep learning for chest radiograph diagnosis: A retrospective comparison of the CheXNeXt algorithm to practicing radiologists. PLoS Med., 2018, Vol. 15, № 11, pp. 1–17.

Sahlol A. T., Abdulkadir M., Elaziz A. A. et al. A novel method for detection of tuberculosis in chest radiographs using artificial ecosystem-based optimisation of deep neural network features. Symmetry (Basel), 2020, Vol. 12, № 7.

Majkowska A., Pham C., He K. et al. Chest radiograph interpretation with deep learning models: Assessment with radiologist-adjudicated reference standards and population-adjusted evaluation. Radiology, 2020, Vol. 294, № 2. pp. 421–431.

Li X., Wang Y., Xu S. et al. Deep Learning in Chest Radiography: Detection of Pneumoconiosis. Biomed. Environ. Sci., 2021, Vol. 34, № 10, pp. 842–845.

Uçar M. Deep neural network model with Bayesian optimization for tuberculosis detection from X-Ray images. Multimed. Tools Appl. Springer US, 2023, Vol. 82, № 24, pp. 36951–36972.

Devlin J., Chang M., Lee K. et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. NAACL HLT 2019 – 2019 Conf. North Am. Chapter Assoc. Comput. Linguist. Hum. Lang. Technol. – Proc. Conf. Association for Computational Linguistics (ACL), 2018, Vol. 1, pp. 4171–4186.

Ouis M. Y., Akhloufi M. A. ChestBioX-Gen: contextual biomedical report generation from chest X-ray images using BioGPT and co-attention mechanism. Front. Imaging., 2024, Vol. 3.

Wang X., Peng Y., Lu L. et al. ChestX-ray: HospitalScale Chest X-ray Database and Benchmarks on Weakly Supervised Classification and Localization of Common Thorax Diseases. Adv. Comput. Vis. Pattern Recognit., 2019, № September, pp. 369–392.

Irvin J., Rajpurkar P., Ko M. et al. CheXpert: A large chest radiograph dataset with uncertainty labels and expert comparison. 33rd AAAI Conf. Artif. Intell. AAAI 2019, 31st Innov. Appl. Artif. Intell. Conf. IAAI 2019 9th AAAI Symp. Educ. Adv. Artif. Intell. EAAI 2019, 2019,

pp. 590–597.

Johnson A. E. W., Pollard T. J., Berkowitz S. J. et al. MIMIC-CXR-JPG, a large publicly available database of labeled chest radiographs, 2019, Vol. 14, pp. 1–7.

Nicolson A., Dowling J., Koopman B. Improving chest X-ray report generation by leveraging warm starting. Artif. Intell. Med. Elsevier B.V., 2023, Vol. 144, № April 2022, P. 102633.

Bustos A., Pertusa Y., Salinas J. M. et al. PadChest: A large chest x-ray image dataset with multi-label annotated reports. Med. Image Anal., 2020, Vol. 66, pp. 1–35.

Yang X., Zhou J., Li Y. et al. Writing by memorizing: Hierarchical retrieval-based medical report generation. ACL-IJCNLP 2021 – 59th Annu. Meet. Assoc. Comput. Linguist. 11th Int. Jt. Conf. Nat. Lang. Process. Proc. Conf., 2021, pp. 5000–5009.

Chen Z., Peng Y., Liu X. et al. Cross-modal memory networks for radiology report generation. ACL-IJCNLP 2021 – 59th Annu. Meet. Assoc. Comput. Linguist. 11th Int. Jt. Conf. Nat. Lang. Process. Proc. Conf., 2021, № 2018, pp. 5904–5914.

Kaur N., Mittal A. CheXPrune: sparse chest X-ray report generation model using multi-attention and one-shot global pruning. J. Ambient Intell. Humaniz. Comput. J Ambient Intell Humaniz Comput, 2023, Vol. 14, № 6, pp. 7485–7497.

Gu Y., Li X., Zhang L. et al. Automatic Medical Report Generation Based on Cross-View Attention and VisualSemantic Long Short Term Memorys. Bioengineering. Multidisciplinary Digital Publishing Institute (MDPI), 2023, Vol. 10, № 8.

Yang S., Yin H., Wang X. et al. Radiology report generation with a learned knowledge base and multimodal alignment. Med. Image Anal., 2023, Vol. 86.

Veras Magalhães G., Santos dos L., Ferreira M. et al. XRaySwinGen: Automatic medical reporting for X-ray exams with multimodal model. Heliyon, Elsevier Ltd, 2024, Vol. 10, № 7.

Zhao J., Chen Y., Li Q. et al. Automated Chest X-Ray Diagnosis Report Generation with Cross-Attention Mechanism. Appl. Sci., 2025, Vol. 15, № 1.

Singh P., Singh S. ChestX-Transcribe: a multimodal transformer for automated radiology report generation from chest x-rays. Front. Digit. Heal. Frontiers Media SA, 2025, Vol. 7, P. 1535168

AUTOMATED CHEST X-RAY REPORT GENERATION USING ARTIFICIAL INTELLIGENCE BASED ATTENTION-ENHANCED GOOGLENET-LSTM ARCHITECTURE

Authors

DOI:

Keywords:

Abstract

Author Biographies

Muhammad Faheem Paracha, Gomal University, Dera Ismail Khan

Mudasir Mahmood, Gomal University, Dera Ismail Khan

Muhammad Farhan, Gomal University, Dera Ismail Khan

References

Downloads

Published

How to Cite

Issue

Section

License

Creative Commons Licensing Notifications in the Copyright Notices

Information

Current Issue

Announcements