A Recent Survey on Multi-modal Medical Image Fusion

Vandit Akhilesh Barola; Prabhishek Singh; Manoj Diwakar

doi:10.62762/BISH.2025.414869

CiteScore

Impact Factor

Volume 1, Issue 3, Biomedical Informatics and Smart Healthcare

Volume 1, Issue 3, 2025

Submit Manuscript Edit a Special Issue

Article QR Code

Scan the QR code for reading

Popular articles

Case Studies on Integrating Artificial Intelligence in Finance to Transform Decision Making and Risk Management for Enhanced Financial Outcomes Reinforcement Learning for Prompt Optimization in Language Models: A Comprehensive Survey of Methods, Representations, and Evaluation Challenges Research on A Ship Trajectory Classification Method Based on Deep Learning Bridging Modalities: A Survey of Cross-Modal Image-Text Retrieval AI and the Future of Education: Advancing Personalized Learning and Intelligent Tutoring Systems Enhancing Fake News Detection with a Hybrid NLP-Machine Learning Framework Plant Disease Detection Using Deep Learning Techniques Acrylamide in Food: Sources and Prevention Modeling Brain Functional Networks Using Graph Neural Networks: A Review and Clinical Application Analyzing the Translation and Impact of Popular Science Literature in China: A Case Study Approach

Biomedical Informatics and Smart Healthcare, Volume 1, Issue 3, 2025: 89-97

Open Access | Review Article | 07 November 2025

A Recent Survey on Multi-modal Medical Image Fusion

Vandit Akhilesh Barola 1

Prabhishek Singh 1 *

Manoj Diwakar 2

1 School of Computer Science Engineering and Technology, Bennett University, Greater Noida, Uttar Pradesh 201310, India

2 Department of CSE, Graphic Era Deemed to be University, Dehradun, Uttarakhand 248002, India

* Corresponding Author: Prabhishek Singh, [email protected]

DOI: 10.62762/BISH.2025.414869

Received: 26 July 2025, Accepted: 17 August 2025, Published: 07 November 2025

PDF (856.53 KB)

Article Metrics Cite This Article

Abstract

Fusion of multi-modal medical images has transformed healthcare by overcoming the limitations of single-modality imaging, where modalities such as CT, MRI, PET, and SPECT provide complementary information. This review systematically traces the evolution of multi-modal medical image fusion from conventional mathematical models to state-of-the-art artificial intelligence (AI) techniques. We examine the transition from classical approaches---such as multiscale transformations, wavelet decompositions, and sparse representation---to modern deep learning methods, including convolutional neural networks, generative adversarial networks, and transformer architectures. Key limitations of existing methods are highlighted, including limited interpretability, insufficient preservation of modality-specific details, poor cross-dataset generalization, and inadequate post-fusion refinement. The review categorizes fusion strategies into three hierarchical levels---pixel-level, feature-level, and decision-level---and analyzes their benefits and computational complexities. We provide a comparative evaluation of hybrid methods combining classical transform-based approaches with deep learning models, which show improved preservation of anatomical structures and functional information. Major clinical applications are discussed in oncology (tumor detection, staging), neurology (brain tumor localization, surgical planning), ophthalmology (disease characterization), and orthopedics (fracture detection). Challenges to clinical adoption---such as modality misalignment, information loss, high computational requirements, and lack of universal benchmarks---are examined alongside emerging concerns over data privacy and security in telehealth. We also analyze interpretability frameworks, modality-preservation techniques, cross-dataset generalization strategies, and post-fusion refinement methods. The review concludes with strategic recommendations to develop interpretable, real-time AI fusion models with robust clinical validation and standardized benchmarks to bridge the gap between theoretical advances and practical clinical integration, ultimately enhancing diagnostic accuracy and patient outcomes.

Graphical Abstract

A Recent Survey on Multi-modal Medical Image Fusion

Keywords

biomedical informatics

smart healthcare

artificial intelligence in medicine

precision medicine

digital health

Data Availability Statement

Not applicable.

Funding

This work was supported without any funding.

Conflicts of Interest

The authors declare no conflicts of interest.

Ethical Approval and Consent to Participate

Not applicable.

References

Hermessi, H., Mourali, O., & Zagrouba, E. (2021). Multimodal medical image fusion review: Theoretical background and recent advances. Signal Processing, 183, 108036.
[CrossRef] [Google Scholar]
Li, Y., Daho, M. E. H., Conze, P. H., Zeghlache, R., Le Boité, H., Tadayoni, R., ... & Quellec, G. (2024). A review of deep learning-based information fusion techniques for multimodal medical image classification. Computers in Biology and Medicine, 177, 108635.
[CrossRef] [Google Scholar]
Zhou, T., Ruan, S., & Canu, S. (2019). A review: Deep learning for medical image segmentation using multi-modality fusion. Array, 3, 100004.
[CrossRef] [Google Scholar]
Huang, B., Yang, F., Yin, M., Mo, X., & Zhong, C. (2020). A review of multimodal medical image fusion techniques. Computational and mathematical methods in medicine, 2020(1), 8279342.
[CrossRef] [Google Scholar]
Tirupal, T., Mohan, B. C., & Kumar, S. S. (2021). Multimodal medical image fusion techniques–a review. Current Signal Transduction Therapy, 16(2), 142-163.
[CrossRef] [Google Scholar]
Alseelawi, N., Hazim, H. T., & Salim ALRikabi, H. T. (2022). A novel method of multimodal medical image fusion based on hybrid approach of NSCT and DTCWT. International Journal of Online & Biomedical Engineering, 18(3).
[CrossRef] [Google Scholar]
Maqsood, S., & Javed, U. (2020). Multi-modal medical image fusion based on two-scale image decomposition and sparse representation. Biomedical Signal Processing and Control, 57, 101810.
[CrossRef] [Google Scholar]
Bhatnagar, G., Wu, Q. J., & Liu, Z. (2015). A new contrast based multimodal medical image fusion framework. Neurocomputing, 157, 143-152.
[CrossRef] [Google Scholar]
Jose, J., Gautam, N., Tiwari, M., Tiwari, T., Suresh, A., Sundararaj, V., & MR, R. (2021). An image quality enhancement scheme employing adolescent identity search algorithm in the NSST domain for multimodal medical image fusion. Biomedical Signal Processing and Control, 66, 102480.
[CrossRef] [Google Scholar]
Zhang, Y., Sidibé, D., Morel, O., & Mériaudeau, F. (2021). Deep multimodal fusion for semantic image segmentation: A survey. Image and Vision Computing, 105, 104042.
[CrossRef] [Google Scholar]
Liu, Z., Yin, H., Chai, Y., & Yang, S. X. (2014). A novel approach for multimodal medical image fusion. Expert systems with applications, 41(16), 7425-7435.
[CrossRef] [Google Scholar]
Tang, W., He, F., Liu, Y., & Duan, Y. (2022). MATR: Multimodal medical image fusion via multiscale adaptive transformer. IEEE Transactions on Image Processing, 31, 5134-5149.
[CrossRef] [Google Scholar]
Bhavana, V., & Krishnappa, H. K. (2015). Multi-modality medical image fusion using discrete wavelet transform. Procedia Computer Science, 70, 625-631.
[CrossRef] [Google Scholar]
Parvathy, V. S., Pothiraj, S., & Sampson, J. (2020). Optimal deep neural network model based multimodality fused medical image classification. Physical Communication, 41, 101119.
[CrossRef] [Google Scholar]
He, C., Liu, Q., Li, H., & Wang, H. (2010). Multimodal medical image fusion based on IHS and PCA. Procedia Engineering, 7, 280-285.
[CrossRef] [Google Scholar]
Tan, W., Tiwari, P., Pandey, H. M., Moreira, C., & Jaiswal, A. K. (2020). Multimodal medical image fusion algorithm in the era of big data. Neural computing and applications, 1-21.
[CrossRef] [Google Scholar]
Singh, K. N., Singh, O. P., Singh, A. K., & Agrawal, A. K. (2024). Watmif: Multimodal medical image fusion-based watermarking for telehealth applications. Cognitive Computation, 16(4), 1947-1963.
[CrossRef] [Google Scholar]
Lin, C., Chen, Y., Feng, S., & Huang, M. (2024). A multibranch and multiscale neural network based on semantic perception for multimodal medical image fusion. Scientific Reports, 14(1), 17609.
[CrossRef] [Google Scholar]

Cite This Article

APA Style

Barola, V. A., Singh, P., & Diwakar, M. (2025). A Recent Survey on Multi-modal Medical Image Fusion. Biomedical Informatics and Smart Healthcare, 1(3), 89–97. https://doi.org/10.62762/BISH.2025.414869

Article Metrics

Citations:

Google Scholar

Crossref

Scopus

Web of Science

Article Access Statistics:

PDF Downloads: 139

Publisher's Note

ICCK stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and Permissions

Copyright © 2025 by the Author(s). Published by Institute of Central Computation and Knowledge. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

Biomedical Informatics and Smart Healthcare

ISSN: 3068-5524 (Online)

Email: [email protected]

Portico

All published articles are preserved here permanently:
https://www.portico.org/publishers/icck/

Google Scholar

Crossref

Scopus

Web of Science

We use cookies