Article citationsMore >>

P. Lewis, E. Perez, A. Piktus, F. Petroni, V. Karpukhin, N. Goyal, H. Küttler, M. Lewis, W. Yih, T. Rocktäschel, S. Riedel, and D. Kiela, "Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks," Advances in Neural Information Processing Systems (NeurIPS), 2020.

has been cited by the following article:

Article

Improving Medical Multimodal Retrieval with Graph-Rag and Fusion Methods with Mmedrag++

1Department of Computer Science Engineering, Keshav Memorial College of Engineering, Ibrahimpatnam, Telangana, India


Journal of Computer Sciences and Applications. 2026, Vol. 14 No. 1, 21-30
DOI: 10.12691/jcsa-14-1-4
Copyright © 2026 Science and Education Publishing

Cite this paper:
Dr. A.Shilpa Gupta, M. Uday Reddy, K. Jayanth Kumar Reddy, P. Medha Goud, T. Harshitha Reddy, Aditi Jopat. Improving Medical Multimodal Retrieval with Graph-Rag and Fusion Methods with Mmedrag++. Journal of Computer Sciences and Applications. 2026; 14(1):21-30. doi: 10.12691/jcsa-14-1-4.

Correspondence to: Dr.  A.Shilpa Gupta, Department of Computer Science Engineering, Keshav Memorial College of Engineering, Ibrahimpatnam, Telangana, India. Email: a.shilpagupta@gmail.com

Abstract

We introduce MMedRAG++,a medical multimodal retrieval system that uses fusion-based representation learning and graph-based reranking (Graph-RAG) to improve on conventional Retrieval-Augmented Generation (RAG). Unlike baseline systems, which do not incorporate fusion strategies or graph-based reranking, MMedRAG++ improves cross-modal embeddings and retrieval coherence. Experiments are conducted primarily on PMC-OA, a challenging dataset with only ~10% unique captions and many unrelated image-text pairs, and IU-Xray is used for modality- specific subtasks. Graph-RAG demonstrates improved retrieval centrality and diversity, and fusion strategies, including Cross- Attention and DeepSet Fusion, enhance embedding quality. Quantitative evaluation on PMC-OA and IU-Xray confirms improved retrieval coherence and cross-modal alignment over baseline configurations. Top-1 Accuracy: 18.7%, Top-10 Accuracy: 57.4%.

Keywords