Identity Theft Detection at Data Ingestion Using AI: An Explainable Anomaly Detection Approach

Sachin Dattatreya Murthy

American Journal of Software Engineering. 2026, 9(1), 1-9
DOI: 10.12691/ajse-9-1-1

Open Access Article

Sachin Dattatreya Murthy¹,²

¹Independent Researcher

²United States

Pub. Date: January 03, 2026


Cite this paper:
Sachin Dattatreya Murthy. Identity Theft Detection at Data Ingestion Using AI: An Explainable Anomaly Detection Approach. American Journal of Software Engineering. 2026; 9(1):1-9. doi: 10.12691/ajse-9-1-1

Abstract

Identity theft has become one of the most dangerous and rapidly growing cybercrimes, particularly now that customers are onboarded digitally and supply only minimal information for identification and verification. Traditional rule-based systems miss many of today's sophisticated schemes, such as deepfakes, document forgery, and synthetic identities. Fraud detection is a well-researched topic, but detection at data ingestion, that is, identifying and alerting on identity theft in real time and with explanations before an account is created, remains largely unexplored even though this is the point where detection matters most. Current fraud detection systems also lack hybrid models that can detect multi-modal fraud, synthetic identities, deepfakes, and other cross-channel anomalies. Most neither integrate supervised and unsupervised detection methods nor explain the model's decisions, which limits their ability to combat modern synthetic and AI-based attacks. We present a hybrid AI framework that combines supervised learning, unsupervised anomaly detection, and explainable AI (XAI) to identify identity fraud prior to account creation. The framework combines multiple data sources (documents, biometric information, devices, and structured attributes) to produce interpretable risk scores, using SHAP values and rule-based explanations so that analysts can triage and resolve alerts efficiently. Our end-to-end design offers a scalable, compliant solution for early-stage identity theft prevention in financial services.
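
To make the scoring idea in the abstract concrete, the following is a minimal sketch and not the framework's actual implementation. It blends a supervised probability with an unsupervised anomaly score into one risk score and attaches per-feature contributions and rule-based flags. Everything specific here is an assumption made for illustration: the feature names, the toy baseline data, the weights, the 50/50 blend, and the rule thresholds. The contributions use the fact that, for a linear model with independent features, the SHAP value of feature i in log-odds space reduces to w_i * (x_i - mean_i); a production system would more likely use a tree or neural model with a SHAP library.

```python
import math
from statistics import mean, pstdev

# Hypothetical feature names and toy baseline data (not taken from the paper).
FEATURES = ["doc_tamper_score", "face_match_distance",
            "device_reuse_count", "address_age_months"]
BASELINE = [  # applications assumed legitimate, used as the reference population
    [0.05, 0.20, 1, 48],
    [0.10, 0.25, 1, 36],
    [0.02, 0.15, 2, 60],
    [0.08, 0.30, 1, 24],
    [0.04, 0.22, 1, 72],
]
# Assumed parameters of an already trained logistic (linear) supervised model.
WEIGHTS = [3.0, 2.5, 0.8, -0.02]
BIAS = -2.0

_columns = list(zip(*BASELINE))
MEANS = [mean(c) for c in _columns]
STDS = [pstdev(c) or 1.0 for c in _columns]  # guard against zero spread


def logit(x):
    """Log-odds of fraud under the assumed linear model."""
    return BIAS + sum(w * v for w, v in zip(WEIGHTS, x))


def supervised_probability(x):
    return 1.0 / (1.0 + math.exp(-logit(x)))


def anomaly_score(x):
    """Unsupervised part: mean absolute z-score, squashed into [0, 1)."""
    avg_z = sum(abs(v - m) / s for v, m, s in zip(x, MEANS, STDS)) / len(x)
    return avg_z / (1.0 + avg_z)


def contributions(x):
    """Per-feature log-odds contributions relative to the baseline mean.

    Equals the SHAP values of a linear model under feature independence.
    """
    return {name: w * (v - m)
            for name, w, v, m in zip(FEATURES, WEIGHTS, x, MEANS)}


def rule_flags(x):
    """Illustrative analyst-readable rules; thresholds are assumptions."""
    row = dict(zip(FEATURES, x))
    flags = []
    if row["device_reuse_count"] >= 5:
        flags.append("device seen on many applications")
    if row["doc_tamper_score"] >= 0.5:
        flags.append("document shows signs of tampering")
    return flags


def assess(x, blend=0.5):
    """Return a blended risk score in [0, 1] plus its explanations."""
    score = blend * supervised_probability(x) + (1 - blend) * anomaly_score(x)
    ranked = sorted(contributions(x).items(), key=lambda kv: -abs(kv[1]))
    return {"risk_score": score, "top_features": ranked, "flags": rule_flags(x)}


suspicious = [0.90, 0.80, 7, 1]   # hypothetical high-risk application
typical = [0.05, 0.20, 1, 48]     # hypothetical low-risk application
report = assess(suspicious)
print(round(report["risk_score"], 3), report["top_features"][0][0], report["flags"])
```

Keeping the contributions in log-odds space preserves additivity: the baseline log-odds plus the sum of contributions equals the model's log-odds for the application, which is what lets an analyst read each value as "how much this signal moved the decision".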

Keywords:
Identity Theft, Anomaly Detection, Deepfakes, Explainable AI, Feature Engineering, Data Pre-processing, Finance, Machine Learning (ML), Hybrid AI

This work is licensed under a Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
