Article citationsMore >>

Acion, Laura & Kelmansky, Diana & Laan, Mark & Sahker, Ethan & Jones, Deshauna & Arndt, Stephan. (2017). Use of a machine learning framework to predict substance use disorder treatment success. PLOS ONE. 12. e0175383.

has been cited by the following article:

Article

A Machine Learning Approach to Identify Socio-Economic Factors Responsible for Patients Dropping out of Substance Abuse Treatment

1Marquette High School, Chesterfield, MO, USA

2Department of Mathematics, Southeast Missouri State University, Cape Girardeau, MO, USA


American Journal of Public Health Research. 2020, Vol. 8 No. 5, 140-146
DOI: 10.12691/ajphr-8-5-2
Copyright © 2020 Science and Education Publishing

Cite this paper:
Prateek Gautam, Pradeep Singh. A Machine Learning Approach to Identify Socio-Economic Factors Responsible for Patients Dropping out of Substance Abuse Treatment. American Journal of Public Health Research. 2020; 8(5):140-146. doi: 10.12691/ajphr-8-5-2.

Correspondence to: Pradeep  Singh, Department of Mathematics, Southeast Missouri State University, Cape Girardeau, MO, USA. Email: psingh@semo.edu

Abstract

In recent years, the subject of substance abuse has drawn considerable attention from researchers and policymakers alike. Researchers have been utilizing the wealth of patient level data available from various agencies to develop prediction models for the relationship between socio-economic factors and substance abuse issues. According to the Substance Abuse and Mental Health Services Administration (SAMHSA), in 2017, 26% of patients admitted to treatment facilities drop out prematurely, which is significant when considering that roughly 1.5 million people are admitted to these treatment facilities every year, thereby revealing the need for an analysis to identify variables associated with such a large number of people not completing treatment. This study applies Multiple Logistic Regression (MLRM) as well as Random Forest Classification (RF) model to determine significant socio-economic factors responsible for patients prematurely dropping out of substance abuse treatment for opioid misuse. A MLRM has its limitations when the dataset has a large number of categorical variables; machine learning methods such as RF have proven more effective and accurate when dealing with such data. Patient level data from the Treatment Episode Dataset - Discharge (TEDS-D 2017) was analyzed and the models were compared using the Area Under the Curve (AUC) operating characteristic. The MLRM was found to have an AUC of .68 while the RF model had an AUC of .89, thereby demonstrating the advantage of machine learning methods. The factors deemed significant from the RF model can provide healthcare professionals as well as administrative officials with the necessary information to help address the issue of patients prematurely dropping out of opioid misuse treatment.

Keywords