Article citationsMore >>

Shima Haghani., Morteza Sedehi, and Soleiman Kheiri, “Artificial neural network to modeling zero-inflated count data: Application to predicting number of return to blood donation,” Journal of Research in Health Sciences, 17(3), 2017.

has been cited by the following article:

Article

Model Selection for Count Data with Excess Number of Zero Counts

1Department of Statistics, Bharathiar University,Coimbatore-641046, Tamilnadu, India


American Journal of Applied Mathematics and Statistics. 2019, Vol. 7 No. 1, 43-51
DOI: 10.12691/ajams-7-1-7
Copyright © 2019 Science and Education Publishing

Cite this paper:
K.M. Sakthivel, C.S. Rajitha. Model Selection for Count Data with Excess Number of Zero Counts. American Journal of Applied Mathematics and Statistics. 2019; 7(1):43-51. doi: 10.12691/ajams-7-1-7.

Correspondence to: K.M.  Sakthivel, Department of Statistics, Bharathiar University,Coimbatore-641046, Tamilnadu, India. Email: sakthithebest@gmail.com

Abstract

Zero inflated models have been widely studied in statistical literature. Zero inflated Poisson model and hurdle model are the most commonly used models for modeling the overdispersed count data. In adddition to this, recent studies shows that a nonparametric and data dependent technique known as artificial neural networks (ANN) produce better performance for modeling the over dispersed and zero inflated count data. In this paper, we compared the performance of different models such as zero inflated Poisson model, hurdle model and ANN for modelling the zero inflated count data in terms of standardized MSE, SE, bias and relative efficiency. An application study is carried out for both the simulated data set and real data set. Also for checking the suitability of these three models, we verified the group membership of the models, by adopting three classification techniques known as discriminant analysis, CART and random forest. We proposed an algorithm for selecting the better model among a set of models and computed the misclassification rates for a zero inflated count data set using different classifiers.

Keywords