Journal of Computer Sciences and Applications
ISSN (Print): 2328-7268 ISSN (Online): 2328-725X Website: http://www.sciepub.com/journal/jcsa Editor-in-chief: Minhua Ma, Patricia Goncalves
Open Access
Journal Browser
Go
Journal of Computer Sciences and Applications. 2015, 3(3), 61-66
DOI: 10.12691/jcsa-3-3-1
Open AccessArticle

Transplanting Binary Decision Trees

Eli M. Dow1, and Tim Penderghest2,

1IBM / Clarkson University, Potsdam NY, USA

2Clarkson University, Potsdam NY, USA

Pub. Date: May 04, 2015

Cite this paper:
Eli M. Dow and Tim Penderghest. Transplanting Binary Decision Trees. Journal of Computer Sciences and Applications. 2015; 3(3):61-66. doi: 10.12691/jcsa-3-3-1

Abstract

In this paper, we describe a means of compiling binary decision trees as generated by the C4.5 binary decision tree classifier into high-performance, reusable, stand-alone, run-time classifiers. We demonstrate the memory savings and run time characteristics of a compiled tree as compared to the traditional use of a C4.5 runtime. We demonstrate 100% correctness over every input we have available for testing as compared to our own enhanced version of the classic C4.5 run-time classification routine, consultr. In addition, this work provides a framework for comparing decision tree classifiers to more in vogue classifiers such as support vector machines as demonstrated within.

Keywords:
binary decision tree classifier code-generator AI

Creative CommonsThis work is licensed under a Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

References:

[1]  GCC - http://gcc.gnu.org/.
 
[2]  LIBSVM - Chih-Chung Chang and Chih-Jen Lin, LIBSVM : a library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2:27:1-27:27, 2011. Software available online: http://www.csie.ntu.edu.tw/~cjlin/libsvm.
 
[3]  VAPNIK1 - Vapnik, V. N. (1995). The nature of statistical learning theory. New York: Springer.
 
[4]  VAPNIK2 - Vapnik, V. N. (1998). Statistical learning theory. New York: Wiley.
 
[5]  EFFECTIVENESS - Mahesh Pal, Paul M Mather, An assessment of the effectiveness of decision tree methods for land cover classification, Remote Sensing of Environment, Volume 86, Issue 4, 30 August 2003, Pages 554-565.
 
[6]  QUINLAN - J. Ross Quinlan: C4.5: Programs for Machine Learning Morgan Kaufmann 1993.
 
[7]  IODT - J. R. Quinlan. 1986. Induction of Decision Trees. Mach. Learn. 1, 1 (March 1986), 81-106.
 
[8]  C - ANSI X3.159-1989 "Programming Language C.
 
[9]  C++ - C++ standard, 14882:2011.
 
[10]  GO – http://golang.org.
 
[11]  Python – http://www.python.org.
 
[12]  PATCLASS - Pattern Classification, 2nd Edition Richard O. Duda, Peter E. Hart, David G. Stork ISBN: 978-0-471-05669-0 680 pages November 2000, ©2001.
 
[13]  MITCHELL - Machine Learning, Tom Mitchell, McGraw Hill, 1997. http://www.cs.cmu.edu/~tom/mlbook.html.
 
[14]  QEMU - Fabrice Bellard. 2005. QEMU, a fast and portable dynamic translator. In Proceedings of the annual conference on USENIX Annual Technical Conference (ATEC '05). USENIX Association, Berkeley, CA, USA, 41-41.
 
[15]  SGI - . http://www.sgi.com/tech/mlc/db/.
 
[16]  UCI - http://archive.ics.uci.edu/ml/.
 
[17]  EMPTY - http://empty.sourceforge.net/.
 
[18]  ML - Ryszard S. Michalski, Jaime G. Carbonell, Tom M. Mitchell (1983), Machine Learning: An Artificial Intelligence Approach, Tioga Publishing Company.
 
[19]  SURVEY - Wu et al. Top 10 algorithms in data mining. Knowledge and Information Systems, 14(2008), 1: 1-37.