Article citationsMore >>

Baeza-Yates, Ricardo and Ribeiro-Neto, Berthier. 1999. Modern Information Retrieval. ACM Press, New York. Addison-Wesley. Retrieved from people.ischool.berkeley.edu/~hearst/irbook/print/chap10.pdf.

has been cited by the following article:

Article

Characterisation of Academic Journal Publications Using Text Mining Techniques

1Computer Science Department, University of Ibadan, Ibadan, Nigeria


Journal of Computer Sciences and Applications. 2017, Vol. 5 No. 2, 42-49
DOI: 10.12691/jcsa-5-2-1
Copyright © 2017 Science and Education Publishing

Cite this paper:
Adebola K. Ojo, Adesesan B. Adeyemo. Characterisation of Academic Journal Publications Using Text Mining Techniques. Journal of Computer Sciences and Applications. 2017; 5(2):42-49. doi: 10.12691/jcsa-5-2-1.

Correspondence to: Adebola  K. Ojo, Computer Science Department, University of Ibadan, Ibadan, Nigeria. Email: adebola_ojo@yahoo.co.uk

Abstract

The ever-growing volume of published academic journals and the implicit knowledge that can be derived from them has not fully enhanced knowledge development but rather resulted into information and cognitive overload. However, publication data are textual, unstructured and anomalous. Analysing such high dimensional data manually is time consuming and this has limited the ability to make projections and trends derivable from the patterns hidden in various publications. This study was designed to develop and use intelligent text mining techniques to characterise academic journal publications. Journals Scoring Criteria by nineteen rankers from 2001 to 2013 of 50th edition of Journal Quality List (JQL) were used as criteria for selecting the highly rated journals. The text-miner software developed was used to crawl and download the abstracts of papers and their bibliometric information from the articles selected from these journal articles. The datasets were transformed into structured data and cleaned using filtering and stemming algorithms. Thereafter, the data were grouped into series of word features based on bag of words document representation. The highly rated journals were clustered using Self-Organising Maps (SOM) method with attribute weights in each cluster.

Keywords