Efficient and Scalable Matrix Factorization Transfer with Review Helpfulness for Massive Data Processing

Aboagye Emelia Opoku; Jianbin Gao; Dagadu Joshua Caleb; Qi Xia

Journal of Computer Sciences and Applications. 2017, 5(2), 76-82
DOI: 10.12691/jcsa-5-2-4

Aboagye Emelia Opoku¹,², Jianbin Gao³, Dagadu Joshua Caleb⁴ and Qi Xia⁵,⁶

¹School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China

²Kumasi Technical University, Kumasi, Ghana

³School of Resource and Environment, University of Electronic Science and Technology of China, Chengdu, China

⁴Computer Science Department, University of Electronic Science and Technology, Chengdu, China

⁵School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China

⁶Center for Cyber Security, University of Electronic Science and Technology of China, Chengdu, Sichuan, China

Pub. Date: July 18, 2017

Cite this paper:
Aboagye Emelia Opoku, Jianbin Gao, Dagadu Joshua Caleb and Qi Xia. Efficient and Scalable Matrix Factorization Transfer with Review Helpfulness for Massive Data Processing. Journal of Computer Sciences and Applications. 2017; 5(2):76-82. doi: 10.12691/jcsa-5-2-4

Abstract

We explore the sparsity problem in recommendation systems (RSs), which is normally caused by missing and noisy ratings and/or review helpfulness scores, through the concept of transfer learning (TL). TL is a machine learning (ML) method that extracts knowledge gained in a source task or domain and uses it to facilitate the learning of a target predictive function in a different domain. The creation and transfer of knowledge are a basis for competitive advantage. One of the prevailing challenges in this era of big data is designing scalable algorithms that process massive data with reduced computational complexity. In the RS field, data sparsity is one of the inherent problems researchers continually try to solve: the data associated with rating scores and review helpfulness scores are always sparse. Meanwhile, review helpfulness votes help facilitate consumers' purchase decision-making. We use online review helpfulness votes as an auxiliary information source and design a matrix transfer framework to address the sparsity problem. We model our Homogenous Fusion Transfer Learning approach on matrix factorization (HMT) with review helpfulness, both to solve the sparsity problem of recommender systems and to enhance predictive performance within the same domain. Our experiments show that our framework, Efficient Matrix Transfer Learning (HMT), is scalable, computationally less expensive, and solves the sparsity problem of recommendations in the e-commerce industry.
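
The abstract does not spell out the HMT objective, so the following is only a minimal sketch of the general idea it describes, not the authors' algorithm. It assumes a shared-factor (collective) matrix factorization: a sparse rating matrix and a sparse review-helpfulness matrix are factorized jointly, and the item factors are shared so that observed helpfulness votes can inform rating prediction. The function name `fit_shared_factors`, the weight `aux_weight`, the toy data, and all hyperparameters are hypothetical choices made for illustration.

```python
import numpy as np


def fit_shared_factors(R, mask_R, H, mask_H, rank=2, aux_weight=0.5,
                       reg=0.05, lr=0.01, epochs=300, seed=0):
    """Jointly fit R ~ U @ V.T and H ~ W @ V.T with shared item factors V.

    Assumption (not from the paper): squared error on observed entries only,
    L2 regularization, and plain full-batch gradient descent.
    """
    rng = np.random.default_rng(seed)
    n_users, n_items = R.shape
    U = 0.1 * rng.standard_normal((n_users, rank))  # user factors for ratings
    W = 0.1 * rng.standard_normal((n_users, rank))  # user factors for helpfulness
    V = 0.1 * rng.standard_normal((n_items, rank))  # item factors, shared by both
    losses = []
    for _ in range(epochs):
        # Residuals restricted to observed entries via the 0/1 masks.
        err_R = mask_R * (U @ V.T - R)
        err_H = mask_H * (W @ V.T - H)
        loss = (0.5 * np.sum(err_R ** 2)
                + 0.5 * aux_weight * np.sum(err_H ** 2)
                + 0.5 * reg * (np.sum(U ** 2) + np.sum(W ** 2) + np.sum(V ** 2)))
        losses.append(loss)
        # Gradients of the loss above; V receives signal from both matrices.
        grad_U = err_R @ V + reg * U
        grad_W = aux_weight * (err_H @ V) + reg * W
        grad_V = err_R.T @ U + aux_weight * (err_H.T @ W) + reg * V
        U -= lr * grad_U
        W -= lr * grad_W
        V -= lr * grad_V
    return U, W, V, losses


# Toy data (hypothetical): 4 users x 5 items, 0 marks a missing rating.
R = np.array([[5, 3, 0, 1, 0],
              [4, 0, 0, 1, 2],
              [1, 1, 0, 5, 0],
              [0, 1, 5, 4, 0]], dtype=float)
mask_R = (R > 0).astype(float)

# Helpfulness as a fraction of helpful votes; 0.0 can be a real value,
# so the observed entries are given by an explicit mask.
H = np.array([[0.9, 0.4, 0.0, 0.2, 0.0],
              [0.8, 0.0, 0.7, 0.0, 0.5],
              [0.1, 0.0, 0.0, 0.9, 0.6],
              [0.0, 0.3, 0.8, 0.7, 0.0]])
mask_H = np.array([[1, 1, 0, 1, 0],
                   [1, 0, 1, 0, 1],
                   [1, 0, 0, 1, 1],
                   [0, 1, 1, 1, 0]], dtype=float)

U, W, V, losses = fit_shared_factors(R, mask_R, H, mask_H)
pred = U @ V.T  # dense rating estimates, including the missing cells
print("first loss:", losses[0], "last loss:", losses[-1])
```

In this sketch, item 3 (third column) has only one observed rating but two observed helpfulness entries, so its shared factor row in `V` is shaped by both sources. That is the sense in which auxiliary helpfulness data can ease sparsity under this assumed formulation.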

Keywords:
fusion, transfer learning, sparsity, helpfulness

This work is licensed under a Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
