International Transaction of Electrical and Computer Engineers System
ISSN (Print): 2373-1273 ISSN (Online): 2373-1281 Website: http://www.sciepub.com/journal/iteces Editor-in-chief: Dr. Pushpendra Singh, Dr. Rajkumar Rajasekaran
Open Access
Journal Browser
Go
International Transaction of Electrical and Computer Engineers System. 2017, 4(1), 14-25
DOI: 10.12691/iteces-4-1-3
Open AccessArticle

An Experiential Study of the Big Data

Yusuf Perwej1,

1Department of Information Technology, AI Baha University, Al Baha, Kingdom of Saudi Arabia (KSA)

Pub. Date: March 24, 2017

Cite this paper:
Yusuf Perwej. An Experiential Study of the Big Data. International Transaction of Electrical and Computer Engineers System. 2017; 4(1):14-25. doi: 10.12691/iteces-4-1-3

Abstract

The intention of this paper is to evoke discussion rather than to provide an experiential extensive survey of big data research. The Big data is not a single technology but an amalgamation of old and new technologies that assistance companies gain actionable awareness. The big data are vital because it empowers organizations to congregate, store, manage, and manipulate countless amounts data at the pertinent speed, at the pertinent time, to gain the pertinent intuition. Eventually big data solutions and practices are typically essential when eternal data processing, analysis and storage technologies and techniques are inadequate. In particular, big data addresses detached requirements, in other words the amalgamate of multiple un-associated datasets, processing of huge amounts of amorphous data and harvesting of unseen information in a time-sensitive genre. In this paper, aimed to demonstrate a close-up view about big data, including big data concepts, security, privacy, data storage, data processing, and data analysis of these technological developments, we also brief description about the characteristic of big data, big data techniques, technologies and tools emphasizes critical points on these issues.

Keywords:
datasets big data differential privacy anonymization diagnostic analytics data storage

Creative CommonsThis work is licensed under a Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

References:

[1]  Gandomi, A., & Haider, M. Beyond, “The hype: big data concepts, methods, and analytics,” International Journal of Information Management, 35(2), 137-144, (2015).
 
[2]  Sagiroglu, S.; Sinanc, D., “Big Data: A Review”, 20-24 May 2013.
 
[3]  S. Kaisler, F. Armour, J.A. Espinosa, and W. Money, “Big Data: issues and challenges moving forward,” in: Proceedings of the 46th IEEE Annual Hawaii international Conference on System Sciences (HICC 2013), Grand Wailea, Maui, Hawaii, January 2013, pp. 995-1004.
 
[4]  Garlasu, D.; Sandulescu, V. ; Halcu, I. ; Neculoiu, G., “A Big Data implementation based on Grid Computing”, Grid Computing, 17-19 Jan. 2013.
 
[5]  Mukherjee, A.; Datta, J.; Jorapur, R.; Singhvi, R.; Haloi, S.; Akram, W., “Shared disk big data analytics with Apache Hadoop”, 18-22 Dec., 2012.
 
[6]  X. Wu, X. Zhu, G. Q. Wu, and W. Ding, “Data Mining with Big Data”, IEEE Transactions on Knowledge and Data Engineering, 26(1) (2014) 97-107.
 
[7]  Mayer-Sch¨onberger V, Cukier K, “Big data: a revolution that will transform how we live, work, and think” Eamon Dolan/Houghton Mifflin Harcourt, 2013.
 
[8]  O. R. Team, “Big data now: current perspectives from OReilly Radar”, OReilly Media, 2011.
 
[9]  Manyika J, McKinsey Global Institute, Chui M, Brown B, Bughin J, Dobbs R, Roxburgh C, Byers AH (2011) “Big data: the next frontier for innovation, competition, and productivity”, McKinsey Global Institute.
 
[10]  Yuki Noguchi, “Following Digital Breadcrumbs To 'Big Data' Gold”, November 29, 2011.
 
[11]  Sagiroglu, S.; Sinanc, D., (20-24 May 2013), “Big Data: A Review”.
 
[12]  Mayer-Schonberger V, Cukier K. “Big data: a revolution that will transform how we live, work, and think”, Boston: Houghton Mifflin Harcourt; 2013.
 
[13]  K. Bakshi, "Considerations for Big Data: Architecture and Approach", Aerospace Conference IEEE, Big Sky Montana, March 2012.
 
[14]  Fisher D, DeLine R, Czerwinski M, Drucker S. “Interactions with big data analytics”, Interactions. 2012; 19(3):50-9.
 
[15]  Russom P.,” Big data analytics” TDWI: Tech. Rep ; 2011.
 
[16]  J. Fan, F. Han, H. Liu, “Challenges of big data analysis”, National Science Review, 1 (2) (2014), pp. 293-314.
 
[17]  Nyce, Charles (2007), “Predictive Analytics White Paper (PDF)”, American Institute for Chartered Property Casualty Underwriters/Insurance Institute of America.
 
[18]  A. Labrinidis, H.V. Jagadish,” Challenges and opportunities with big data”, Proceedings of the VLDB Endowment, 5 (12) (2012), pp. 2032-2033.
 
[19]  Laney D. “3D data management: controlling data volume, velocity, and variety”, META Group, Tech. Rep. 2001.
 
[20]  Changqing Ji, Yu Li, Wenming Qiu, Uchechukwu Awada, Keqiu Li,” Big Data Processing in Cloud Computing Environments”, International Symposium on Pervasive Systems, Algorithms and Networks, 2012.
 
[21]  Russom, P.. “Big Data Analytics” In: TDWI Best Practices Report, pp. 1-40 (2011).
 
[22]  Cloud Security Alliance Big Data Working Group, “Expanded Top Ten Big Data Security and Privacy Challenges”, 2013.
 
[23]  A.A. Cardenas, P.K. Manadhata, S.P. Rajan, “Big Data Analytics for Security”, IEEE Security & Privacy, vol. 11, issue 6, pp. 74-76, 2013.
 
[24]  T. Omer, P. Jules, “Big Data for All: Privacy and User Control in the Age of Analytics”, Northwestern Journal of Technology and Intellectual Property, article 1, vol. 11, issue 5, 2013.
 
[25]  De Cristofaro, E., Soriente, C., Tsudik, G., & Williams, A. “Hummingbird: Privacy at the time of twitter. In Security and Privacy (SP)”, 2012 IEEE Symposium on (pp. 285-299), 2012.
 
[26]  Li N, et al. “t-Closeness: privacy beyond k-anonymity and L-diversity”, In: Data engineering (ICDE) IEEE 23rd international conference; 2007.
 
[27]  Machanavajjhala A, Gehrke J, Kifer D, Venkitasubramaniam “M. L-diversity: privacy beyond k-anonymity” In: Proc. 22nd international conference data engineering (ICDE); 2006. p. 24.
 
[28]  Mehmood A, Natgunanathan I, Xiang Y, Hua G, Guo S. “Protection of big data privacy” In: IEEE translations and content mining are permitted for academic research. 2016.
 
[29]  Sweeney L. “K-anonymity: a model for protecting privacy”, Int J Uncertain Fuzz. 2002; 10(5): 557-70.
 
[30]  Xu L, Jiang C, Wang J, Yuan J, Ren Y. “Information security in big data: privacy and data mining”, IEEE Access. 2014; 2: 1149-76.
 
[31]  Sedayao J, Bhardwaj R. “Making big data, privacy, and anonymization work together in the enterprise experiences and issues”, Big Data Congress; 2014.
 
[32]  Zhang, Xiaoxue Xu, Feng, (2-4 Sep. 2013), “Survey of Research on Big Data Storage”.
 
[33]  Zhang, Y., Feng, H., Hao, W., et al.: “Research on the storage of file big data based on NoSQL”, Manufact. Autom. 6, 27-30 (2014).
 
[34]  Ji, C., Li, Y., Qiu, W., Awada, U., and Li, K. (2012) “Big data processing in cloud computing environments. Pervasive Systems”, Algorithms and Networks (ISPAN), 2012 12th International Symposium on, pp. 17-23, IEEE.
 
[35]  Bu, Y., Howe, B., Balazinska, M., and Ernst, M. (2010). “Haloop: Efficient iterative data processing on large clusters”, Proceedings of the VLDB Endowment, 3, 285-296.
 
[36]  Hu, H., et. al. (2014). “Toward scalable systems for Big Data analytics: A technology tutorial” Access IEEE, 2, 652-687.
 
[37]  Purcell, B. “The emergence of big data technology and analytics” Journal of Technology Research, Holy Family University, pp. 1-6.
 
[38]  Yang, X., & Sun, J. (2011). “An analytical performance model of MapReduce” In IEEE Conference on Cloud Computing and Intelligence Systems (CCIS), pp. 306-310.
 
[39]  R. Agrawal, T. Imielinski, and A. Swami, “Mining association rules between sets of items in large databases,” SIGMOD Conference 1993: 207-16; P. Hajek, I. Havel, and M. Chytil, “The GUHA method of automatic hypotheses determination,” Computing 1(4), 1966; 293-308.
 
[40]  Jeff Howe, “The Rise of Crowdsourcing,” Wired, Issue 14.06, June 2006.
 
[41]  C.L. Philip Chen, Chun-Yang Zhang, “Data intensive applications, challenges, techniques and technologies: A survey on Big Data”, Information Sciences, www.elsevier.com/locate/ins, January 2014.
 
[42]  Jeffrey Dean and Sanjay Ghemawat, “MapReduce: Simplified data processing on large clusters,” Sixth Symposium on Operating System Design and Implementation, San Francisco, CA, December 2004.
 
[43]  Almeida, Fernando; Santos, Mário. 2014. “A Conceptual Framework for Big Data Analysis. In Organizational, Legal, and Technological Dimensions of Information System Administration”, ed. Irene Maria Portela, Fernando Almeida, 199-223. ISBN: 9781466645264. USA: IGI Global.