ISSN 2348-1218 (print) International Journal of Interdisciplinary Research and Innovations ISSN 2348-1226 (online) Vol. 9, Issue 1, pp: (29-32), Month: January - March 2021, Available at: www.researchpublish.com
BIG DATA REVIEW BASED ON SECURITY CHALLENGES AND PRIVACY ISSUES
Yazeed Al Moaiad 1, Wafa Al-Haithami 2
Al-Madinah International University, Malaysia
1 yazeed.alsayed@mediu.edu.my
2 wafaaalhithmy@yahoo.com
Abstract: This paper concerns the trend of gathering, storing, and managing high-volume data sets, known as big data. It introduces the term and reviews some basics and characteristics of big data, followed by security challenges and privacy issues. The researchers also discuss the three Vs of big data (Velocity, Volume, and Variety), together with two more recently added Vs (Veracity and Value).
Keywords: Big Data, Hadoop, Apache, Velocity, Volume, Variety.
I. INTRODUCTION
We are entering an age of speed in all aspects of life, in which a large variety of data is captured and saved from many different devices. Processing this data by traditional methods becomes nearly impossible, which limits how much of it can be collected and analyzed to produce specific insights. Relational database engines cannot process such data because it may be disorganized, time-sensitive, and, most importantly, very large. This data therefore requires a different processing method in terms of capacity and approach, and it has come to be called "big data" [1]. The term describes the collection and storage of large volumes of data, which may be either organized or unorganized. Processing big data helps in making better decisions and in defining the strategic goals of a business. Greater knowledge of a subject also increases reliability, yields new perspectives and insights, and helps to better anticipate the decisions to be made later. Comparing data items and the relationships between them makes learning more efficient and supports appropriate, smarter decisions. For companies, then, what matters is not the size of the data but what they do with it after processing, because in its initial state data has no value until it is processed with the right methods and tools. The goal is to turn big data from its raw form into a mature, usable idea, and this is where the problem with big data lies.
II. LITERATURE REVIEW
According to [2], the term "big data" is relatively new, although professionals in this field have been collecting, storing, and analyzing large volumes of data for a long time. The term gained momentum when industry analyst Doug Laney defined big data as a concept with three components: volume, variety, and velocity. In the past few years, the author of [3] added two further elements, namely value and veracity. The author of [4] discussed the term "big data" from a particular angle, noting that the concept may have been around for quite some time, but there was, and still is, some ambiguity and confusion as to what exactly the term means. He pointed out that the concept is developing over time, so what it means should be reconsidered, as it remains the