Challenges arise of Privacy Preserving Big Data Mining Techniques

Page 1

International Research Journal of Engineering and Technology (IRJET)

e-ISSN: 2395 -0056

Volume: 04 Issue: 05 | May -2017

p-ISSN: 2395-0072

www.irjet.net

Challenges arise of Privacy Preserving Big Data Mining Techniques M.Chalapathi Rao1, A.Kiran Kumar2 1

Assistant Professor, Dept of CSE, CMR Technical Campus, Telangana, India.

2

Assistant Professor, Dept of CSE, CMR Technical Campus, Telangana, India.

---------------------------------------------------------------------***---------------------------------------------------------------------

Abstract - Big data is being generated from various

sources-transactions, social media, sensors, digital images, videos, audios and for domains including healthcare etc this data is known as big data. The characteristics of Big Data include 3 Vs they are Volume, Velocity and Variety. Useful data can be extracted from this big data with the help of data mining techniques The massive volume of big data sets are too complicated to be managed and processed by conventional relational databases the term “Big Data” was coined to address this massive volume of data storage and processing. The quality of captured data can vary greatly, affecting accurate analysis. Protecting privacy is mechanism for data processing and producing right information to favor corporate sectors, business managers, stake holders and other users make highly informed business decisions. In this paper we are proposing a big data on privacy preserving Big Data. Key Words: BI, Velocity, Volume, Variety.

1. INTRODUCTION A combination of policy decisions, technical and legal mechanisms are use to address privacy concerns. A brief description of some of the major principles for protecting the privacy of data in its lifecycle is given below Data collection and limitations: This principle limits the unnecessary excessive collection of personal data. Once the purpose for which data is collected is known, collected data should be just sufficient enough for that purpose. This principle is clearly a policy decision on the part of collector. Usage limitations: While collecting sensitive or personal data, collector needs to specify what for and how the data is used and limit the usage of collected data for other purposes than the original one. Security of data It is an obligation of data collector to keep the data safe once collected. Adequate security mechanism should be in place to protect it from breaches. Transfer Policy Often the usage of data is governed by laws which are prevalent at the place of collection and usage. If the data is moved outside the jurisdiction where the law enforcement is not prevalent in the new place it carries the danger of misuse. Accountability when dealing with third party data, the party may ask to designate a person who is the point of contact and take processing and usage of data. Collection limitation is the policy decision on part of data collector usage limitation, securing data and transfer policy can be addressed by technical means and the last one, accountability is addresses by having a legal team sign a declaration. © 2017, IRJET

|

Impact Factor value: 5.181

|

2. BIG DATA There are various Challenges that companies are facing in identifying and processing value from the “Big Data” .It is a big data challenge to preserve the privacy of end users at various stages of data life cycle. Today we live in the computerized world with drastic digitization the volume of structured and unstructured data being generated and stored in exploding. The data which is proposed is being taken from various origins In addition to business and companies, individuals spend to the data volume. For particular 40billion content are being distributed on face book every month .Big data is massive volume of both structured and unstructured data from various origins such as social data, machine generated data, traditional enterprise which is so large that it is complex to process with traditional database and software techniques. Big Data is data who’s metric, diversity, and complexity require new framework, techniques, algorithms, and analytics to manage it and fetch value and hidden knowledge from it. Characteristics of Big Data include 3 Vs. They are Volume, Velocity, Variety and Veracity.

3. BIG DATA CHALLENGES Here we have several challenges that companies are facing in identifying, processing and fetching value from the “Big Data”. As in the following some of the key challenges include 1.

Preserving Privacy

2.

Integrating the Big Data Technologies

3.

Proposing real time needs with higher data volumes and varieties

4.

Data maintenance and management

5.

Data distribution and Analytical systems.

4. PRIVACY PRESERVING POLICIES 4.1 Anonymization Techniques Substitute delicate attribute values with some other values. This decreases disclosure of private data. In some situations, this clear substitution alone wills not sophisticated anonymization techniques need to be worked. ISO 9001:2008 Certified Journal

|

Page 234


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.