International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395 -0056
Volume: 04 Issue: 05 | May -2017
p-ISSN: 2395-0072
www.irjet.net
A Survey on: Challenges to Data Analytic in HealthCare Sector Adarsha S P1, Dr. K. Thippeswamy2 Dept. of Studies in Computer Science and Engineering, VTU Postgraduate Centre, Mysuru, Karnataka, India Dept. of Studies in Computer Science and Engineering, VTU Postgraduate Centre, Mysuru, Karnataka, India ---------------------------------------------------------------------***--------------------------------------------------------------------1
2Professor,
Abstract - In nowadays, Healthcare has become an evolving and emerging concept in research and development area. There is a huge amount of data is generating in a field of Healthcare. Use of Big data and data mining technique made an easy way to store such large amount of data. In the present era, big data has become a very evolving concept. The data that generate in a hospital is sometimes it might be unstructured data, structured data and semistructured data. The data is collected through a mobile device, smart wearable device. There are many hospital which uses a bid data technique to store the data. The data of patient is stored in an Electronic Health Record (HER). The generated data is to be stored in Hadoop Distributed File System (HDFS) via MapReduce. There are several challenges in the Healthcare analytics such as bioinformatics and cancer treatment. We also have to give an importance to the data security.
2. Big data technology In Biomedical researchers are facing new challenges of storing, managing, and analyzing huge amounts of datasets. The characteristics of big data require more powerful technologies to extract the useful information and enable more broad-based health-care solutions. 1.1 Parallel computing In present era, parallel computing models, such as MapReduce [2] by Google, have been proposed for a new big data infrastructure. More recently, an open-source MapReduce package called Hadoop [3] was released by Apache for distributed data management. The Hadoop Distributed File System (HDFS) supports concurrent data access to clustered machines.
Keywords—Big data, Healthcare, Electronic Health Record(HER), Mapreduce, HDFS, Data Security
1.2 Cloud Computing. Cloud computing [4] is based on internet computing that provides shared a computer processing resources and data to computers and other devices on demand. Cloud computing is a novel model for sharing configurable computational resources data over the network and can serve as infrastructure, platform, and/or software for providing an integrated data solution. Furthermore, cloud computing can be used for improve system speed, agility, and flexibility of data because it reduces the need to maintain hardware or software capacities and requires fewer resources for system maintenance, such as installation, configuration, and testing. There many new big data applications are based on cloud computing technologies.
1. INTRODUCTION In huge information space, health care could be a new paradigm and a system that transforms a case studies in analysis space like giant scale and information driven. It had been a wide accepted space as a result of the characteristics of huge information is outlined by Volume, Variety, and Speed. The term Analytics of HealthCare is employed to explain analyzing patient health care activities which may be thought of because the part of results of information collected among healthcare; claims and price information, and analysis and development (R&D) information, clinical information and patient behavior and sentiment information (patient behaviors and preferences [1]. Analytics of Health Care could be a quick growing trade within the World.
2 METHODOLOGY Data Analytics helps healthcare insurance corporations realize other ways to spot and stop fraud at associate degree earlystage. Using Hadooptechnology,
Š 2017, IRJET
|
Impact Factor value: 5.181
|
ISO 9001:2008 Certified Journal
|
Page 1305