Data analysis is one amongst the hard tasks within the coinciding world. the dimensions of knowledge is increasing at
a awfully high rate owing to the sexual practice of peripatetic gadgets and sensors connected. to create that information legible is
another difficult task. The dataset contains sure columns like State, District, year, Rape, dowry Deaths, snatch and Abduction,
Assault on girls, Insult to modesty, Cruelty by husband and importation of ladies. This analysis is extremely vital to grasp within
which state or within which District most variety of cases happened and within which Year what proportion specific offense
happened and plenty of a lot of. to try to to this analysis we tend to used Hadoop, Pig, Hive, etc. Hadoop is one amongst the most
effective and economical tools to figure on an outsized dataset, we are able to conjointly work on live information sets like twitter
data by mistreatment Hadoop.