International Research Journal of Engineering and Technology (IRJET)
e-ISSN: 2395 -0056
Volume: 04 Issue: 03 | Mar -2017
p-ISSN: 2395-0072
www.irjet.net
Document Recovery from Degraded Images 1
Jyothis T S, 2Sreelakshmi G, 3Poornima John, 4Simpson Joseph Stanley, 5Snithin P R, 6Tara Elizabeth Paul 1AP,
CSE Department, Jyothi Engineering College, Kerala, India
2 3 4 5 6 Students,
CSE Department, Jyothi Engineering College, Kerala, India ---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - Recovery of document from its damaged
paper works. In such cases there is an essentiality for a system that can help read all these degraded documents.
fragments plays an important role in the field of forensics and archival study. Also, now-a-days, there are many activities which depend upon the internet.. Many a times it happens that institutes and organizations have to maintain the books for a longer time span. Books being a physical object, so it will definitely have the issues of wear and tear. The pages definitely get degraded and so does the text on the pages. Due to this degradation many of the document images are not in readable. So, there is a need to separate out text from those degraded images and preserve them for future reference. This paper introduces a method for accomplishing the task of recovering the contents from the degraded papers. The image is converted to contrast image, whose difference in luminance makes an object clear. The edges are detected which is then binarized. The segmentation of document text is carried out by a local Threshold which is estimated based on the intensities of detected edge strokes. Experiments are carried out on several challenging bad quality document images which show the best performance of the proposed system within a shorter period of time.
(a)
Key Words: Image contrast, Binarization, Edge Detection, Pixel classification.
1. INTRODUCTION Recovery of degraded documents has always been a challenge to people. There are many situations where paper documents become a crucial part. Recovering the paper documents plays an important role in forensics and archival studies. Such situation needs an efficient solution to get the exact contents of the paper documents. Now-a-days everything being digitized it is really hard to convert old paper works to computerized one’s. It happens many a times that many organizations and instituted store their record works in paper books and with time it would have been severely spoiled. There also Exists situations where people try it hard to read the contents being written on the old Š 2017, IRJET
|
Impact Factor value: 5.181
|
(b) Fig.1 Degraded document image example. An optimal solution for eliminating these problems is to use binarization technique which converts grayscale document images to binary document image. The image is initially converted to contrast image which helps
ISO 9001:2008 Certified Journal
|
Page 2337