Reduction of Noises from Degraded Document Images

Abstract: Evolution of digital devices and computers makes an increasing attraction in document image analysis. Many of the paper documents have been transferred and stored using digital devices in large manner. In this work we have done image enhancement techniques to reduce the noises from degraded document images. Here we have taken sample images from Document Image Binarization Contest (DIBCO) dataset images. We have done contrast stretching, histogram equalization, noise filtering, laplacian transformation, global and local thresholding methods to remove show-through noise, un even illumination noise, shot noise from degraded document images using OpenCV open source software and performance metrics had taken for the above methods.

Don't use plagiarized sources. Get Your Custom Essay on
Reduction of Noises from Degraded Document Images
Just from $13/Page
Order Essay

Keywords- Document Image Enhancement, Contrast stretching, Histogram equalization, Laplacian transformation, Thresholding.

I.          Introduction

The study of image processing is an interesting topic and has various applications in different fields. One of the applications is document image analysis. Now the world is very fastly changed to digitization. Here the information is also digitized. Documents are a universal communicating way in our day to day life. Image processing techniques are very largely useful part in document image analysis (DIA) to transmit, process, store, analyse, enhancing and recognizing the document images [1]. At different stages, these image processing methods are used to de-noise the document images from noise affected images. Degraded images are affected by uneven illumination noise, show-through noise, salt and pepper noise etc. To reduce this noise contrast stretching, histogram equalization, noise filtering, laplacian transformation, thresholding methods of image processing techniques are discussed here.

II.         Related works

Document Image Enhancement is the process which is essential to have a uniform background and a good quality of printed/handwritten document images. This process aims to enhance the readable format of text in document and it allows reducing the noise. The major artifacts takes part in degraded document images are low contrast and uneven background illumination, show-through and shadow-through effects, damaged characters and noisy black borders [2]. To rectify this type of noises from documents; some image processing techniques are done using OpenCV software.

Leung et al [5], proposed a contrast enhancement method to increase the readability of text and histogram equalization method to reduce background noise from degraded document images. Deivalakshmi et al [6], proposed a median filter method to reduce the salt and pepper noise from images. Otsu [7] proposed a global threshold method by using 0th and 1st order cumulative movements of gray levels histogram. A modified Otsu’s method is approached by Cheriet et al [8] which is based on recursive application. Feng et al [9] approached a local thresholding concept to overcome the difficulties encountered in low contrast images and non-uniform illumination and random noise. Here the threshold determined by computing the local mean, minimum and variance of local window. Firdousi et al [10] provide a various type of local thresholding methods are used in document image binarization. And they explained popular methods of Sauvola’s technique, Niblack’s technique and Bernsen’s technique etc.

III.        Proposed Method

A.    Contrast Stretching

Contrast stretching is used here to reduce the show-through noise from document images. Show-through noise affected image is shown in the Fig1 which is taken from DIBCO dataset. Back side of the document information is interfered with front side of the page is known as show-through effect. Contrast stretching method is used to enhance the image by increasing or decreasing the intensity values. Usually background pixels are lower intensity values than the foreground pixels.

To adjust the brightness of an image, by increasing or decreasing the output pixel value. By adding a constant value with the input pixel value it gives high contrast image and subtracting a constant value with the input pixel it gives low contrast image. Equation for adjusting the brightness and contrast adjustment is mentioned below.

                

Where a and b are arbitrary constants which control the brightness and contrast [3].

B.    Noise Filtering

Noise filtering method is used to filter out the unwanted information from an image. In this project, we implemented median filter for the document image enhancement application.

The median filter method is very well suited in removing “salt and pepper noise” or “shot noise”. This noise represents as aimlessly occurring white and black pixels here and there of an image. Here it is used to smoothing the degraded document image. It is also called rank filter and works based on re-ordering mechanism.

Consider a 3×3 kernel in the matrix image. The Median value is calculated by arranging the neighborhood pixel values in numerical order, and select the middle pixel from the sorted list, then replace the pixel value into an output image. The noisy input image and smoothened median filtered output image is shown in the Fig.4. Compared with mean filter and median filter is good at preserving the edge pixels and higher kernel value will produce good smoothing.

C.    Histogram Equalization

In image enhancement technique, histogram plays an important role. It represents the characteristics plot of an image. If the histogram alters, image characteristics can also change. To remove the uneven illumination noise in document images histogram equalization is used here.

Histogram equalization alters the contrast of an image and non-linear stretch out this intensity range. Equalization maps narrow distribution of intensity values to wider distribution of intensity values. Noisy input image and equalized output image of histogram is shown in the Fig7 and Fig8.

D.   Laplacian Transformation

The Laplacian of an image is denoted by f  and is given by the below equation2. Laplacian operator is 2nd order derivative operator. To find out the edges in document images here we used Laplacian transformation operation. Degraded original document image is converted into gray scale image. Then the noise is removed using Gaussian smoothing filter. The output of an image is convoluted with the Laplacian mask and is shown in the Fig9. It is also named as zero crossing detector.

                            

E.    Thresholding

One of the main techniques used in image segmentation is thresholding. It is a process of converting grey-level (which contains 256 intensity values) to a binary level image (which contains 2 intensity values black and white) by selecting a single threshold value.

1)       Global thresholding: This method creates binary images from grey level ones by turning all pixels to below the threshold value to zero and above the threshold value to one. Global thresholding method is a very fastest technique compared to other methods but it is not suitable for all type of document images in particular which document images contains picture object and text. Global method is expressed by the equation is below mentioned.

Where f(x, y) is the input image, t is the threshold value and T(x,y) is the threshold image.    

2)       Local Thresholding: It is also called as adaptive thresholding which selects an particular threshold value automatically for an image by each pixel based on the range of intensity values in its neighbourhood pixels. It examines the relationships between brightness of neighbouring pixels to adapt the thresholding according to the intensity statistics.

IV.        Results and Discussion

We have presented document noise reduction methods from degraded document images using image enhancement techniques like contrast enhancement, histogram equalization, median filtering, Laplace transformation and thresholding. In contrast enhancement, high contrast image gives the better results compared to low contrast image. For smoothing the degraded document image, here we used median filter. It can also use to reduce the salt and pepper noise and shot noise from document images. Histogram equalization method is used to reduce the non-uniform illumination. In Global thresholding, we manually selected the threshold value as 150, but in local thresholding it automatically selects the threshold value according to the input pixels.

Figure1. Original Show-through noised image (DIBCO-2014)

Figure2. High Contrast Image

Figure3. Low Contrast Image

     

Figure4. Median Filter Input and Output Image

Figure5. Noisy Input Image (DIBCO_2014)

Figure6. Histogram Equalized Output Image

Figure7. Histogram plot of Input Image

Figure8. Histogram plot of Equalized Output Image

Figure9. Laplacian Transformation Output Image

Figure10. Global Thresholding Output Image

Figure11. Local Thresholding Output Image

Based on the above proposed image enhancement techniques, the performance metrics measure such as Mean Square Error (MSE), Peak signal to Noise Ratio (PSNR), Normalized Absolute Error (NAE) and Normalized Cross Correlation (NCC) has been carried out in the Table 1. It explains the better results obtained for de-noised document images.

TABLE I.              Performance metrics measures for de-noised document images using Opencv

Proposed Methods

Performance metrics

Normalized Cross Correlation(NCC)

Mean Squared Error(MSE)

Peak Signal to Noise Ratio(PSNR)

Normalized Absolute Error(NAE)

High Contrast

1

0

99

0

Global threshold

1

10.2965

38.0039

0.0165

Adaptive Threshold

1

11.6172

37.4798

0.0198

Median Filter

1

22.3331

34.6413

0.0192

Histogram Equalized

1.0372

191.0293

25.3198

0.4872

Low Contrast

1.0043

254.9990

24.0654

0.5370

Laplacian transform

5.0212

250.8787

24.1362

37.2466

Figure12. Histogram Plot of performance metrics measures for proposed methods.

V.    Conclusion

Conclusion of this work reveals that, using image enhancement techniques, we reduced the show-through noise, uneven illumination noise from document images and achieved high PSNR values for high contrast enhancement, global and local thresholding methods and median filtering methods.

 

References

[1]      R.C. Gonzalez, R.E. Woods, “Digital Image Processing”, 3rd ed., Upper Saddle River, N.J.: Prentice Hall, 2008.

[2]      D. Doermann and K. Tombre, Handbook of Document Image Processing and Recognition. London: Springer London, 2014.

[3]      D. G. Bailey, Design for embedded image processing on FPGAs. Singapore: John Wiley & Sons (Asia), 2011.

[4]      R. Szeliski, Computer vision algorithms and applications. London: Springer, 2011.

[5]      C.-C. Leung, K.-S. Chan, H.-M. Chan, and W.-K. Tsui, “A new approach for image enhancement applied to low-contrast–low-illumination IC and document images,” Pattern Recognition Letters, vol. 26, no. 6, pp. 769–778, 2005.

[6]      S. Deivalakshmi, S. Sarath, and P. Palanisamy, “Detection and removal of Salt and Pepper noise in images by improved median filter,” 2011 IEEE Recent Advances in Intelligent Computational Systems, pp. 363–368, 2011.

[7]      N. Otsu, “A Threshold Selection Method from Gray-Level Histograms,” IEEE Transactions on Systems, Man, and Cybernetics, vol. 9, no. 1, pp. 62–66, 1979.

[8]      M. Cheriet, J. Said, and C. Suen, “A recursive thresholding technique for image segmentation,” IEEE Transactions on Image Processing, vol. 7, no. 6, pp. 918–921, 1998.

[9]      M.-L. Feng and Y.-P. Tan, “Contrast adaptive binarization of low quality document images,” IEICE Electronics Express, vol. 1, no. 16, pp. 501–506, 2004.

[10]   R. Firdousi, S. Parveen, “Local Thresholding Techniques in Image Binarization,” Internation Journal of Engineering and Computer Science, vol. 3, no. 3, pp. 4062-4065, 2014.

What Will You Get?

We provide professional writing services to help you score straight A’s by submitting custom written assignments that mirror your guidelines.

Premium Quality

Get result-oriented writing and never worry about grades anymore. We follow the highest quality standards to make sure that you get perfect assignments.

Experienced Writers

Our writers have experience in dealing with papers of every educational level. You can surely rely on the expertise of our qualified professionals.

On-Time Delivery

Your deadline is our threshold for success and we take it very seriously. We make sure you receive your papers before your predefined time.

24/7 Customer Support

Someone from our customer support team is always here to respond to your questions. So, hit us up if you have got any ambiguity or concern.

Complete Confidentiality

Sit back and relax while we help you out with writing your papers. We have an ultimate policy for keeping your personal and order-related details a secret.

Authentic Sources

We assure you that your document will be thoroughly checked for plagiarism and grammatical errors as we use highly authentic and licit sources.

Moneyback Guarantee

Still reluctant about placing an order? Our 100% Moneyback Guarantee backs you up on rare occasions where you aren’t satisfied with the writing.

Order Tracking

You don’t have to wait for an update for hours; you can track the progress of your order any time you want. We share the status after each step.

image

Areas of Expertise

Although you can leverage our expertise for any writing task, we have a knack for creating flawless papers for the following document types.

Areas of Expertise

Although you can leverage our expertise for any writing task, we have a knack for creating flawless papers for the following document types.

image

Trusted Partner of 9650+ Students for Writing

From brainstorming your paper's outline to perfecting its grammar, we perform every step carefully to make your paper worthy of A grade.

Preferred Writer

Hire your preferred writer anytime. Simply specify if you want your preferred expert to write your paper and we’ll make that happen.

Grammar Check Report

Get an elaborate and authentic grammar check report with your work to have the grammar goodness sealed in your document.

One Page Summary

You can purchase this feature if you want our writers to sum up your paper in the form of a concise and well-articulated summary.

Plagiarism Report

You don’t have to worry about plagiarism anymore. Get a plagiarism report to certify the uniqueness of your work.

Free Features $66FREE

  • Most Qualified Writer $10FREE
  • Plagiarism Scan Report $10FREE
  • Unlimited Revisions $08FREE
  • Paper Formatting $05FREE
  • Cover Page $05FREE
  • Referencing & Bibliography $10FREE
  • Dedicated User Area $08FREE
  • 24/7 Order Tracking $05FREE
  • Periodic Email Alerts $05FREE
image

Our Services

Join us for the best experience while seeking writing assistance in your college life. A good grade is all you need to boost up your academic excellence and we are all about it.

  • On-time Delivery
  • 24/7 Order Tracking
  • Access to Authentic Sources
Academic Writing

We create perfect papers according to the guidelines.

Professional Editing

We seamlessly edit out errors from your papers.

Thorough Proofreading

We thoroughly read your final draft to identify errors.

image

Delegate Your Challenging Writing Tasks to Experienced Professionals

Work with ultimate peace of mind because we ensure that your academic work is our responsibility and your grades are a top concern for us!

Check Out Our Sample Work

Dedication. Quality. Commitment. Punctuality

Categories
All samples
Essay (any type)
Essay (any type)
The Value of a Nursing Degree
Undergrad. (yrs 3-4)
Nursing
2
View this sample

It May Not Be Much, but It’s Honest Work!

Here is what we have achieved so far. These numbers are evidence that we go the extra mile to make your college journey successful.

0+

Happy Clients

0+

Words Written This Week

0+

Ongoing Orders

0%

Customer Satisfaction Rate
image

Process as Fine as Brewed Coffee

We have the most intuitive and minimalistic process so that you can easily place an order. Just follow a few steps to unlock success.

See How We Helped 9000+ Students Achieve Success

image

We Analyze Your Problem and Offer Customized Writing

We understand your guidelines first before delivering any writing service. You can discuss your writing needs and we will have them evaluated by our dedicated team.

  • Clear elicitation of your requirements.
  • Customized writing as per your needs.

We Mirror Your Guidelines to Deliver Quality Services

We write your papers in a standardized way. We complete your work in such a way that it turns out to be a perfect description of your guidelines.

  • Proactive analysis of your writing.
  • Active communication to understand requirements.
image
image

We Handle Your Writing Tasks to Ensure Excellent Grades

We promise you excellent grades and academic excellence that you always longed for. Our writers stay in touch with you via email.

  • Thorough research and analysis for every order.
  • Deliverance of reliable writing service to improve your grades.
Place an Order Start Chat Now
image

Order your essay today and save 30% with the discount code Happy