Web-Books
in the Austria-Forum
Austria-Forum
Web-Books
Informatik
Document Image Processing
Page - 22 -
  • User
  • Version
    • full version
    • text only version
  • Language
    • Deutsch - German
    • English

Page - 22 - in Document Image Processing

Image of the Page - 22 -

Image of the Page - 22 - in Document Image Processing

Text of the Page - 22 -

J. Imaging 2018,4, 27 ThenormalizationconstantKisgivenas: K= 1 ∑x+di=x−d∑ x+d j=y−dGs(i;x, j;y)Gr[I(i, j), I(x,y)] (6) Equations (4)and(5) showthat thebilateralfilterhas threeparameters: σ2s (thefilterdomain), σ2r (thefilter range),andthe thirdparameter is thewindowsizeN×N[15]. Thegeometric spreadof thebilateral filter is controlledbyσ2s . If thevalueofσ2s is increased, more neighbours are combined in the diffusion process yielding a “smoother” image, while σ2r represents thephotometric spreading. Onlypixelswithapercentagedifferenceof less thanσ2r are processed[13]. 2.4.OtsuFiltering Afterpassing throughthebilateralfilter, the image is split into itsoriginal (non-gammacorrected) Red,GreenandBluecomponents,asshownintheblockdiagraminFigure2.Thekernelof thebilateral filter alters the balance of the colors in the original image in such away towiden thedifferences betweenthecolorof thefrontandback-to-front interference.AmodifiedversionofOtsu[16]algorithm is applied to each RGB channel using the thresholds determined by theDecisionMaking Block, whichmaybeconsideredas the“optimal” threshold for eachRGBchannel, and then threebinary imagesaregenerated. 2.5. ImageClassification Theimageclassificationblockwasalsotrainedwiththesynthetic images insuchawaytoanalyze the threebinary imagesgenerated ineachof thechannelsandoutputs theonethat isconsideredthe bestone. ThisdecisionwasalsomadebyanaïveBayesautomaticclassifierwhichwastrainedusing thecalculatedco-occurrencematrix foreachof the32,000synthetic imagesbycomparingeachof them with theoriginalgroundtruth image, theFront image. 3. ExperimentsandResults As already explained, the enormous variety of kinds of text documents makes extremely improbable that one single algorithm is able to satisfactorily binarize all kinds of documents. Depending on the nature (or degree of complexity) of the image several or no algorithm will be able to provide good results. This paper follows the assessment methodology proposed in reference [9], inwhichone compares thenumbers of backgroundand foregroundpixels correctly matchedwith aground-truth image. Twenty-three binarization algorithmswere testedusing the methodologydescribed: 1. Mello-Lins [5] 2. DaSilva-Lins-Rocha[6] 3. Otsu[16] 4. Johannsen-Bille [17] 5. Kapur-Sahoo-Wong[18] 6. RenyEntropy(variationof [18]) 7. Li-Tam[19] 8. Mean[20] 9. MinError [21] 10. Mixture-Modeling[22] 11. Moments [23] 12. IsoData [24] 13. Percentile [25] 22
back to the  book Document Image Processing"
Document Image Processing
Title
Document Image Processing
Authors
Ergina Kavallieratou
Laurence Likforman-Sulem
Editor
MDPI
Location
Basel
Date
2018
Language
German
License
CC BY-NC-ND 4.0
ISBN
978-3-03897-106-1
Size
17.0 x 24.4 cm
Pages
216
Keywords
document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
Category
Informatik
Web-Books
Library
Privacy
Imprint
Austria-Forum
Austria-Forum
Web-Books
Document Image Processing