Document Image Processing
Page - 21 -

Text of the Page - 21 -

J. Imaging 2018, 4, 27

The verso image is "blurred" by passing through two different Gaussian filters that simulate the low-pass effect of the translucency of the verso as seen from the front of the paper. Two different parameters were used to simulate two different classes of paper translucency. The "blurred" verso image is then faded with a coefficient α varying between 0 and 1 in steps of 0.01. Next, a circular shift of the lines of the document is made, of either 5 or 10 pixels, to minimize the chance that the front and verso lines coincide entirely. Finally, the two images are overlapped by performing a "darker" operation pixel by pixel.

Paper texture is added to the image to simulate the effect of document aging. The texture patterns were extracted from documents dating from the late 19th century to the year 2000. The analysis of 3450 documents representative of the wide variety of documents of that period yielded 100 different clusters of textures. The synthetic texture applied to the image to simulate paper aging is generated randomly from those 100 clusters by image quilting [11], as explained in reference [9].

The training performed in the current version of the presented algorithm was made with 16 of those 200 synthetic textures. The total number of images used for training here was thus 16 (textures), times 10 (0 < α < 1 in steps of 0.10), times 2 blur parameters for the Gaussian filters, times 100 different binary images, totaling 32,000 images. Details of the full generation process of the synthetic image database are out of the scope of this paper and may be found in reference [9].

2.3. The Bilateral Filter

The bilateral filter was first introduced by Aurich and Weule [12] under the name "nonlinear Gaussian filter". It was later rediscovered by Tomasi and Manduchi [13], who called it the "bilateral filter", which is now the most commonly used name according to reference [14]. The bilateral filter is a technique to smooth images while preserving their edges. The filter output at each pixel is a weighted average of its neighbors.
The weight assigned to each neighbor decreases with both the distance between pixels in the image plane (the spatial domain S) and the distance on the intensity axis (the range domain R). The filter applies spatially weighted averaging without smoothing the edges. It combines two Gaussian filters: one works in the spatial domain, while the other works in the intensity domain. Therefore, not only the spatial distance but also the intensity distance is important for the determination of the weights. The bilateral filter combines two stages of filtering: the geometric closeness (i.e., filter domain) and the photometric similarity (i.e., filter range) among the pixels in a window of size N × N.

Let $I(x,y)$ be a 2D discrete image of size $N \times N$, such that $(x,y) \in \{0, 1, \ldots, N-1\} \times \{0, 1, \ldots, N-1\}$. Assume that $I(x,y)$ is corrupted by additive white Gaussian noise of variance $\sigma_n^2$. For a pixel $(x,y)$, the output of a bilateral filter can be described by Equation (3):

$$I_{BF}(x,y) = \frac{1}{K} \sum_{i=x-d}^{x+d} \sum_{j=y-d}^{y+d} G_s(i;x,\, j;y)\, G_r[I(i,j), I(x,y)]\, I(i,j), \tag{3}$$

where $I(x,y)$ is the pixel intensity in the image before applying the bilateral filter, $I_{BF}(x,y)$ is the resulting pixel intensity after applying the bilateral filter, and $d$ is a non-negative integer such that $(2d+1) \times (2d+1)$ is the size of the neighborhood window. Let $G_s$ and $G_r$ be the domain and the range components, respectively, which are defined as:

$$G_s(i;x,\, j;y) = e^{-\frac{|(i-x)^2 + (j-y)^2|}{2\sigma_s^2}} \tag{4}$$

and

$$G_r(I(i,j); I(x,y)) = e^{-\frac{|I(i,j) - I(x,y)|^2}{2\sigma_r^2}} \tag{5}$$
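Equations (3) to (5) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. Two points are assumptions, because this page does not specify them: the normalization constant K is taken to be the sum of the weights in the window (the usual choice for a bilateral filter), and windows that extend past the image border are simply clipped. The function name `bilateral_filter` and its parameter names are hypothetical.

```python
import math


def bilateral_filter(image, d, sigma_s, sigma_r):
    """Apply Equations (3)-(5) to a 2D image given as a list of rows.

    d       : half-width of the (2d+1) x (2d+1) neighborhood window
    sigma_s : standard deviation of the spatial (domain) Gaussian G_s
    sigma_r : standard deviation of the intensity (range) Gaussian G_r
    """
    rows, cols = len(image), len(image[0])
    out = [[0.0] * cols for _ in range(rows)]
    for x in range(rows):
        for y in range(cols):
            weighted_sum = 0.0
            k = 0.0  # ASSUMPTION: K is the sum of all weights in the window
            # ASSUMPTION: the window is clipped at the image borders
            for i in range(max(0, x - d), min(rows, x + d + 1)):
                for j in range(max(0, y - d), min(cols, y + d + 1)):
                    # Equation (4): geometric closeness
                    g_s = math.exp(-((i - x) ** 2 + (j - y) ** 2)
                                   / (2.0 * sigma_s ** 2))
                    # Equation (5): photometric similarity
                    g_r = math.exp(-((image[i][j] - image[x][y]) ** 2)
                                   / (2.0 * sigma_r ** 2))
                    weight = g_s * g_r
                    weighted_sum += weight * image[i][j]
                    k += weight
            # Equation (3): normalized weighted average of the neighbors
            out[x][y] = weighted_sum / k
    return out


if __name__ == "__main__":
    # A vertical step edge: dark on the left, bright on the right.
    edge = [[0.0] * 3 + [100.0] * 3 for _ in range(5)]
    for row in bilateral_filter(edge, d=1, sigma_s=1.0, sigma_r=5.0):
        print([round(v, 2) for v in row])
```

With a small `sigma_r` relative to the step height, neighbors across the edge receive almost no weight, so the edge survives the smoothing. With a very large `sigma_r`, the range term is close to 1 everywhere and the filter behaves like an ordinary Gaussian blur.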
Title: Document Image Processing
Authors: Ergina Kavallieratou, Laurence Likforman-Sulem
Editor: MDPI
Location: Basel
Date: 2018
Language: German
License: CC BY-NC-ND 4.0
ISBN: 978-3-03897-106-1
Size: 17.0 x 24.4 cm
Pages: 216
Keywords: document image processing, preprocessing, binarization, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
Category: Informatik