Page - 50 - in Document Image Processing
Image of the Page - 50 -
Text of the Page - 50 -
J. Imaging 2018,4, 57
in thepresentwork,wehaveamalgamatedtheconceptof âuniformpatternsâwithRLBPtogenerate
RULBP.Theformaldeï¬nitionofRULBPisgivenbelow:
RULBP(M,R)(xcen,ycen)= {
âMn=1 f(Inâ Icenâ th), ifU(RILBP(M,R)(xcen,ycen))â¥2,
M+1, otherwise. (10)
ThevalueofU(RLBP(M,R)(xcen,ycen)) is computedusingEquation(8).
2.5.2. SelectingtheValueof th
FromEquation(9), it canbe inferredthat the threshold(th) inRLBPplaysan important roleand
whosevaluemightbeapplicationspeciï¬c tosomeextent. Thus, in thiswork,wehaveattemptedto
rationalize it in thecontextof text/non-text separation inhandwrittendocuments.
Mosthandwrittendocumentsgenerallypossessa large intensityvariationat thestroke leveldue
tothevariednatureofwritinginstrumentsandnon-uniformityintheamountofpressureappliedwhile
writing. Thisnon-homogeneityoverasinglestrokecanonlybe identiï¬ed ifwemagnify the image
(see thedarkandbrightpatcheswithin thestroke inFigure5. Forexample,LBPfor the3Ã3segment,
markedinred, inFigure5 is â00010001â.However, thevisualperceptionofahumanbeingconsiders
thisasahomogeneousregionwithall zeros â00000000â. Thispropertyofhandwrittendocumentsmay
generateerroneousLBPfeaturevalues,which, in turn, fail todistinguish the textcomponents fromthe
non-textones. Inorder tosolvesuchproblems,a threshold âthâhasbeen introducedinLBPtogenerate
RLBP.This thresholdensures that twograyvalues thatarenotperceptiblydifferentarenot labeled
differently. Theproblemwithselectingavalueof th is that, if thevalue isextremely large, thenthe
entire regionwillbehave likeahomogeneousregionwithnointensityvariation. This isbecause the
binarypattern ,according toEquation (10),will be all zeros for everypixel. Therefore,weneed to
provideanupper limit, thmax , on thevalueof th.
Figure5.Magniï¬edimageofastrokeshowsthevariation ingrayvalues.A3Ã3matrixshowsthe
intensityvaluesof thegray imagesegmentmarkedinred.
Toaddress this issue,wehavesetanupper limit, thmax , onthevalueof th. Generally, inareal-life
handwrittendocument image, the intensityof thebackgroundpixels residewithinacloseproximity
of themaximumintensity255.Here,weassumethat the intensityof thebackgroundpixelswillbe ina
rangeof [245,255].Now,foreachimage,weï¬ndthehighestgray-scale intensity(Igraymax) less than245.
Weclaimthat thepixelPhavingthis intensityvaluehas tobeapartof somewritingstroke. thmaxhas
tobesuchthat, ifweconsider Icenhasavalue Igraymax andaneighboringpixelhasavalue245, f(x)
as given inEquation (2) for x= Inâ Icenâ thgives avalue 1. Therefore, thmax = 245â Igraymax.
Thevalueof thcanbeanythingbetween thmax and0.Wehaveperformedaweightedaverageof the
thresholdvalues in therange,with theweights increasingforhighervaluesof thandfoundthe ideal
50
back to the
book Document Image Processing"
Document Image Processing
- Title
- Document Image Processing
- Authors
- Ergina Kavallieratou
- Laurence Likforman-Sulem
- Editor
- MDPI
- Location
- Basel
- Date
- 2018
- Language
- German
- License
- CC BY-NC-ND 4.0
- ISBN
- 978-3-03897-106-1
- Size
- 17.0 x 24.4 cm
- Pages
- 216
- Keywords
- document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
- Category
- Informatik