Page - 7 - in Document Image Processing
Image of the Page - 7 -
Text of the Page - 7 -
J. Imaging 2018,4, 68
for i=1,...,N,whereâ.â0 is the 0-norm,whichcounts thenon-zeroelements inx, andthedictionary
updatestage for theXobtainedfromthesparsecodingstage
D=argmin
D âYâDXâ2F . (3)
Dictionary learningalgorithmsareoftensensitive to thechoiceofm. Theupdatestepcaneither
be sequential (oneatomata time) [51,52], orparallel (all atomsatonce) [53,54]. Adictionarywith
sequentialupdate,althoughcomputationallyabitexpensive,willgenerallyprovidebetterperformance
than theparallel update, due to theï¬ner tuningof eachdictionaryatom. In sequential dictionary
learning, thedictionaryupdateminimizationproblem(3) is split intoK sequentialminimizations,
byoptimizingthecost function(3) foreach individualatomwhilekeepingï¬xedtheremainingones.
Most of the proposed algorithmshave kept the two stage optimizationprocedure, the difference
appearingmainly in thedictionaryupdate stage,with someexceptionshavingadifference in the
sparsecodingstageaswell [43]. In themethodproposedin[51],whichhasbecomeabenchmark in
dictionary learning,eachcolumndkofDanditscorrespondingrowofcoefï¬cientsxrowk areupdated
basedonarank-1matrixapproximationof theerror forall thesignalswhendk is removed
{dk,xrowk } = arg mindk,xrowk âYâDXâ2F
= arg min
dk,xrowk âEkâdkxrowk â2F, (4)
whereEk =YââKi=1,i =kdixrowi . Thesingularvaluedecomposition (SVD)ofEk =UÎV isused to
ï¬ndtheclosest rank-1matrixapproximationofEk. Thedkupdate is takenas theï¬rst columnofU,
andthexrowk update is takenas theï¬rst columnofVmultipliedbytheï¬rstelementofÎ. Toavoid the
lossof sparsity inxrowk thatwouldbecreatedbythedirectapplicationof theSVDonEk, in [51], itwas
proposedtomodifyonly thenon-zeroentriesofxrowk resultingfromthesparsecodingstage. This is
achievedbytaking intoaccountonly thesignalsyi thatuse theatomdk inEquation(4),or,by taking,
insteadof theSVDofEk, theSVDofERk =EkIwk,wherewk= {i|1†iâ€N;xrowk (i) = 0}, and Iwk is
theNÃ|wk| submatrixof theNÃN identitymatrixobtainedbyretainingonly thosecolumnswhose
indexnumbersare inwk.
3.Bleed-ThroughIdentiï¬cation
Thealgorithmusedtorecognise thepixels thatbelongto thebleed-throughpatternmakesuseof
bothsidesof thedocument, i.e., therectoandtheverso images,andsuitablycompares their intensities
inapixel-by-pixelmodality.Hence, it is essential that twocorresponding,oppositepixelsexactly refer
to the samepieceof information. Inotherwords, at location (i, j), to thepixel in a side, let us say
ableed-throughpixel,mustcorrespond, in theoppositeside, the foregroundpixel thathasgenerated
it, and vice versa. In order to ensure thismatching, one of the two images needs to be reï¬ected
horizontally,andthenthe twoimagesmustbeperfectlyaligned[55].
Thewayinwhichweperformthecomparisonbetweenpairsofcorrespondingpixels ismotivated
bysomeconsiderationsabout thephysicalphenomenon. Indeed, throughexperience,weobserved
that, in themajority of themanuscripts examined, due topaperporosity, the seeped inkhas also
diffusedthroughthepaperï¬ber.Hence, ingeneral, thebleed-throughpattern isasmearedandlighter
version of the opposite text that has generated it. Note that this assumptiondoes notmean that,
on thesameside,bleed-throughis lighter thanthe foregroundtext. In fact,oneachside, the intensity
ofbleed-through isusuallyveryvariable,which ishighlynon-stationary, andsometimescanbeas
darkas the foregroundtext.
Otherconsiderationscanbemadebyreasoningintermsofâquantityof inkâ.Indeed, it isapparent
that thequantityof ink shouldbezero in thebackground, i.e., theunwrittenpaper, nomatter the
colorof thepaper,maximuminthedarkerandsharper foregroundtext,andminimuminthe lighter
7
back to the
book Document Image Processing"
Document Image Processing
- Title
- Document Image Processing
- Authors
- Ergina Kavallieratou
- Laurence Likforman-Sulem
- Editor
- MDPI
- Location
- Basel
- Date
- 2018
- Language
- German
- License
- CC BY-NC-ND 4.0
- ISBN
- 978-3-03897-106-1
- Size
- 17.0 x 24.4 cm
- Pages
- 216
- Keywords
- document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
- Category
- Informatik