Page - 201 - in Document Image Processing
Image of the Page - 201 -
Text of the Page - 201 -
J. Imaging 2018,4, 32
Note that the sizeof the inputblock is set to 1Ã4 forProtocols 6.1 and3 (not 2Ã4), respectively.
Toï¬ne-tune theseparameterswe justpickoutasetof2000 labeled images fromAcTiV-R, inwhich
190areusedasavalidationset.
Table6.Bestparameters for trainingthenetwork.
Parameters Values
MDLSTMSize 2,10and50
Feed-forwardSize 6and20
InputBlockSize 2Ã4
HiddenBlockSizes 1Ã4and1Ã4
Learnrate 10â4
Momentum 0.9
5.3. ExperimentalResults
Several experiments have been conducted using the AcTiV-D and AcTiV-R subsets. These
experimentscanbedivided into twocategories: Theï¬rstoneconcerns thecomparisonofoursystems
with tworecentmethods. Thesecondcategoryaimsatanalyzingtheeffectof increasingthe training
dataontheaccuracyof theLADItextdetector.
5.3.1.ComparisonwithOtherMethods
As proof of concept of the proposed benchmark, we compare our systemswith two recent
methods. Theï¬rstonewasproposedbyGaddouretal. [52] tobasicallydetectArabic texts innatural
scene images. Themainsteps involvedare:
⢠Pixel-colorclusteringusingk-means to formpairsof thresholds foreachRGBchannel.
⢠Creationofbinarymapforeachpairof thresholds.
⢠ExtractionofCCs.
⢠Preliminaryï¬lteringaccordingtoâareastabilityâcriterion.
⢠Secondï¬lteringbasedonasetof statisticalandgeometric rules.
⢠Horizontalmergingof theremainingcomponents to formtextlines.
Thesecondmethodwasput forwardbyIwataetal. [53] torecognizeartiï¬cialArabic text invideo
frames. Itoperatesas follows:
⢠Textlinesegmentation intowordsbythresholdinggapsbetweenCCs.
⢠Over-segmentationofcharacters intoprimitivesegments.
⢠Character recognition using 64-dimensional feature vector of chain code histogram and the
modiï¬edquadraticdiscriminant function.
⢠Word recognition by dynamic programming using total likelihood of characters as
objective function.
⢠Falseword reduction bymeasuring the average of the character likelihoods in aword and
comparing it toapredeï¬nedthreshold.
Thedetectionsystemshavebeentrainedonthe training-set1ofTable4. Theevaluationhasbeen
doneonthe test set for thedetectionandrecognitiontasks. Table7presentsevaluationresultsof the
detectionprotocols in termsofprecision, recallandF-measure. Thebest resultsaremarkedinbold.
TheLADI systemscoresbest for all protocolswith anF-measurebetween0.73 and0.85 forAllSD
protocol (p4.4) andAljazeeraHDprotocol (p1) respectively. In contrast to theSysAthat represents
a fully heuristic-basedmethod, the LADI system increased the F-measure by 11% for Protocol 1.
ForProtocols 4.1, 4.2, 4.3 and4.4 (SDchannels), the results arehigher,withagainof, respectively,
11%,17%,14%and24%.Thisreï¬ects theeffectivenessofusingamachine-learningsolutiontoï¬lter the
resultsgivenbytheSWTalgorithm.TheGaddosystemhasstrongfragmentationandmissdetection
tendency asdepictedby its obtainednumerical results. Table 8presents evaluation results of the
201
back to the
book Document Image Processing"
Document Image Processing
- Title
- Document Image Processing
- Authors
- Ergina Kavallieratou
- Laurence Likforman-Sulem
- Editor
- MDPI
- Location
- Basel
- Date
- 2018
- Language
- German
- License
- CC BY-NC-ND 4.0
- ISBN
- 978-3-03897-106-1
- Size
- 17.0 x 24.4 cm
- Pages
- 216
- Keywords
- document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
- Category
- Informatik