J. Imaging 2018, 4, 27
Figure 2. Block diagram of the proposed algorithm.
2.1. The Decision Making Block
The decision making block takes as input the image to be binarized and outputs a vector with
four parameters: the value of the kernel (kernel) for the bilateral filter and three threshold values (tR, tG,
tB) that will later be used in the modified Otsu filtering.
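The four-parameter output described above can be pictured as a small record. The following is only a minimal sketch: the names `DecisionParams` and `channel_masks` are hypothetical, and comparing each color channel against its own threshold is an assumed reading of how tR, tG, and tB might be used. The text does not specify the modified Otsu step, so the sketch does not attempt to combine the channels.

```python
from dataclasses import dataclass
from typing import List, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each in 0..255


@dataclass(frozen=True)
class DecisionParams:
    """Four-value output of the decision making block (names are illustrative)."""
    kernel: int  # kernel value for the bilateral filter
    t_r: int     # threshold tR for the red channel
    t_g: int     # threshold tG for the green channel
    t_b: int     # threshold tB for the blue channel


def channel_masks(image: List[List[Pixel]], p: DecisionParams):
    """Assumed usage: compare every channel with its own threshold.

    Returns three boolean masks (R, G, B). True means the channel value
    is strictly greater than its threshold. No channel combination is
    done here, because the source does not describe one.
    """
    thresholds = (p.t_r, p.t_g, p.t_b)
    return tuple(
        [[px[c] > thresholds[c] for px in row] for row in image]
        for c in range(3)
    )
```

For example, `channel_masks(image, DecisionParams(kernel=5, t_r=100, t_g=120, t_b=140))` yields one mask per channel; the numeric values are arbitrary placeholders, not values from the paper.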
The binarization process proposed here is trained with synthetic images, which were
generated as explained in Section 2.2. After filtering, the matrix of co-occurrence probabilities between
the original image and the binary image was calculated for each of the images in the document
training set, whose generation is explained below.
The probabilistic structure applied in the analysis to each of the images in the training set is
similar to the transmission of binary data in a Binary Asymmetric Channel, as shown in Figure 3.
In information theory, the probabilities P(f/b) and P(b/f) represent additive noise in a
communication channel; here they represent the inability of the algorithm to correct the back-to-front
interference of the image tested in the binarization process. The probabilities P(b/b) and P(f/f)
are calculated from the pixel-to-pixel comparison of the binarized image generated by the proposed
algorithm with the ground-truth image.
Figure 3. Generation of the co-occurrence matrix for each of the images in the training set.
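The pixel-to-pixel comparison described above can be sketched as follows. This is an illustrative sketch, not the authors' implementation. It assumes that both images are given as 2D lists with 0 for background (b) and 1 for foreground (f), and that P(x/y) denotes the probability that the binarized pixel is x given that the ground-truth pixel is y. The function name `cooccurrence_probabilities` is hypothetical.

```python
from typing import Dict, List


def cooccurrence_probabilities(
    ground_truth: List[List[int]], binarized: List[List[int]]
) -> Dict[str, float]:
    """Estimate the four channel probabilities from two label images.

    Pixels must be 0 (background, b) or 1 (foreground, f).
    Assumed convention: P(x/y) = P(output is x | ground truth is y).
    """
    if len(ground_truth) != len(binarized):
        raise ValueError("images must have the same number of rows")

    # counts[(g, o)]: number of pixels with ground truth g and output o
    counts = {(g, o): 0 for g in (0, 1) for o in (0, 1)}
    for gt_row, out_row in zip(ground_truth, binarized):
        if len(gt_row) != len(out_row):
            raise ValueError("rows must have the same length")
        for g, o in zip(gt_row, out_row):
            counts[(g, o)] += 1

    names = {0: "b", 1: "f"}
    probs: Dict[str, float] = {}
    for g in (0, 1):
        total = counts[(g, 0)] + counts[(g, 1)]
        for o in (0, 1):
            key = f"P({names[o]}/{names[g]})"
            # If a class is absent from the ground truth, report 0.0.
            probs[key] = counts[(g, o)] / total if total else 0.0
    return probs
```

Under this convention the two probabilities conditioned on the same ground-truth class sum to one, so P(f/b) = 1 - P(b/b) and P(b/f) = 1 - P(f/f) whenever that class is present in the ground truth.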
Document Image Processing
- Title: Document Image Processing
- Authors: Ergina Kavallieratou, Laurence Likforman-Sulem
- Publisher: MDPI
- Location: Basel
- Date: 2018
- Language: German
- License: CC BY-NC-ND 4.0
- ISBN: 978-3-03897-106-1
- Dimensions: 17.0 x 24.4 cm
- Pages: 216
- Keywords: document image processing, preprocessing, binarization, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
- Category: Computer science