Web-Books
in the Austria-Forum
Austria-Forum
Web-Books
Informatik
Document Image Processing
Page - 18 -
  • User
  • Version
    • full version
    • text only version
  • Language
    • Deutsch - German
    • English

Page - 18 - in Document Image Processing

Image of the Page - 18 -

Image of the Page - 18 - in Document Image Processing

Text of the Page - 18 -

J. Imaging 2018,4, 27 capableoffiltering-outsuchanoise increasesenormously,asanewsetofhuesofpaperandprinting colorsappear. Thedirectapplicationofbinarizationalgorithmsmayyieldacompletelyunreadable document, as the interfering inkof the backside of thepaper overlapswith the binary one in the foreground. Severaldocument imagecompressionschemes forcolor imagesarebasedon“adding color” toabinary image. Suchcompressionstrategy isunable tohandledocumentswithback-to-front interference [5]. OpticalCharacterRecognizers (OCRs) are alsounable toworkproperly for such documents. Severalalgorithmsweredevelopedspecifically tobinarizedocumentswithback-to-front interference [3,4,6–9]. There isnobinarizationtechniquetobeanall casewinnerasmanyparameters mayinterfere inthequalityof theresultingimage[9]. Thedevelopmentofnewbinarizationalgorithms is still an important research topic. International competitionsonbinarizationalgorithms, suchas DIBCO-Document ImageBinarizationCompetition [10], areanevidenceof therelevanceof thisarea. Figure1. Imageswithback-to-front interference fromthe three test setsused in thispaper:Nabuco bequest (left),LiveMemory(center) andDIBCO(right). This paper presents a new global filter [1] to binarize documents, which is able to remove the back-to-front noise in awide range of documents. Quantitative and qualitative assessments made inawidevarietyofdocuments fromthreedifferent“real-world”datasets (typed,printedand handwritten, using different kinds of paper, ink, etc.) allow to witness the efficiency of the proposedscheme. 2.TheNewAlgorithm Thealgorithmproposedhere isperformedinfoursteps: 1. decision-makingforfindingthevector ofparametersof the imagetobefiltered,2. filteringthe imageusingabilateralfilter,3. splittingthe image into theRGBcomponents, andperformingtheirbinarizationusingamethodinspiredbyOtsu’s algorithm for eachRGB channel, and 4. choice ofwhich of theRGB components best preserved thedocument information in the foreground,which isconsideredthefinaloutputof thealgorithm. Figure 2presents theblockdiagramof theproposedalgorithm. The functionalityof eachblock is detailedas follows. 18
back to the  book Document Image Processing"
Document Image Processing
Title
Document Image Processing
Authors
Ergina Kavallieratou
Laurence Likforman-Sulem
Editor
MDPI
Location
Basel
Date
2018
Language
German
License
CC BY-NC-ND 4.0
ISBN
978-3-03897-106-1
Size
17.0 x 24.4 cm
Pages
216
Keywords
document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
Category
Informatik
Web-Books
Library
Privacy
Imprint
Austria-Forum
Austria-Forum
Web-Books
Document Image Processing