Web-Books
in the Austria-Forum
Austria-Forum
Web-Books
Informatik
Document Image Processing
Page - 175 -
  • User
  • Version
    • full version
    • text only version
  • Language
    • Deutsch - German
    • English

Page - 175 - in Document Image Processing

Image of the Page - 175 -

Image of the Page - 175 - in Document Image Processing

Text of the Page - 175 -

J. Imaging 2017,3, 62 Figure2. Syntheticdocument imagegeneration. (Left) originaldocument image. (Right) synthetic document imagegeneratedautomaticallywith therandomtext“Loremipsum”. Theautomatically generated image lookssimilar to theoriginalone. Theresult is stillperfectible.Here,as theoriginal imagesuffers fromlocaldeformations, the characters extracted tobuild the fontarequitedifferent andmay look too randomwhenassembledon the syntheticdocument. Abetter font extractionor compositionusingthecontext tochoosenewcharactersmayalleviate thisproblem. 3.DocumentDegradationModels Physicaldegradationduetoageing, storageconditionsorpoorqualityofprintingmaterialsmay bepresentondocuments. DocCreatorcurrentlyproposessevendegradationmodels. Asdetailed inFigure1 (rightpart), all thesemodelscanbeappliedonreal images toextendany document imagedatabase. TheusercaninteractwithDocCreator inorder toset thequantityofdefects togenerate. In the following sections, we describe the main ideas of these seven degradation models. AsDocCreator is anopensource software, readers canconsult the source code togetmoredetails about the implementationof thesemodels. 3.1. InkDegradation DocCreatorprovidesagrayscale inkdegradationmodel (detailed in [41]) able to simulate the most commoncharacterdegradationsdue to the ageof thedocument itself andprinting/writing process, such as ink splotches,white specks or streaks. Thismodel locallydegrades the image in theneighbourhoodof the charactersboundaries. Noise is thengenerated to create somesmall ink spotsnearcharactersor toerasesomecharacters inkarea.Contrary to thewellknownKanungonoise model [47] thatworksonlyonblackandwhite images, thisdegradationmethodcanprocessgrayscale images. SeeFigure3 foran inkdegradationexample. 175
back to the  book Document Image Processing"
Document Image Processing
Title
Document Image Processing
Authors
Ergina Kavallieratou
Laurence Likforman-Sulem
Editor
MDPI
Location
Basel
Date
2018
Language
German
License
CC BY-NC-ND 4.0
ISBN
978-3-03897-106-1
Size
17.0 x 24.4 cm
Pages
216
Keywords
document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
Category
Informatik
Web-Books
Library
Privacy
Imprint
Austria-Forum
Austria-Forum
Web-Books
Document Image Processing