Web-Books
in the Austria-Forum
Austria-Forum
Web-Books
Informatik
Document Image Processing
Page - 38 -
  • User
  • Version
    • full version
    • text only version
  • Language
    • Deutsch - German
    • English

Page - 38 - in Document Image Processing

Image of the Page - 38 -

Image of the Page - 38 - in Document Image Processing

Text of the Page - 38 -

J. Imaging 2018,4, 80 3.4. PrintDB ThePrintDBconsistsoffiveprinteddocument images thatareartificiallyslantedoverarange from−45◦ to +45◦, yielding a total of 455 slanted, printed document images. The exact slant is predetermined,makingevaluationof the techniqueeasier andmoreprecise. Thedocumentswere madefrompartsof .pdffilestoensurepreciseslantvalues. Thepageswereselectedtoincludedifferent typeof text types (including sparewriting and single/double columns). All the textwas slanted, keepingtheoriginaldimensions ((1/5) *A4_height * (1/2) *A4_width). Figure9showsanexampleof thePrintDBafter theapplicationof the technique. Figure 9. Application of our slant removal technique on sample of PrintDB (left); corrected by 35◦ (right). 3.5. Set-Upof theTextRatioRParameter Theamount of text in thewindow isvery important inorder todetect the slant angle. Little text includes very little information,while toomuch textwould increase the computational cost. In [17], therateof10%wasusedasapprovedamountof text in theselectedwindows justbytestand trial.Here,moredetailedexperimentsareperformed.Thetext ratioofeachwindowiscountedand comparedto theslantdetectionerror. Theexperimentwasperformedforeverywindowonseveral imagesof thevalidationset: 5document images fromtheTrigraphSlantDB,1 fromtheWashington DB, 1 from theBH2MDB, and 20 from thePrintDB. Since the handwritten imageswere of high resolution, theprocedurewasvery timeconsumingand just fewof themwereused. On theother hand, the imagesofprintedtextwereallused. InFigure10, thecurveofsumofslantsquareerrors (SSE)withreference to the text ratio ispresentedforprintedandhandwritten text. KDQGZULWWHQ 3ULQW'%66( 7H[W 5DWLR 5 Figure10.Sumofsquareerrorsaccordingto text ratioR for theprinteddb(dotted line)andthe three handwrittendatabases (broken line). 38
back to the  book Document Image Processing"
Document Image Processing
Title
Document Image Processing
Authors
Ergina Kavallieratou
Laurence Likforman-Sulem
Editor
MDPI
Location
Basel
Date
2018
Language
German
License
CC BY-NC-ND 4.0
ISBN
978-3-03897-106-1
Size
17.0 x 24.4 cm
Pages
216
Keywords
document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
Category
Informatik
Web-Books
Library
Privacy
Imprint
Austria-Forum
Austria-Forum
Web-Books
Document Image Processing