Web-Books
im Austria-Forum
Austria-Forum
Web-Books
Informatik
Document Image Processing
Seite - 38 -
  • Benutzer
  • Version
    • Vollversion
    • Textversion
  • Sprache
    • Deutsch
    • English - Englisch

Seite - 38 - in Document Image Processing

Bild der Seite - 38 -

Bild der Seite - 38 - in Document Image Processing

Text der Seite - 38 -

J. Imaging 2018,4, 80 3.4. PrintDB ThePrintDBconsistsoffiveprinteddocument images thatareartificiallyslantedoverarange from−45◦ to +45◦, yielding a total of 455 slanted, printed document images. The exact slant is predetermined,makingevaluationof the techniqueeasier andmoreprecise. Thedocumentswere madefrompartsof .pdffilestoensurepreciseslantvalues. Thepageswereselectedtoincludedifferent typeof text types (including sparewriting and single/double columns). All the textwas slanted, keepingtheoriginaldimensions ((1/5) *A4_height * (1/2) *A4_width). Figure9showsanexampleof thePrintDBafter theapplicationof the technique. Figure 9. Application of our slant removal technique on sample of PrintDB (left); corrected by 35◦ (right). 3.5. Set-Upof theTextRatioRParameter Theamount of text in thewindow isvery important inorder todetect the slant angle. Little text includes very little information,while toomuch textwould increase the computational cost. In [17], therateof10%wasusedasapprovedamountof text in theselectedwindows justbytestand trial.Here,moredetailedexperimentsareperformed.Thetext ratioofeachwindowiscountedand comparedto theslantdetectionerror. Theexperimentwasperformedforeverywindowonseveral imagesof thevalidationset: 5document images fromtheTrigraphSlantDB,1 fromtheWashington DB, 1 from theBH2MDB, and 20 from thePrintDB. Since the handwritten imageswere of high resolution, theprocedurewasvery timeconsumingand just fewof themwereused. On theother hand, the imagesofprintedtextwereallused. InFigure10, thecurveofsumofslantsquareerrors (SSE)withreference to the text ratio ispresentedforprintedandhandwritten text. KDQGZULWWHQ 3ULQW'%66( 7H[W 5DWLR 5 Figure10.Sumofsquareerrorsaccordingto text ratioR for theprinteddb(dotted line)andthe three handwrittendatabases (broken line). 38
zurück zum  Buch Document Image Processing"
Document Image Processing
Titel
Document Image Processing
Autoren
Ergina Kavallieratou
Laurence Likforman-Sulem
Herausgeber
MDPI
Ort
Basel
Datum
2018
Sprache
deutsch
Lizenz
CC BY-NC-ND 4.0
ISBN
978-3-03897-106-1
Abmessungen
17.0 x 24.4 cm
Seiten
216
Schlagwörter
document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
Kategorie
Informatik
Web-Books
Bibliothek
Datenschutz
Impressum
Austria-Forum
Austria-Forum
Web-Books
Document Image Processing