Web-Books
im Austria-Forum
Austria-Forum
Web-Books
Informatik
Document Image Processing
Seite - 183 -
  • Benutzer
  • Version
    • Vollversion
    • Textversion
  • Sprache
    • Deutsch
    • English - Englisch

Seite - 183 - in Document Image Processing

Bild der Seite - 183 -

Bild der Seite - 183 - in Document Image Processing

Text der Seite - 183 -

J. Imaging 2017,3, 62 images;and(3) font,backgroundandlayoutarefinallyusedwiththetranscribedtexts toautomatically generate syntheticversionsof theoriginal ones. Thewholedatabase is composedof 93document imagescontaining18.240wordsand115.622characters. (a) (DB1)Contemporaryfrenchtypewrittendocument. (b) (DB2)Oldfrenchtypewrittendocument. (c) (DB3)OldfrenchManuscriptdocument. Figure11. Imagesextractedfromthedatabaseusedfor testingourpredictionalgorithm. (Left)original images. (Right) syntheticgenerated images. Toevaluate theOCRtext recognitionrate,weuse theLevenshteindistance (ametricmeasuring the difference between two strings) between the whole original transcribed text and the whole recognized text. Wecompute themeanof theLevenshteindistances for theNdocuments of each database. Using this Levenshtein distance, the difference between theOCR text recognition rate computedonreal imagesandtheonecomputedon“Loremipsum”version(Table2Column1and Column2) is, on average, only overestimated by 0.04. Thedifference between the realOCR rate and theone computedon the synthetic versions (Table 2Column1andColumn3) is, onaverage, onlyoverestimatedby0.03.Mostof thesuccessofdifferentexistingOCRpredictionmethods ([63–66]) arerelatedto thequalityandquantityof theneededgroundtruth.Ourpredictionmethodpresented hereprovidescomparableresultswith theones formthestateof theart. Table2.ComparisonbetweenOCRrecognitionratesobtainedonthreedifferentbooksoriginal images and their syntheticversions. Column1: OCRrecognition rateonoriginal images,Column2: OCR recognitionrateonsynthetic imagesgeneratedwithboththetextandthefont fromtheoriginal images, Column3:OCRrecognitionrateonsynthetic imagesgeneratedwith loremipsumrandomtextandthe font fromtheoriginal images. Original Image FontFrom SameText LoremText DB1 0.95 DB1 0.94 0.88 DB2 0.80 DB2 0.85 0.84 DB3 0.24 DB3 0.21 0.23 5.Conclusions DocCreator gives toDIAR researchers a simple and rapidway to extend existing document imagedatabasesor tocreatenewonesavoidingthe tedious taskofmanualgroundtruthgeneration. DocCreator embeds many fonts, backgrounds, meshes and realistic degradation models which, whencombined, result inan interestingcombinationofground-trutheddatabases. Theexperiments 183
zurück zum  Buch Document Image Processing"
Document Image Processing
Titel
Document Image Processing
Autoren
Ergina Kavallieratou
Laurence Likforman-Sulem
Herausgeber
MDPI
Ort
Basel
Datum
2018
Sprache
deutsch
Lizenz
CC BY-NC-ND 4.0
ISBN
978-3-03897-106-1
Abmessungen
17.0 x 24.4 cm
Seiten
216
Schlagwörter
document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
Kategorie
Informatik
Web-Books
Bibliothek
Datenschutz
Impressum
Austria-Forum
Austria-Forum
Web-Books
Document Image Processing