Seite - 122 - in Document Image Processing

Bild der Seite - 122 -

Text der Seite - 122 -

J. Imaging 2018,4, 43 recognizinganddirectly transliteratingBalinesewords. For theKhmerandSundanesedatasets, the LSTMarchitectureseemstostruggle to learn the trainingdata.Moresyntheticdata trainingwitha more frequentwordshouldbegenerated inorder tosupport the trainingprocess. For theBalinese dataset, asequencedepthof100pixelswithaneuronsizeof200givesabetter result forbothLSTM andBLTSMarchitecture.Mostof theSoutheastAsianscriptsaresyllabic scripts.Onecharacter/glyph in these scripts represents a syllable,with a sequence of letters inLatin script. In this case,word transliteration isnot justwordrecognitionwithone-to-oneglyph-to-letterassociation. Thismakes wordtransliterationmorechallengingthancharacter/glyphrecognition. Table8.Experimental results forwordrecognitionandtransliterationtasks (in%errorrate for test). Methods(withOCRopy[56]Framework) Balinese Khmer Sundanese BLSTM1(seq_depth60,neuronsize100) 43.13 Latin text: 73.76Khmer text: 77.88 75.52 LSTM1(seq_depth100,neuronsize100) 42.88 - - BLSTM2(seq_depth100,neuronsize200) 40.54 - - LSTM2(seq_depth100,neuronsize200) 39.70 - - Figure26.Errorrate forBalinesewordrecognitionandtransliterationtest set. Figure27.Errorrate forKhmerwordrecognitionandtransliterationtest set. 122

zurück zum Buch Document Image Processing"

Document Image Processing

Titel: Document Image Processing
Autoren: Ergina Kavallieratou; Laurence Likforman-Sulem
Herausgeber: MDPI
Ort: Basel
Datum: 2018
Sprache: deutsch
Lizenz: CC BY-NC-ND 4.0
ISBN: 978-3-03897-106-1
Abmessungen: 17.0 x 24.4 cm
Seiten: 216
Schlagwörter: document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
Kategorie: Informatik