Seite - 122 - in Document Image Processing
Bild der Seite - 122 -
Text der Seite - 122 -
J. Imaging 2018,4, 43
recognizinganddirectly transliteratingBalinesewords. For theKhmerandSundanesedatasets, the
LSTMarchitectureseemstostruggle to learn the trainingdata.Moresyntheticdata trainingwitha
more frequentwordshouldbegenerated inorder tosupport the trainingprocess. For theBalinese
dataset, asequencedepthof100pixelswithaneuronsizeof200givesabetter result forbothLSTM
andBLTSMarchitecture.Mostof theSoutheastAsianscriptsaresyllabic scripts.Onecharacter/glyph
in these scripts represents a syllable,with a sequence of letters inLatin script. In this case,word
transliteration isnot justwordrecognitionwithone-to-oneglyph-to-letterassociation. Thismakes
wordtransliterationmorechallengingthancharacter/glyphrecognition.
Table8.Experimental results forwordrecognitionandtransliterationtasks (in%errorrate for test).
Methods(withOCRopy[56]Framework) Balinese Khmer Sundanese
BLSTM1(seq_depth60,neuronsize100) 43.13 Latin text:
73.76Khmer
text: 77.88 75.52
LSTM1(seq_depth100,neuronsize100) 42.88 - -
BLSTM2(seq_depth100,neuronsize200) 40.54 - -
LSTM2(seq_depth100,neuronsize200) 39.70 - -
Figure26.Errorrate forBalinesewordrecognitionandtransliterationtest set.
Figure27.Errorrate forKhmerwordrecognitionandtransliterationtest set.
122
zurück zum
Buch Document Image Processing"
Document Image Processing
- Titel
- Document Image Processing
- Autoren
- Ergina Kavallieratou
- Laurence Likforman-Sulem
- Herausgeber
- MDPI
- Ort
- Basel
- Datum
- 2018
- Sprache
- deutsch
- Lizenz
- CC BY-NC-ND 4.0
- ISBN
- 978-3-03897-106-1
- Abmessungen
- 17.0 x 24.4 cm
- Seiten
- 216
- Schlagwörter
- document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
- Kategorie
- Informatik