Page - 61 - in Document Image Processing
Image of the Page - 61 -
Text of the Page - 61 -
J. Imaging 2018,4, 6
searchformoreeffectivesolutions to tackle theproblemofclassification.Amoredirectandefficient
methodology canbeprovidedusingholistic recognition [6]. Holistic approachhandles thewhole
wordasaunifiedunit. Aglobal featurevector is calculated for the indivisible inputwordsample
which is thenutilized to classify thewordagainst a stored lexiconofwords. Holistic recognition
is inspired fromwhat isknownas thewordsuperiorityeffect,whichstates thatpeoplehavebetter
recognitionof letterspresentedwithinwordsascomparedto isolated lettersandto letterspresented
withinnon-words [7].Holisticparadigmsarenotonlyeffective,butalsohavetheability tomaintain
certaineffectswhicharespecial to theclassunderoperationsuchascoarticulationeffects [8].
Severalpreviousresearcheffortshave investigatedtheholisticapproachforArabiccursivescript
recognition for both printed and handwritten types. Erlandson et al. [9] reported aword-level
recognitionsystemformachine-printedArabic. Theyusedanimage-morphologicalbasedvectorof
featuressuchasdotsandhamzas, thedirectionofsegments, the junctionsandendpoints,directionof
cavities,holes,descendersand intra-wordgaps. All these featuresarecomputed foraqueryword
image in therecognitionphaseandarematchedagainstapre-computeddatabaseofvectors froman
Arabicwords lexiconandthat systemachievedawordrecognitionrateof65%. Thisaccuracywas
achievedwith the integrationofa lexiconpruningsubsystemthat isbasedonanother recognition
methodthatwasdevelopedunder thesameproject fora trainingsetof8436word imagesscanned
at300dpi.
Al-Badretal. [10]developedanArabicholisticwordrecognitionsystembasedonasetof shape
primitives that aredetectedwithmathematicalmorphologyoperations. That systemwas trained
using a single fontwith three types of documents: ideal (noise-free), synthetically degraded and
scanned. Theused feature extractionoperatorswerevery sensitive to the scanningnoise and the
degradedlowresolutiondocuments. Thatsystemachievedarecognitionrateof99.4%fornoise-free
documents. Forsyntheticallydegradeddocuments, thesystemaccuracydecreasedto95.6%andto73%
forscanneddocuments.All theseevaluationswereperformedusinga limited lexiconthatcontained
4317words [10].
KhorsheedandClocksin [11]presentedatechniqueforrecognizingArabiccursivewords from
scannedimagesof textbytransformingeachwordinacertain lexicon intoanormalizedpolar image,
andthenapplieda two-dimensionalFourier transformtothatpolar image. Eachwordis represented
by a template that includes a set of Fourier’s coefficients, and for recognition, the systemused a
normalizedEuclideandistance thatmeasures thedistancebetween thewordunder test and those
templates. Thatsystemachievedarecognitionrateof90%fora lexiconsizeof145wordsandused
1700wordsamples for training.
Togetbetterperformance,Khorsheed [12]presentedanewsystembasedonHiddenMarkov
Models (HMMs). In that system, eachwordwas representedbyasingleHMM.Thewordmodels
were trainedusing thewordsampleFourier’s spectrum. Theexperimentswereconductedonfour
fonts, andthereportedresultsare forSimplifiedArabicandArabicTraditional fontsonly. Thesystem
achievedahigherrecognitionratecomparedto the template-basedrecognizer. Thehighestachieved
results forbothfontsare: 90%as thefirst choiceand98%within the top-tenchoices.
In a laterwork, Khorsheed [13] presented a cursiveArabic text recognition systembasedon
HMM.This systemwasalsosegmentation-freewithaneasy-to-extract statistical featuresvectorof
length60 elements, representing threedifferent typesof features. This systemwas trainedwith a
datacorpuswhich includesArabic textofmore than600A4-sizesheets typewritten insixdifferent
computer-generated fonts: Tahoma, SimplifiedArabic, Traditional Arabic, Andalus, Naskh and
Thuluth. Thehighestachievedresultswere88.7%and92.4%forAndalus font inmono-modeland
tri-model, respectively. Inanotherexperiment, that systemwastrainedwithamulti-fontdataset that
wasselectedrandomlywithsamesamplesize fromall fontsandtestedwithadatasetconsistingof
200 lines fromeachfont,andachievedanaccuracyof95%usingthe tri-model.
Inanother effort,Krayemetal. [14]presentedaword level recognition systemusingdiscrete
hiddenMarkov classifier alongwith a block based discrete cosine transform. This systemwas
61
back to the
book Document Image Processing"
Document Image Processing
- Title
- Document Image Processing
- Authors
- Ergina Kavallieratou
- Laurence Likforman-Sulem
- Editor
- MDPI
- Location
- Basel
- Date
- 2018
- Language
- German
- License
- CC BY-NC-ND 4.0
- ISBN
- 978-3-03897-106-1
- Size
- 17.0 x 24.4 cm
- Pages
- 216
- Keywords
- document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
- Category
- Informatik