Page - 152 - in Document Image Processing

Image of the Page - 152 -

Text of the Page - 152 -

J. Imaging 2018,4, 39 Themain contributionof thepresentwork is the comprehensive evaluationof themajor classiﬁer combinationapproacheswhichareeitherrulebasedorapplyasecondaryclassiﬁer for information fusion. Themotivation is to improvetheclassiﬁcationaccuracyat theword-levelhandwrittenscript recognitionbycombiningtheresultsof thebestperformingclassiﬁeronthreepreviouslyusedfeature sets. It isamulti-classclassiﬁcationproblemandin thepresentcase,12ofﬁciallyused Indic-scriptsare consideredwhichare:Devanagari,Bangla,Odia,Gujarati,Gurumukhi,Tamil,Telugu,Kannada,Malayalam, Manipuri,UrduandRoman. Threedifferent setsof featurevectorsbasedonbothshapeandtexture analysishavebeenestimatedfromeachof thehandwrittenwordimages. Identiﬁcationof thescripts inwhich theword images arewritten, is donewith these featurevaluesby feeding the same into differentMLPclassiﬁers. Soft-decisionsprovidedbythe individualclassiﬁersare thencombinedusing anarrayof classiﬁer combination techniques. This kindofwork is implemented for theﬁrst time assumingthenumberof Indicscriptsundertakenandtherangeofcombinationtechniquesapplied. Thesystemdevelopedfor thescript recognitiontaskhere, isapartof thegeneral frameworkwhere different featuresetsandclassiﬁeroutputscanbemodelledintoasinglesystemwithoutmuchincrease in thecomputation involved. Blockdiagramof thepresentwork isshowninFigure1. Figure1.Schematicdiagramof theproposedmethodology. 2. FeatureExtraction In thispaper, threepopular featureextractionmethodologieshavebeenusedfor thecombination namely,EllipticalFeatures [21],HistogramofOrientedGradients (HOG)[30]andModiﬁedlog-Gabor ﬁlter transform[20]. Theﬁrst featureset isapplied tocapture theoverall structurepresent in thescript wordimageswhereas therest twofeaturesetsdealwith the textureof thesame. These featureshave alreadyprovidedsatisfactoryresults to thischallengingtaskofhandwrittenscript identiﬁcation. 2.1. EllipticalFeatures Thewordimagesaregenerally foundtobeelongated innaturewhichcanbettercoveredbyan ellipse. That iswhy;elliptical featuresareextractedfromthecontourandthe local regionsofaword image so that it is easier to isolate aparticular script. Twomore important notationsused in this subsectionare: (a)Pixel ratio (Pr)and(b)Pixelcount (Pc). Pr isdeﬁnedas theratioof thenumberof contourpixels (object) to thenumberofbackgroundpixelsandthepixel countwhereasPc isdeﬁned as thenumberofcontourpixels. Thefeaturesaredescribed indetail: 2.1.1.MaximumInscribedEllipse Theheightandwidthof theboundingboxarecalculatedforeachwordimage.Arepresentative ellipse is theninscribed(consideringtheorientationof theellipse) inside thisboundingboxhaving 152

back to the book Document Image Processing"

Document Image Processing

Title: Document Image Processing
Authors: Ergina Kavallieratou; Laurence Likforman-Sulem
Editor: MDPI
Location: Basel
Date: 2018
Language: German
License: CC BY-NC-ND 4.0
ISBN: 978-3-03897-106-1
Size: 17.0 x 24.4 cm
Pages: 216
Keywords: document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
Category: Informatik