Seite - 63 - in Document Image Processing

Bild der Seite - 63 -

Text der Seite - 63 -

J. Imaging 2018,4, 6 Figure2.BlockDiagramof theHolisticOCRSystem. 3. FeatureExtraction Themainconceptof theproposedalgorithmisbasedontheproperty that theDCTtransform compressed image is adecompositionvectorwhich canuniquely represent the input image tobe correctly reconstructed later at a decompression stage. In this work, the ﬁrst 100–200 2D-DCT coefﬁcients are used as word features that provide good approximation about the word image information. Inoursystem, three featureswereexperimented. Those featuresare: DiscreteCosine Transforms(DCT),DiscreteCosineTransforms4-Blocks(DCT_4B),andafeaturewhichisacombination ofDCTandDCT_4B. 3.1.DiscreteCosineTransform(DCT) TheDCTfeatures inoursystemareextractedvia twodimensionalDCT.Thetwodimensional DCTofanM×N imagef(x,y) isdeﬁnedas follows: T(u,v)= 1√ MN CuCv M−1 ∑ x=0 N−1 ∑ y=0 f(x,y)cos( (2x+1)uπ 2M )cos( (2y+1)vπ 2N ) (1) where0≤ x≤M−1,0≤ y≤N−1 Cu= { 1√ M ,x=0 2√ M ,1≤ x≤M−1 , Cv= { 1√ N ,y=0 2√ N ,1≤ y≤N−1 . After applyingDCT to thewholeword image, the features are extracted in avector formby usingthemostsigniﬁcantDCTcoefﬁcients. Thesteps involved inDCTfeatureextractionasshownin Figure3are: 1. Apply theDCTtothewholewordimage. 2. PerformzigzagoperationontheDCTcoefﬁcients Idct. Thezigzagmatrix Iz isa rowvectormatrixcontaininghighfrequencycoefﬁcients in itsﬁrstN values thatcontainmostwordinformation. This formsfeaturesvector fdct foreachword. 63

zurück zum Buch Document Image Processing"

Document Image Processing

Titel: Document Image Processing
Autoren: Ergina Kavallieratou; Laurence Likforman-Sulem
Herausgeber: MDPI
Ort: Basel
Datum: 2018
Sprache: deutsch
Lizenz: CC BY-NC-ND 4.0
ISBN: 978-3-03897-106-1
Abmessungen: 17.0 x 24.4 cm
Seiten: 216
Schlagwörter: document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
Kategorie: Informatik