Document Image Processing
J. Imaging 2018, 4, 6

Figure 2. Block Diagram of the Holistic OCR System.

3. Feature Extraction

The main concept of the proposed algorithm is based on the property that the DCT-compressed image is a decomposition vector which can uniquely represent the input image, so that the image can be correctly reconstructed later at a decompression stage. In this work, the first 100–200 2D-DCT coefficients are used as word features that provide a good approximation of the word image information.

In our system, three features were tested: Discrete Cosine Transform (DCT), Discrete Cosine Transform 4-Blocks (DCT_4B), and a feature which is a combination of DCT and DCT_4B.

3.1. Discrete Cosine Transform (DCT)

The DCT features in our system are extracted via the two-dimensional DCT. The two-dimensional DCT of an M×N image f(x,y) is defined as follows:

$$T(u,v) = \frac{1}{\sqrt{MN}}\, C_u C_v \sum_{x=0}^{M-1} \sum_{y=0}^{N-1} f(x,y) \cos\left(\frac{(2x+1)u\pi}{2M}\right) \cos\left(\frac{(2y+1)v\pi}{2N}\right) \tag{1}$$

where $0 \le x, u \le M-1$, $0 \le y, v \le N-1$, and

$$C_u = \begin{cases} \dfrac{1}{\sqrt{M}}, & u = 0 \\ \dfrac{2}{\sqrt{M}}, & 1 \le u \le M-1 \end{cases}, \qquad C_v = \begin{cases} \dfrac{1}{\sqrt{N}}, & v = 0 \\ \dfrac{2}{\sqrt{N}}, & 1 \le v \le N-1 \end{cases}.$$

After applying the DCT to the whole word image, the features are extracted in vector form by using the most significant DCT coefficients. The steps involved in DCT feature extraction, as shown in Figure 3, are:

1. Apply the DCT to the whole word image.
2. Perform the zigzag operation on the DCT coefficients $I_{dct}$.

The zigzag matrix $I_z$ is a row vector whose first N values are the low-frequency coefficients, which contain most of the word information. These values form the feature vector $f_{dct}$ for each word.
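The two steps above (transform the whole word image, then scan the coefficients in zigzag order and keep the leading ones) can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: the function names `dct2`, `zigzag`, and `dct_features` are hypothetical, the scaling is assumed to be the standard orthonormal DCT-II scaling (which may differ from the constants printed in Equation (1)), and the scan is assumed to follow the JPEG-style zigzag order, since the page does not spell out the traversal.

```python
import math


def dct2(image):
    """2D DCT-II of an image given as a list of M rows with N values each.

    Assumption: orthonormal scaling, sqrt(1/size) for index 0 and
    sqrt(2/size) otherwise. The constants in Eq. (1) may differ.
    """
    m, n = len(image), len(image[0])

    def scale(k, size):
        return math.sqrt((1.0 if k == 0 else 2.0) / size)

    # Cosine tables: cos_x[u][x] = cos((2x + 1) * u * pi / (2M)), same for y.
    cos_x = [[math.cos((2 * x + 1) * u * math.pi / (2 * m)) for x in range(m)]
             for u in range(m)]
    cos_y = [[math.cos((2 * y + 1) * v * math.pi / (2 * n)) for y in range(n)]
             for v in range(n)]

    coeffs = [[0.0] * n for _ in range(m)]
    for u in range(m):
        for v in range(n):
            total = 0.0
            for x in range(m):
                for y in range(n):
                    total += image[x][y] * cos_x[u][x] * cos_y[v][y]
            coeffs[u][v] = scale(u, m) * scale(v, n) * total
    return coeffs


def zigzag(matrix):
    """Flatten a matrix along its anti-diagonals, alternating direction."""
    m, n = len(matrix), len(matrix[0])
    out = []
    for s in range(m + n - 1):
        # All cells (r, c) with r + c == s, listed with r increasing.
        rows = range(max(0, s - n + 1), min(m, s + 1))
        diagonal = [matrix[r][s - r] for r in rows]
        if s % 2 == 0:
            # Even diagonals are traversed from bottom-left to top-right.
            diagonal.reverse()
        out.extend(diagonal)
    return out


def dct_features(image, count):
    """Return the first `count` zigzag-ordered DCT coefficients."""
    return zigzag(dct2(image))[:count]


if __name__ == "__main__":
    # Tiny synthetic "word image"; real inputs would be much larger.
    word = [[0, 0, 1, 1], [0, 1, 1, 0], [1, 1, 0, 0], [1, 0, 0, 0]]
    print(dct_features(word, 6))
```

The direct quadruple loop costs O(M²N²) operations and is meant only to mirror the formula; for real word images a fast DCT routine from a numerical library would be the practical choice. In the setting described above, `count` would be in the range of 100 to 200 coefficients.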
Title
Document Image Processing
Authors
Ergina Kavallieratou
Laurence Likforman-Sulem
Editor
MDPI
Location
Basel
Date
2018
Language
German
License
CC BY-NC-ND 4.0
ISBN
978-3-03897-106-1
Size
17.0 x 24.4 cm
Pages
216
Keywords
document image processing, preprocessing, binarization, text-line segmentation, handwriting recognition, Indic/Arabic/Asian script, OCR, video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
Category
Informatik (Computer Science)