Web-Books
im Austria-Forum
Austria-Forum
Web-Books
Informatik
Document Image Processing
Seite - 63 -
  • Benutzer
  • Version
    • Vollversion
    • Textversion
  • Sprache
    • Deutsch
    • English - Englisch

Seite - 63 - in Document Image Processing

Bild der Seite - 63 -

Bild der Seite - 63 - in Document Image Processing

Text der Seite - 63 -

J. Imaging 2018,4, 6 Figure2.BlockDiagramof theHolisticOCRSystem. 3. FeatureExtraction Themainconceptof theproposedalgorithmisbasedontheproperty that theDCTtransform compressed image is adecompositionvectorwhich canuniquely represent the input image tobe correctly reconstructed later at a decompression stage. In this work, the ïŹrst 100–200 2D-DCT coefïŹcients are used as word features that provide good approximation about the word image information. Inoursystem, three featureswereexperimented. Those featuresare: DiscreteCosine Transforms(DCT),DiscreteCosineTransforms4-Blocks(DCT_4B),andafeaturewhichisacombination ofDCTandDCT_4B. 3.1.DiscreteCosineTransform(DCT) TheDCTfeatures inoursystemareextractedvia twodimensionalDCT.Thetwodimensional DCTofanM×N imagef(x,y) isdeïŹnedas follows: T(u,v)= 1√ MN CuCv M−1 ∑ x=0 N−1 ∑ y=0 f(x,y)cos( (2x+1)uπ 2M )cos( (2y+1)vπ 2N ) (1) where0≀ x≀M−1,0≀ y≀N−1 Cu= { 1√ M ,x=0 2√ M ,1≀ x≀M−1 , Cv= { 1√ N ,y=0 2√ N ,1≀ y≀N−1 . After applyingDCT to thewholeword image, the features are extracted in avector formby usingthemostsigniïŹcantDCTcoefïŹcients. Thesteps involved inDCTfeatureextractionasshownin Figure3are: 1. Apply theDCTtothewholewordimage. 2. PerformzigzagoperationontheDCTcoefïŹcients Idct. Thezigzagmatrix Iz isa rowvectormatrixcontaininghighfrequencycoefïŹcients in itsïŹrstN values thatcontainmostwordinformation. This formsfeaturesvector fdct foreachword. 63
zurĂŒck zum  Buch Document Image Processing"
Document Image Processing
Titel
Document Image Processing
Autoren
Ergina Kavallieratou
Laurence Likforman-Sulem
Herausgeber
MDPI
Ort
Basel
Datum
2018
Sprache
deutsch
Lizenz
CC BY-NC-ND 4.0
ISBN
978-3-03897-106-1
Abmessungen
17.0 x 24.4 cm
Seiten
216
Schlagwörter
document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
Kategorie
Informatik
Web-Books
Bibliothek
Datenschutz
Impressum
Austria-Forum
Austria-Forum
Web-Books
Document Image Processing