J. Imaging 2018, 4, 6
Figure 2. Block Diagram of the Holistic OCR System.
3. Feature Extraction
The main concept of the proposed algorithm is based on the property that a DCT-compressed image is a decomposition vector that uniquely represents the input image, so that the image can be correctly reconstructed later at a decompression stage. In this work, the first 100–200 2D-DCT coefficients are used as word features that provide a good approximation of the word image information. Three features were tested in our system: Discrete Cosine Transform (DCT), Discrete Cosine Transform 4-Blocks (DCT_4B), and a feature that combines DCT and DCT_4B.
3.1. Discrete Cosine Transform (DCT)
The DCT features in our system are extracted via the two-dimensional DCT. The two-dimensional DCT of an M×N image f(x,y) is defined as follows:
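$$T(u,v) = \frac{1}{\sqrt{MN}}\, C_u C_v \sum_{x=0}^{M-1} \sum_{y=0}^{N-1} f(x,y) \cos\left(\frac{(2x+1)u\pi}{2M}\right) \cos\left(\frac{(2y+1)v\pi}{2N}\right) \qquad (1)$$

where $0 \le x \le M-1$, $0 \le y \le N-1$, and

$$C_u = \begin{cases} \dfrac{1}{\sqrt{M}}, & x = 0 \\[4pt] \dfrac{2}{\sqrt{M}}, & 1 \le x \le M-1 \end{cases}, \qquad C_v = \begin{cases} \dfrac{1}{\sqrt{N}}, & y = 0 \\[4pt] \dfrac{2}{\sqrt{N}}, & 1 \le y \le N-1 \end{cases}.$$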
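As an illustration of the double sum in the definition, the following is a minimal sketch of a direct 2D DCT in plain Python. It is not the authors' implementation. The function name `dct2` is hypothetical, and the scaling is an assumption: the sketch uses the common orthonormal DCT-II convention, in which the factor depends on the frequency index (sqrt(1/M) for u = 0 and sqrt(2/M) otherwise, and likewise for v), so its values may differ from Equation (1) by a constant factor per coefficient.

```python
import math


def dct2(image):
    """Direct 2D DCT-II of a 2D list with M rows (index x) and N columns (index y).

    Assumption: orthonormal scaling, which may differ from the constants
    printed in Equation (1) by a fixed factor per coefficient.
    """
    M, N = len(image), len(image[0])

    def scale(k, size):
        # Orthonormal DCT-II factor for frequency index k (assumed convention).
        return math.sqrt((1.0 if k == 0 else 2.0) / size)

    # Precompute the cosine terms cos((2x+1)u*pi / 2M) and cos((2y+1)v*pi / 2N).
    cos_x = [[math.cos((2 * x + 1) * u * math.pi / (2 * M)) for x in range(M)]
             for u in range(M)]
    cos_y = [[math.cos((2 * y + 1) * v * math.pi / (2 * N)) for y in range(N)]
             for v in range(N)]

    T = [[0.0] * N for _ in range(M)]
    for u in range(M):
        for v in range(N):
            total = 0.0
            for x in range(M):
                for y in range(N):
                    total += image[x][y] * cos_x[u][x] * cos_y[v][y]
            T[u][v] = scale(u, M) * scale(v, N) * total
    return T


if __name__ == "__main__":
    # A constant image has all of its energy in the coefficient T(0,0).
    for row in dct2([[1, 1], [1, 1]]):
        print([round(c, 6) for c in row])
```

The direct sum costs O(M²N²) operations and is only meant to mirror the formula; for real word images a fast library transform would be the practical choice.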
After applying the DCT to the whole word image, the features are extracted in vector form by taking the most significant DCT coefficients. The steps involved in DCT feature extraction, as shown in Figure 3, are:
1. Apply the DCT to the whole word image.
2. Perform the zigzag operation on the DCT coefficients I_dct.
The zigzag matrix I_z is a row vector whose first N values are the low-frequency coefficients, which carry most of the word information. This forms the feature vector f_dct for each word.
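The zigzag step can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes the JPEG-style zigzag order, which starts at the top-left coefficient and walks the anti-diagonals in alternating directions. The names `zigzag`, `dct_feature_vector`, and `n_features` are hypothetical.

```python
def zigzag(matrix):
    """Flatten a 2D list into a row vector in JPEG-style zigzag order (assumed)."""
    rows, cols = len(matrix), len(matrix[0])
    out = []
    for s in range(rows + cols - 1):
        # All cells (r, c) on anti-diagonal s satisfy r + c == s.
        first_row = max(0, s - cols + 1)
        last_row = min(s, rows - 1)
        diagonal = [(r, s - r) for r in range(first_row, last_row + 1)]
        # Even diagonals run bottom-left to top-right, odd ones the other way.
        if s % 2 == 0:
            diagonal.reverse()
        out.extend(matrix[r][c] for r, c in diagonal)
    return out


def dct_feature_vector(coefficients, n_features):
    """Keep the first n_features entries of the zigzag-ordered coefficients."""
    return zigzag(coefficients)[:n_features]


if __name__ == "__main__":
    # Stand-in coefficient matrix whose entries are just position labels.
    demo = [[0, 1, 2],
            [3, 4, 5],
            [6, 7, 8]]
    print(zigzag(demo))
    print(dct_feature_vector(demo, 4))
```

With the 100–200 coefficients mentioned earlier, `n_features` would be set in that range. Because the scan starts at the top-left corner, truncating the vector keeps the coefficients nearest that corner.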
Document Image Processing
- Title: Document Image Processing
- Authors: Ergina Kavallieratou, Laurence Likforman-Sulem
- Publisher: MDPI
- Place: Basel
- Date: 2018
- Language: German
- License: CC BY-NC-ND 4.0
- ISBN: 978-3-03897-106-1
- Dimensions: 17.0 x 24.4 cm
- Pages: 216
- Keywords: document image processing, preprocessing, binarization, text-line segmentation, handwriting recognition, Indic/Arabic/Asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
- Category: Computer Science