Page - 196 - in Document Image Processing
Image of the Page - 196 -
Text of the Page - 196 -
J. Imaging 2018,4, 32
typesofXMLfileshavebeengenerated,basedonthe informationcontainedintheglobalXMLfile,
one for thedetectiondataset and the other for the recognitiondataset. ThedetectionXMLfile is
providedat the line level foreachframe. Figure9adepictsapartof thedetectionXMLfileofFrance24
TVchannel. Oneboundingbox isdescribedby theelementRectanglewhichcontains the rectangle
attributes: (x,y) coordinates,widthandheight. Therecognitionground-truthfilesareprovidedat the
line level foreachtext image. TheXMLfile iscomposedof twomarkupsections:ArabicTranscription
andLatinTranscription. Figure9bdepictsanexampleofaground-truthXMLfileandits textline image.
Figure8.ApartofaglobalXMLannotatingavideosequenceofAljazeeraTV.Thisfigurecontains
ground-truth informationabout three text-boxes fromatotalof17.
a
b
Figure9.ExampleofAcTiV2.0specificXMLfiles: (a) apartof thedetectionXMLfileofFrance24TV;
and(b) a recognitionground-truthfileanditscorrespondingtextline image.
196
back to the
book Document Image Processing"
Document Image Processing
- Title
- Document Image Processing
- Authors
- Ergina Kavallieratou
- Laurence Likforman-Sulem
- Editor
- MDPI
- Location
- Basel
- Date
- 2018
- Language
- German
- License
- CC BY-NC-ND 4.0
- ISBN
- 978-3-03897-106-1
- Size
- 17.0 x 24.4 cm
- Pages
- 216
- Keywords
- document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
- Category
- Informatik