Document Image Processing

Page 191 of Document Image Processing

J. Imaging 2018, 4, 32

segmentation task, the best F-score, 90%, was obtained by Mishra et al. [30]. The algorithm is mainly based on two steps: a GMM refinement using stroke and color features, and a graph cut procedure.

The KAIST dataset [31] consists of 3000 images taken in indoor and outdoor scenes (see Figure 2d for examples). This is a multilingual dataset, which includes English and Korean texts. KAIST can be used for both detection and segmentation tasks, as it provides binary masks for each character in the image. The text segmentation algorithm of Zhu and Zhang [32] outperforms existing methods on this dataset with an F-score of 88%. The method is based on superpixel clustering. First, an adaptive SLIC text superpixel generation procedure is performed. Next, a DBSCAN-based superpixel clustering is used to fuse stroke superpixels. Finally, a stroke superpixel verification process is applied.

The NEOCR dataset [33] contains 659 natural scene images with multi-oriented texts of high variability (see Figure 2c for examples). This database is intended for scene text recognition and provides a multilingual evaluation environment, as it includes texts in eight European languages.

In 2016, Veit et al. [34] proposed a dataset for English scene text detection and recognition called COCO-Text. The dataset is based on the Microsoft COCO dataset, which contains images of complex everyday scenes. The best result on this dataset (67.16%) was obtained by the winner of the COCO-Text ICDAR2017 competition [35]. Note that the methods participating in this competition were ranked based on their Average Precision (AP) with an Intersection over Union (IoU) of 0.5.

Recently, Chng and Chan [36] introduced a new dataset, namely Total-text, for curved scene text detection and recognition problems. It contains 1555 scene images and 9330 annotated words with three different text orientations.

Figure 3. Some examples of text detection systems [18–20] showing the evolution of this area of research over ten years.
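The IoU criterion and the F-scores quoted above can be illustrated with a minimal sketch. It assumes axis-aligned boxes given as (x_min, y_min, x_max, y_max) and a simple greedy one-to-one matching. The names `iou`, `match_detections`, and `f_score` are hypothetical, and this is not the official protocol of any benchmark mentioned here. In particular, it does not compute Average Precision, which additionally requires ranking detections by confidence.

```python
from typing import List, Tuple

# Assumed box format: (x_min, y_min, x_max, y_max), axis-aligned.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over Union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def match_detections(preds: List[Box], gts: List[Box],
                     thr: float = 0.5) -> Tuple[int, int, int]:
    """Greedy one-to-one matching (an assumption of this sketch).

    A prediction counts as a true positive if its best still-unmatched
    ground-truth box has IoU >= thr. Returns (tp, fp, fn).
    """
    unmatched = list(range(len(gts)))
    tp = 0
    for p in preds:
        best_j, best_iou = None, thr
        for j in unmatched:
            v = iou(p, gts[j])
            if v >= best_iou:
                best_j, best_iou = j, v
        if best_j is not None:
            unmatched.remove(best_j)
            tp += 1
    return tp, len(preds) - tp, len(gts) - tp


def f_score(tp: int, fp: int, fn: int) -> float:
    """Harmonic mean of precision and recall."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

For example, with two predicted boxes of which one coincides with a ground-truth box, and one ground-truth box left undetected, `match_detections` yields one true positive, one false positive, and one false negative, so precision and recall are both 0.5.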
As for Arabic language, major contributions have already been made in the conventional field of printed and handwritten OCR systems [7,10]. Much progress of such systems has been triggered thanks to the availability of public datasets. Examples include the IFN/ENIT [37] and KHATT [38] datasets for offline handwriting recognition and writer identification; the APTI database [39] for printed word recognition; and the ADAB dataset [40] that works on online handwriting recognition. However, handling Arabic text detection and recognition for multimedia documents is limited to very few studies [41–43].

Table 1 presents commonly used datasets for text processing in images and videos, and summarizes their features in terms of text categories, sources, tasks, script, information of
Document Image Processing
Title
Document Image Processing
Authors
Ergina Kavallieratou
Laurence Likforman-Sulem
Publisher
MDPI
Place
Basel
Date
2018
Language
German
License
CC BY-NC-ND 4.0
ISBN
978-3-03897-106-1
Dimensions
17.0 x 24.4 cm
Pages
216
Keywords
document image processing, preprocessing, binarization, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
Category
Computer Science