Seite - 93 - in Document Image Processing

Bild der Seite - 93 -

Text der Seite - 93 -

J. Imaging 2018,4, 41 Figure2.Layer-wise trainingofdeepconvolutionalneuralnetwork. Algorithm1.Layerwise trainingofdeepconvolutionalneuralnetwork INPUT:Model,T, t,α1,α2,n\\T=(TrainData), t= (TestData), OUTPUT:TM \\TrainedModel Begin\\Addﬁrst layerofconvolutional layerandpooling layer Model.add (xCy,T,Relu) Model.add (xPy) Model.add (xFC) Model.add (xOU) Model.compile (optimizer) Model.ﬁt (T, t,α1) forall I :=1: n-1step1do \\Removethe last twolayers (FC&OU) ofexistingmodel toaddnext layerofconvolutionalandpooling Model.layer.pop() Model.layer.pop() Model.add (xCy,T,Relu) Model.add (xPy) \\Againaddedfullyconnectedandoutput layer Model.add (xFC) Model.add (xOU) Model.compile (optimizer) Model.ﬁt (T, t,α1)\\Trainedthemodelwithhigh learningrate endfor Model.ﬁt (T, t,α2)\\Performﬁnetuningwith lowlearningrate end 4. ExperimentsandDiscussions Experimentswerecarriedoutontwodatabases: ISIDCHARandV2DMDCHARusingtheDCNN, layer-wiseDCNNanddifferentadaptivegradientmethods.As it ishardtodelineate thenumberof layersofDCNNthatcanproduce thebest result,weconsideredsixdifferentnetworkarchitectures (NA) ofDCNNas shown inTable 1. NA-1 contains only single convolutional-pooling layer and 500 fully connected neurons to observe the ﬁrst response ofDCNN. The next, NA-2 has double thenumberof fullyconnectedneurons. Theaimis toobserve the impactofenhancement. Further, NA-3andNA-4have twoC-P layerswithvariation in thenumberofkernels toanalysis the impactof twoC-Players. The last,NA-5andNA-6havethreeC-P layers. Initially, thedifferentnetworkarchitecturesofDCNNwereappliedoneachdatabase toﬁndout thebestmodel for thatparticulardatabaseandthentheproposedlayer-wiseDCNNwasappliedto observe the impactof thatmodel. Themodelshavealsobeentestedwithdifferentadaptivegradient methodsto thesemethods; theyarealsounderexperiment toobserve theirperformance.Ourwork alsoshowsthe impactofdifferentadaptivegradientmethodsonrecognitionaccuracy. 93

zurück zum Buch Document Image Processing"

Document Image Processing

Titel: Document Image Processing
Autoren: Ergina Kavallieratou; Laurence Likforman-Sulem
Herausgeber: MDPI
Ort: Basel
Datum: 2018
Sprache: deutsch
Lizenz: CC BY-NC-ND 4.0
ISBN: 978-3-03897-106-1
Abmessungen: 17.0 x 24.4 cm
Seiten: 216
Schlagwörter: document image processing, preprocessing, binarizationl, text-line segmentation, handwriting recognition, indic/arabic/asian script, OCR, Video OCR, word spotting, retrieval, document datasets, performance evaluation, document annotation tools
Kategorie: Informatik