Seite - 358 - in Differential Geometrical Theory of Statistics

Bild der Seite - 358 -

Text der Seite - 358 -

Entropy2016,18, 396 Themean-shift isdeﬁnedby m(x)= k ∑ i=1 1 rn+2g ( d(x,xi)2 r2 ) ∑ki=1 1 rn+2g ( d(x,xi)2 r2 )logx(xi)∝∇fKrfgr , wherem(x) is in the tangentspaceatx. Thealgorithmmoves fromx to expx(m(x))until convergence toa localmaximum.Thepointsof thespacearesegmentedaccordingto the localmaximatowhich theyconverge. In order to assess the quality of unsupervised classiﬁcation,weuse the notion of Silhouette, see [36],whichcomputes foreachpointaproximitycriterionwithrespect tootherpointsof thesame clusterandotherpointsofdifferentclusters (seeFigure4). Letxbe in theclusterA.Werespectively deﬁne a(x)=miny∈Ad(x,y)andb(x)=miny =Ad(x,y), theminimumdistance topointsof thesame (resp. other) class(es). TheSilhouetteofx is a(x)−b(x) max{a(x),b(x)}, which takesvaluesbetween−1and1, respectively,whenthedatapoint is considered“badly”and “well” clustered. The average of all the silhouettes provides an indication of the relevance of the classiﬁcation.Onecanrepresentgraphically thesilhouetteproﬁlebyplotting foreachclasshorizontal segmentsof the lengthof thesilhouettevalue (seeFigure5). Figure4. Intraandinterclusterdistances. Figure5.Exampleofsilhouette. 358

zurück zum Buch Differential Geometrical Theory of Statistics"

Differential Geometrical Theory of Statistics

Titel: Differential Geometrical Theory of Statistics
Autoren: Frédéric Barbaresco; Frank Nielsen
Herausgeber: MDPI
Ort: Basel
Datum: 2017
Sprache: englisch
Lizenz: CC BY-NC-ND 4.0
ISBN: 978-3-03842-425-3
Abmessungen: 17.0 x 24.4 cm
Seiten: 476
Schlagwörter: Entropy, Coding Theory, Maximum entropy, Information geometry, Computational Information Geometry, Hessian Geometry, Divergence Geometry, Information topology, Cohomology, Shape Space, Statistical physics, Thermodynamics
Kategorien: Naturwissenschaften Physik