Page - 334 - in Differential Geometrical Theory of Statistics
Image of the Page - 334 -
Text of the Page - 334 -
Entropy2016,18, 421
Foreachtopic, theresultspresented invite further investigation.
4.1. TransitionBetweenDiscrete andContinuousFeaturesofSamplingDistributions
Earlierwork[2]usedthedecomposition:
Dβ/2= β
{0β€iβ€k:nβi>0} nβi log(n β
i/ΞΌi)=Ξ β+Ξβ,
Ξβ := k
β
i=0 Ξ±inβi andΞ β := β
{0β€iβ€k:nβi>1} nβi logn β
i β₯0, whereΞ±i :=β logΞΌi,
toshowthataparticularlybadcasefor theadequacyofanycontinuousapproximationtothesampling
distributionof thedevianceD :=Dβ|(Nβ=N) is theuniformdiscretedistribution:Οi=1/(k+1).
In thiscase, theΞβ termcontributesaconstant to thedeviance,while theΞβ termhasnocontributions
fromcellswith0or1observationsβthesebeinginthevastmajorityintheN<< ksituationconsidered
here. Inotherwords, allof thevariability inD comes fromthatbetween theni lognivalues for the
(relativelyrare) cell countsabove1. Thisgivesrise toadiscretenessphenomenontermedβgranularityβ
in [2],whosemeaningwas conveyedgraphically there in the caseN= 30 and k= 200. Workby
Holst [19]predicts thatcontinuous (indeed,normal)approximationswill improvewith largervalues
ofN/k, as is intuitive. Remarkably, simplydoubling thesamplesize toN=60wasshownin[2] tobe
sufο¬cient togiveagoodenoughapproximationformostgoodness-of-ο¬t testingpurposes. Inother
words,Nbeing30%ofk=200wasfoundtobegoodenoughforpracticalpurposes.
Here,we illustrate the roleof k-asymptotics (Section2) in this transitionbetweendiscreteand
continuous features by repeating the above analyses for different values of k. Figures 3 and 4
(wherek= 100whileN= 20and40, respectively)arequalitatively thesameas thosepresentedin[2].
Thedifferencehere is that thesmallervalueof kmeans thatahighervalueofN/k (40%) isneeded
inFigure4 toadequately remove thegranularityevident inFigure3. For k= 400, theο¬gureswith
N=50andN=100(omittedhere forbrevity)are,again,qualitatively thesameas in [2]βthe larger
valueofkneedingonlyasmallervalueofN/k (25%) forpracticalpurposes. Note theQQ-plotsused
in these twoο¬guresarerelative tonormalquantiles.
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
0 20 40 60 80 100
(a) Null distribution, N = 20
Rank of cell probability β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
β
0 200 400 600 800 1000
(b) Sample of Deviance Statistic
Index β
β
β β
β
β β β
β β
β
ββ β
β
ββ β
ββ β β
β β
ββββ β
β
β β
ββ β
ββ β
β
β β
β β
β β
β
β β
β β β
β
β
β β β
β β
β
β β
β
β β β
β
β β
β β β
β
β β
β β β β
β
β β
β β β
β
β β β β β
β β β β
β
β
β β
β
β
β3 β2 β1 0 1 2 3
(c) QQplot Deviance Statistic
Theoretical Quantiles
Figure3. k=100,N=20.
334
Differential Geometrical Theory of Statistics
- Title
- Differential Geometrical Theory of Statistics
- Authors
- FrΓ©dΓ©ric Barbaresco
- Frank Nielsen
- Editor
- MDPI
- Location
- Basel
- Date
- 2017
- Language
- English
- License
- CC BY-NC-ND 4.0
- ISBN
- 978-3-03842-425-3
- Size
- 17.0 x 24.4 cm
- Pages
- 476
- Keywords
- Entropy, Coding Theory, Maximum entropy, Information geometry, Computational Information Geometry, Hessian Geometry, Divergence Geometry, Information topology, Cohomology, Shape Space, Statistical physics, Thermodynamics
- Categories
- Naturwissenschaften Physik