Seite - 327 - in Differential Geometrical Theory of Statistics
Bild der Seite - 327 -
Text der Seite - 327 -
Entropy2016,18, 421
Theorem1. Fork>1andNâ„6, theïŹrst threemomentsofWare:
E(W)= k
N , Var(W)= {
Ï(â1)â(k+1)2 }
+2k(Nâ1)
N3
andE[{WâE(W)}3]givenby
{
Ï(â2)â(k+1)3 }
â(3k+25â22N) {
Ï(â1)â(k+1)2 }
+g(k,N)
N5 ,
whereg(k,N)=4(Nâ1)k(k+2Nâ5)>0.
Inparticular, forïŹxedkandN,asÏminâ0
Var(W)ââandÎł(W)â+â,
whereÎł(W) :=E[{WâE(W)}3]/{Var(W)}3/2.
Adetailedproof is foundinAppendixA,andwegivehereanoutlineof its important features.
Themachinerydeveloped iscapableofdeliveringmuchmore thanaproofofTheorem1.As indicated
there, it provides a generic way to explicitly compute arbitrary moments or mixed moments of
multinomialcounts,andcould inprinciplebe implementedbycomputeralgebra.Overall, thereare
fourstages. First, akeyrecurrencerelation isestablished; secondly, it is exploitedtodelivermoments
of a single cell count. Third,mixedmoments of anyorder arederived from those of lower order,
exploitingacertainfunctionaldependence. Finally, resultsarecombinedtoïŹndtheïŹrst threemoments
ofW, highermomentsbeingsimilarlyobtainable.
The practical implication of Theorem 1 is that standard ïŹrst (and higher-order) asymptotic
approximations to thesamplingdistributionof theWald,Ï2, andscorestatisticsbreakdownwhen
thedata generationprocess is âclose toâ the boundary,where at least one cell probability is zero.
This result isqualitativelysimilar toresults in [10],whichshowshowasymptoticapproximations to
thedistributionof themaximumlikelihoodestimate fail; forexample, in thecaseof logistic regression,
whentheboundary isclose in termsofdistancesasdeïŹnedbytheFisher information.
Unlike statistics considered in Theorem 1, the deviance has a workable distribution in the
same limit: that is, for ïŹxed N and k as we approach the boundary of the probability simplex.
Insharpcontrast to that theorem,wesee theverystableandworkablebehaviourof thek-asymptotic
approximation to thedistributionof thedeviance, inwhich thenumberofcells increaseswithout limit.
DeïŹnethedevianceDvia
D/2 = â{0â€iâ€k:ni>0}ni log(ni/N)â k
â
i=0 ni log(Ïi)
= â{0â€iâ€k:ni>0}ni log(ni/ÎŒi),
whereÎŒi :=E(ni)=NÏi. Wewill exploit the characterisation that themultinomial randomvector
(ni)has thesamedistributionasavectorof independentPoissonrandomvariablesconditionedon
their sum. SpeciïŹcally, let theelementsof (nâi )be independentlydistributedasPoissonPo(ÎŒi). Then,
Nâ :=âki=0nâi âŒPo(N),while (ni) :=(nâi |Nâ=N)⌠Multinomial(N,(Ïi)). DeïŹnethevector
Sâ := (
Nâ
Dâ/2 )
= k
â
i=0 (
nâi
nâi log(n â
i/ÎŒi) )
,
327
Differential Geometrical Theory of Statistics
- Titel
- Differential Geometrical Theory of Statistics
- Autoren
- Frédéric Barbaresco
- Frank Nielsen
- Herausgeber
- MDPI
- Ort
- Basel
- Datum
- 2017
- Sprache
- englisch
- Lizenz
- CC BY-NC-ND 4.0
- ISBN
- 978-3-03842-425-3
- Abmessungen
- 17.0 x 24.4 cm
- Seiten
- 476
- Schlagwörter
- Entropy, Coding Theory, Maximum entropy, Information geometry, Computational Information Geometry, Hessian Geometry, Divergence Geometry, Information topology, Cohomology, Shape Space, Statistical physics, Thermodynamics
- Kategorien
- Naturwissenschaften Physik