Page - 442 - in Differential Geometrical Theory of Statistics

Image of the Page - 442 -

Text of the Page - 442 -

Entropy2016,18, 110 In thismodel,weonlyconsiderbinaryparameterswithvalues±1 (here identiﬁedwith letters0 or1 inF2)andweignoreparameters inaneutralizedstate following implicationsacrossparameters, as in thedatasetsof [3,4]. Theentailmentofparameters, that is, thephenomenonbywhichaparticular valueofoneparameter (butnot thecomplementaryvalue) rendersanotherparameter irrelevant,was addressed ingreaterdetail in [22].Weﬁrstdiscussaversionofourcodingtheorymodel thatdoesnot incorporateentailment.Wewill thencomment inSection2.7belowonhowthemodelcanbemodiﬁed to incorporate thisphenomenon. The idea thatnatural languagescanbedescribed,at the levelof their coregrammatical structures, in termsofastringofbinarycharacters (codewords)wasalreadyusedextensively in [23]. 2.1. CodeParameters In the theory of error-correcting codes, one assigns two main parameters to a code C, the transmission rate and the relative minimum distance. More precisely, a binary code C ⊂ Fn2 is an [n,k,d]2-code if thenumberofcodewords is#C=2k, that is, k= log2#C, (1) wherekneednotbean integer,andtheminimalHammingdistancebetweencodewords is d= min L1 =L2∈C dH(L1,L2), (2) where theHammingdistance isgivenby dH(L1,L2)= n ∑ i=1 |xi−yi|, forL1=(xi)ni=1 andL2=(yi) n i=1 inC. The transmissionrateof thecodeC isgivenby R= k n . (3) OnedenotesbyδH(L1,L2) therelativeHammingdistance δH(L1,L2)= 1 n n ∑ i=1 |xi−yi|, andonedeﬁnes therelativeminimumdistanceof thecodeCas δ= d n = min L1 =L2∈C δH(L1,L2). (4) Incodingtheory,onewouldlike toconstructcodes thatsimultaneouslyoptimizebothparameters (δ,R): a largervalueofR representsa faster transmissionrate (betterencoding),anda largervalueof δ represents the fact thatcodewordsaresufﬁcientlysparse in theambientspaceFn2 (betterdecoding, withbettererror-correctingcapability). Constraintsonthisoptimizationproblemareexpressed in the formofbounds in thespaceof (δ,R)parameters, see [12,13]. Inoursetting, theRparametermeasures theratiobetweenthe logarithmicsizeof thenumberof languagesencompassingthegivenfamilyandthetotalnumberofparameters,orequivalentlyhow densely thegiven languagefamily is in theambientconﬁgurationspaceFn2 ofparameterpossibilities. The parameter δ is theminimum, over all pairs of languages in the given family, of the relative Hammingdistanceusedin thePCMmethodof [3,4]. 442

back to the book Differential Geometrical Theory of Statistics"

Differential Geometrical Theory of Statistics

Title: Differential Geometrical Theory of Statistics
Authors: Frédéric Barbaresco; Frank Nielsen
Editor: MDPI
Location: Basel
Date: 2017
Language: English
License: CC BY-NC-ND 4.0
ISBN: 978-3-03842-425-3
Size: 17.0 x 24.4 cm
Pages: 476
Keywords: Entropy, Coding Theory, Maximum entropy, Information geometry, Computational Information Geometry, Hessian Geometry, Divergence Geometry, Information topology, Cohomology, Shape Space, Statistical physics, Thermodynamics
Categories: Naturwissenschaften Physik