Estimation of experimental data redundancy and related statistics
I. Grabec
Igor Grabec (updated 2007-10-10)
https://arxiv.org/abs/0704.0162
Redundancy of experimental data is the basic statistic from which the complexity of a natural phenomenon and the proper number of experiments needed for its exploration can be estimated. The redundancy is expressed by the entropy of information pertaining to the probability density function of experimental variables. Since the calculation of entropy is inconvenient due to integration over a range of variables, an approximate expression for redundancy is derived that includes only a sum over the set of experimental data about these variables. The approximation makes feasible an efficient estimation of the redundancy of data along with the related experimental information and information cost function. From the experimental information the complexity of the phenomenon can be simply estimated, while the proper number of experiments needed for its exploration can be determined from the minimum of the cost function. The performance of the approximate estimation of these statistics is demonstrated on two-dimensional normally distributed random data.
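The general idea of replacing an entropy integral with a sum over the data can be illustrated with a minimal sketch. It is not the approximate expression derived in the paper, which the abstract does not state. It assumes a standard resubstitution estimator, H ≈ -(1/N) Σ log f̂(x_i), where f̂ is a Gaussian kernel density estimate. The function name `kde_entropy`, the bandwidth of 0.3, and the sample size of 1000 are illustrative choices, not values taken from the source. The test data are two-dimensional standard normal, for which the exact differential entropy log(2πe) is known and can serve as a reference.

```python
import numpy as np


def kde_entropy(samples, bandwidth):
    """Resubstitution entropy estimate: -(1/N) * sum_i log f_hat(x_i).

    f_hat is an isotropic Gaussian kernel density estimate built from the
    same samples (an assumption of this sketch, not the paper's formula).
    """
    x = np.asarray(samples, dtype=float)
    n, d = x.shape
    # Squared Euclidean distances between all sample pairs, shape (n, n).
    diff = x[:, None, :] - x[None, :, :]
    sq_dist = np.sum(diff ** 2, axis=-1)
    # Normalisation constant of a d-dimensional Gaussian kernel.
    norm = (2.0 * np.pi * bandwidth ** 2) ** (d / 2.0)
    # Density estimate at each sample: average of kernels centred on all samples.
    density = np.exp(-sq_dist / (2.0 * bandwidth ** 2)).sum(axis=1) / (n * norm)
    # The integral over the variables is replaced by a mean over the data.
    return -np.mean(np.log(density))


rng = np.random.default_rng(0)
data = rng.standard_normal((1000, 2))   # two-dimensional normal test data
h_est = kde_entropy(data, bandwidth=0.3)
h_exact = np.log(2.0 * np.pi * np.e)    # exact entropy of a 2-D standard normal
print(f"estimated entropy: {h_est:.3f}, exact: {h_exact:.3f}")
```

The estimate depends on the chosen bandwidth and on the number of samples, so it should be read as a rough check against the analytic value rather than as a precise result.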