We consider the problem of approximating a continuous random variable, characterized by a cumulative distribution function (cdf) F(x), by means of k points, x(1) < x(2) < . . . < x(k), with probabilities p(i), i = 1,..., k. For a given k, a criterion for determining the xi and pi of the approximating k-point discrete distribution can be the minimization of some distance to the original distribution. Here we consider the weighted Cramer-von Mises distance between the original cdf F( x) and the step-wise cdf <^> F (x) of the approximating discrete distribution, characterized by a nonnegative weighting function w( x). This problem has been already solved analytically when w(x) corresponds to the probability density function of the continuous random variable, w(x) = <(F)over cap> (x), and when w(x) is a piece-wise constant function, through a numerical iterative procedure based on a homotopy continuation approach. In this paper, we propose and implement a solution to the problem for different choices of the weighting function w(x), highlighting how the results are affected by w(x) itself and by the number of approximating points k, in addition to F(x); although an analytic solution is not usually available, yet the problem can be numerically solved through an iterative method, which alternately updates the two sub-sets of k unknowns, the x(i) 's (or a transformation thereof) and the p(i) 's, till convergence. The main apparent advantage of these discrete approximations is their universality, since they can be applied tomost continuous distributions, whether they possess or not the first moments. In order to shed some light on the proposed approaches, applications to several well-known continuous distributions (among them, the normal and the exponential) and to a practical problem where discretization is a useful tool are also illustrated.

Discrete approximations of continuous probability distributions obtained by minimizing Cramer-von Mises-type distances / A. Barbiero, A. Hitaj. - In: STATISTICAL PAPERS. - ISSN 0932-5026. - (2022), pp. 1-29. [Epub ahead of print] [10.1007/s00362-022-01356-2]

Discrete approximations of continuous probability distributions obtained by minimizing Cramer-von Mises-type distances

A. Barbiero
;
2022

Abstract

We consider the problem of approximating a continuous random variable, characterized by a cumulative distribution function (cdf) F(x), by means of k points, x(1) < x(2) < . . . < x(k), with probabilities p(i), i = 1,..., k. For a given k, a criterion for determining the xi and pi of the approximating k-point discrete distribution can be the minimization of some distance to the original distribution. Here we consider the weighted Cramer-von Mises distance between the original cdf F( x) and the step-wise cdf <^> F (x) of the approximating discrete distribution, characterized by a nonnegative weighting function w( x). This problem has been already solved analytically when w(x) corresponds to the probability density function of the continuous random variable, w(x) = <(F)over cap> (x), and when w(x) is a piece-wise constant function, through a numerical iterative procedure based on a homotopy continuation approach. In this paper, we propose and implement a solution to the problem for different choices of the weighting function w(x), highlighting how the results are affected by w(x) itself and by the number of approximating points k, in addition to F(x); although an analytic solution is not usually available, yet the problem can be numerically solved through an iterative method, which alternately updates the two sub-sets of k unknowns, the x(i) 's (or a transformation thereof) and the p(i) 's, till convergence. The main apparent advantage of these discrete approximations is their universality, since they can be applied tomost continuous distributions, whether they possess or not the first moments. In order to shed some light on the proposed approaches, applications to several well-known continuous distributions (among them, the normal and the exponential) and to a practical problem where discretization is a useful tool are also illustrated.
Cumulative distribution function; Moment matching; Quantile function; Quantization; Statistical distance
Settore SECS-S/01 - Statistica
Settore SECS-S/06 - Metodi mat. dell'economia e Scienze Attuariali e Finanziarie
2022
23-set-2022
Article (author)
File in questo prodotto:
File Dimensione Formato  
s00362-022-01356-2.pdf

accesso aperto

Descrizione: versione pubblicata online
Tipologia: Publisher's version/PDF
Dimensione 460.78 kB
Formato Adobe PDF
460.78 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/943433
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact