We consider the problem of determining the maximum value of the point-polyserial correlation between a random variable with an assigned continuous distribution and an ordinal random variable with k$$ k $$ categories, which are assigned the first k$$ k $$ natural values 1,2,…,k$$ 1,2,\dots, k $$ , and arbitrary probabilities pi$$ {p}_i $$ . For different parametric distributions, we derive a closed-form formula for the maximal point-polyserial correlation as a function of the pi$$ {p}_i $$ and of the distribution's parameters; we devise an algorithm for obtaining its maximum value numerically for any given k$$ k $$ . These maximum values and the features of the corresponding k$$ k $$ -point discrete random variables are discussed with respect to the underlying continuous distribution. Furthermore, we prove that if we do not assign the values of the ordinal random variable a priori but instead include them in the optimization problem, this latter approach is equivalent to the optimal quantization problem. In some circumstances, it leads to a significant increase in the maximum value of the point-polyserial correlation. An application to real data exemplifies the main findings. A comparison between the discretization leading to the maximum point-polyserial correlation and those obtained from optimal quantization and moment matching is sketched.
Maximal point‐polyserial correlation for non‐normal random distributions / A. Barbiero. - In: BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY. - ISSN 0007-1102. - (2024), pp. 1-37. [Epub ahead of print] [10.1111/bmsp.12362]
Maximal point‐polyserial correlation for non‐normal random distributions
A. Barbiero
Primo
2024
Abstract
We consider the problem of determining the maximum value of the point-polyserial correlation between a random variable with an assigned continuous distribution and an ordinal random variable with k$$ k $$ categories, which are assigned the first k$$ k $$ natural values 1,2,…,k$$ 1,2,\dots, k $$ , and arbitrary probabilities pi$$ {p}_i $$ . For different parametric distributions, we derive a closed-form formula for the maximal point-polyserial correlation as a function of the pi$$ {p}_i $$ and of the distribution's parameters; we devise an algorithm for obtaining its maximum value numerically for any given k$$ k $$ . These maximum values and the features of the corresponding k$$ k $$ -point discrete random variables are discussed with respect to the underlying continuous distribution. Furthermore, we prove that if we do not assign the values of the ordinal random variable a priori but instead include them in the optimization problem, this latter approach is equivalent to the optimal quantization problem. In some circumstances, it leads to a significant increase in the maximum value of the point-polyserial correlation. An application to real data exemplifies the main findings. A comparison between the discretization leading to the maximum point-polyserial correlation and those obtained from optimal quantization and moment matching is sketched.File | Dimensione | Formato | |
---|---|---|---|
Brit J Math Statis - 2024 - Barbiero - Maximal point‐polyserial correlation for non‐normal random distributions.pdf
accesso aperto
Descrizione: versione pubblicata
Tipologia:
Publisher's version/PDF
Dimensione
2.54 MB
Formato
Adobe PDF
|
2.54 MB | Adobe PDF | Visualizza/Apri |
BMSP.pdf
accesso aperto
Descrizione: prima versione dell'articolo inviata alla rivista, liberamente scaricabile
Tipologia:
Pre-print (manoscritto inviato all'editore)
Dimensione
243.02 kB
Formato
Adobe PDF
|
243.02 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.