We consider the problem of determining the maximum value of the point-polyserial correlation between a random variable with an assigned continuous distribution and an ordinal random variable with k$$ k $$ categories, which are assigned the first k$$ k $$ natural values 1,2,…,k$$ 1,2,\dots, k $$ , and arbitrary probabilities pi$$ {p}_i $$ . For different parametric distributions, we derive a closed-form formula for the maximal point-polyserial correlation as a function of the pi$$ {p}_i $$ and of the distribution's parameters; we devise an algorithm for obtaining its maximum value numerically for any given k$$ k $$ . These maximum values and the features of the corresponding k$$ k $$ -point discrete random variables are discussed with respect to the underlying continuous distribution. Furthermore, we prove that if we do not assign the values of the ordinal random variable a priori but instead include them in the optimization problem, this latter approach is equivalent to the optimal quantization problem. In some circumstances, it leads to a significant increase in the maximum value of the point-polyserial correlation. An application to real data exemplifies the main findings. A comparison between the discretization leading to the maximum point-polyserial correlation and those obtained from optimal quantization and moment matching is sketched.
Maximal point‐polyserial correlation for non‐normal random distributions / A. Barbiero. - In: BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY. - ISSN 0007-1102. - 78:1(2025 Feb), pp. 341-377. [10.1111/bmsp.12362]
Maximal point‐polyserial correlation for non‐normal random distributions
A. Barbiero
Primo
2025
Abstract
We consider the problem of determining the maximum value of the point-polyserial correlation between a random variable with an assigned continuous distribution and an ordinal random variable with k$$ k $$ categories, which are assigned the first k$$ k $$ natural values 1,2,…,k$$ 1,2,\dots, k $$ , and arbitrary probabilities pi$$ {p}_i $$ . For different parametric distributions, we derive a closed-form formula for the maximal point-polyserial correlation as a function of the pi$$ {p}_i $$ and of the distribution's parameters; we devise an algorithm for obtaining its maximum value numerically for any given k$$ k $$ . These maximum values and the features of the corresponding k$$ k $$ -point discrete random variables are discussed with respect to the underlying continuous distribution. Furthermore, we prove that if we do not assign the values of the ordinal random variable a priori but instead include them in the optimization problem, this latter approach is equivalent to the optimal quantization problem. In some circumstances, it leads to a significant increase in the maximum value of the point-polyserial correlation. An application to real data exemplifies the main findings. A comparison between the discretization leading to the maximum point-polyserial correlation and those obtained from optimal quantization and moment matching is sketched.| File | Dimensione | Formato | |
|---|---|---|---|
|
Brit J Math Statis - 2024 - Barbiero - Maximal point‐polyserial correlation for non‐normal random distributions.pdf
accesso aperto
Descrizione: versione pubblicata, online first
Tipologia:
Publisher's version/PDF
Licenza:
Creative commons
Dimensione
2.54 MB
Formato
Adobe PDF
|
2.54 MB | Adobe PDF | Visualizza/Apri |
|
BMSP.pdf
accesso aperto
Descrizione: prima versione dell'articolo inviata alla rivista, liberamente scaricabile
Tipologia:
Pre-print (manoscritto inviato all'editore)
Dimensione
243.02 kB
Formato
Adobe PDF
|
243.02 kB | Adobe PDF | Visualizza/Apri |
|
Brit J Math Statis - 2024 - Barbiero - Maximal point‐polyserial correlation for non‐normal random distributions.pdf
accesso aperto
Tipologia:
Publisher's version/PDF
Licenza:
Creative commons
Dimensione
2.46 MB
Formato
Adobe PDF
|
2.46 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




