We consider the problem of determining the maximum value of the point-polyserial correlation between a random variable with an assigned continuous distribution and an ordinal random variable with k$$ k $$ categories, which are assigned the first k$$ k $$ natural values 1,2,…,k$$ 1,2,\dots, k $$ , and arbitrary probabilities pi$$ {p}_i $$ . For different parametric distributions, we derive a closed-form formula for the maximal point-polyserial correlation as a function of the pi$$ {p}_i $$ and of the distribution's parameters; we devise an algorithm for obtaining its maximum value numerically for any given k$$ k $$ . These maximum values and the features of the corresponding k$$ k $$ -point discrete random variables are discussed with respect to the underlying continuous distribution. Furthermore, we prove that if we do not assign the values of the ordinal random variable a priori but instead include them in the optimization problem, this latter approach is equivalent to the optimal quantization problem. In some circumstances, it leads to a significant increase in the maximum value of the point-polyserial correlation. An application to real data exemplifies the main findings. A comparison between the discretization leading to the maximum point-polyserial correlation and those obtained from optimal quantization and moment matching is sketched.

Maximal point‐polyserial correlation for non‐normal random distributions / A. Barbiero. - In: BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY. - ISSN 0007-1102. - (2024), pp. 1-37. [Epub ahead of print] [10.1111/bmsp.12362]

Maximal point‐polyserial correlation for non‐normal random distributions

A. Barbiero
Primo
2024

Abstract

We consider the problem of determining the maximum value of the point-polyserial correlation between a random variable with an assigned continuous distribution and an ordinal random variable with k$$ k $$ categories, which are assigned the first k$$ k $$ natural values 1,2,…,k$$ 1,2,\dots, k $$ , and arbitrary probabilities pi$$ {p}_i $$ . For different parametric distributions, we derive a closed-form formula for the maximal point-polyserial correlation as a function of the pi$$ {p}_i $$ and of the distribution's parameters; we devise an algorithm for obtaining its maximum value numerically for any given k$$ k $$ . These maximum values and the features of the corresponding k$$ k $$ -point discrete random variables are discussed with respect to the underlying continuous distribution. Furthermore, we prove that if we do not assign the values of the ordinal random variable a priori but instead include them in the optimization problem, this latter approach is equivalent to the optimal quantization problem. In some circumstances, it leads to a significant increase in the maximum value of the point-polyserial correlation. An application to real data exemplifies the main findings. A comparison between the discretization leading to the maximum point-polyserial correlation and those obtained from optimal quantization and moment matching is sketched.
attainable correlations; biserial correlation; discretization; latent variable; non‐normal distribution
Settore STAT-01/A - Statistica
2024
22-ott-2024
Article (author)
File in questo prodotto:
File Dimensione Formato  
Brit J Math Statis - 2024 - Barbiero - Maximal point‐polyserial correlation for non‐normal random distributions.pdf

accesso aperto

Descrizione: versione pubblicata
Tipologia: Publisher's version/PDF
Dimensione 2.54 MB
Formato Adobe PDF
2.54 MB Adobe PDF Visualizza/Apri
BMSP.pdf

accesso aperto

Descrizione: prima versione dell'articolo inviata alla rivista, liberamente scaricabile
Tipologia: Pre-print (manoscritto inviato all'editore)
Dimensione 243.02 kB
Formato Adobe PDF
243.02 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1119335
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact