IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca

We consider the problem of determining the maximum value of the point-polyserial correlation between a random variable with an assigned continuous distribution and an ordinal random variable with k$$ k $$ categories, which are assigned the first k$$ k $$ natural values 1,2,…,k$$ 1,2,\dots, k $$ , and arbitrary probabilities pi$$ {p}_i $$ . For different parametric distributions, we derive a closed-form formula for the maximal point-polyserial correlation as a function of the pi$$ {p}_i $$ and of the distribution's parameters; we devise an algorithm for obtaining its maximum value numerically for any given k$$ k $$ . These maximum values and the features of the corresponding k$$ k $$ -point discrete random variables are discussed with respect to the underlying continuous distribution. Furthermore, we prove that if we do not assign the values of the ordinal random variable a priori but instead include them in the optimization problem, this latter approach is equivalent to the optimal quantization problem. In some circumstances, it leads to a significant increase in the maximum value of the point-polyserial correlation. An application to real data exemplifies the main findings. A comparison between the discretization leading to the maximum point-polyserial correlation and those obtained from optimal quantization and moment matching is sketched.

Maximal point‐polyserial correlation for non‐normal random distributions / A. Barbiero. - In: BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY. - ISSN 0007-1102. - 78:1(2025 Feb), pp. 341-377. [10.1111/bmsp.12362]

Maximal point‐polyserial correlation for non‐normal random distributions

A. Barbiero^Primo

2025

Abstract

We consider the problem of determining the maximum value of the point-polyserial correlation between a random variable with an assigned continuous distribution and an ordinal random variable with k$$ k $$ categories, which are assigned the first k$$ k $$ natural values 1,2,…,k$$ 1,2,\dots, k $$ , and arbitrary probabilities pi$$ {p}_i $$ . For different parametric distributions, we derive a closed-form formula for the maximal point-polyserial correlation as a function of the pi$$ {p}_i $$ and of the distribution's parameters; we devise an algorithm for obtaining its maximum value numerically for any given k$$ k $$ . These maximum values and the features of the corresponding k$$ k $$ -point discrete random variables are discussed with respect to the underlying continuous distribution. Furthermore, we prove that if we do not assign the values of the ordinal random variable a priori but instead include them in the optimization problem, this latter approach is equivalent to the optimal quantization problem. In some circumstances, it leads to a significant increase in the maximum value of the point-polyserial correlation. An application to real data exemplifies the main findings. A comparison between the discretization leading to the maximum point-polyserial correlation and those obtained from optimal quantization and moment matching is sketched.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				attainable correlations; biserial correlation; discretization; latent variable; non‐normal distribution
			
	Settori scientifico-disciplinari dell'articolo (validi dal 09/05/2024)
	
				Settore STAT-01/A - Statistica
			
	Data di pubblicazione
	
				feb-2025
			
	Data ahead of print o data di stampa
	
				22-ott-2024
			
	Rivista in ANCE
	
				BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY
			
	DOI
	
				https://dx.doi.org/10.1111/bmsp.12362
			
	Tipologia
	
				Article (author)
			
	Appare nelle tipologie:
	
				01 - Articolo su periodico

File in questo prodotto:

File	Dimensione	Formato
Brit J Math Statis - 2024 - Barbiero - Maximal point‐polyserial correlation for non‐normal random distributions.pdf accesso aperto Descrizione: versione pubblicata, online first Tipologia: Publisher's version/PDF Licenza: Creative commons Dimensione 2.54 MB Formato Adobe PDF Visualizza/Apri	2.54 MB	Adobe PDF	Visualizza/Apri
BMSP.pdf accesso aperto Descrizione: prima versione dell'articolo inviata alla rivista, liberamente scaricabile Tipologia: Pre-print (manoscritto inviato all'editore) Dimensione 243.02 kB Formato Adobe PDF Visualizza/Apri	243.02 kB	Adobe PDF	Visualizza/Apri
Brit J Math Statis - 2024 - Barbiero - Maximal point‐polyserial correlation for non‐normal random distributions.pdf accesso aperto Tipologia: Publisher's version/PDF Licenza: Creative commons Dimensione 2.46 MB Formato Adobe PDF Visualizza/Apri	2.46 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1119335

Citazioni

1

1

1

ND

social impact