We introduce a new scalar coefficient to measure linear correlation between random vec- tors which preserves all the relevant properties of Pearson’s correlation in arbitrarily large dimensions. The new measure and its bounds are derived from a mass transportation approach in which the expected inner product of two random vectors is taken as a measure of their covariance and then standardized by the maximal attainable value given their mar- ginal covariance matrices. The new correlation is maximized when the average squared Euclidean distance between the random vectors is minimal and attains value one when, additionally, it is possible to establish an affine relationship between the vectors. In several simulative studies we show the limiting distribution of the empirical estimator of the newly defined index and of the corresponding rank correlation. A comparative study based on financial data shows that our proposed correlation, though derived from a novel approach, behaves similarly to some of the multivariate dependence notions recently introduced in the literature. Throughout the paper, we also give some auxiliary results of independent interest in matrix analysis and mass transportation theory, including an improvement to the Cauchy–Schwarz inequality for positive definite covariance matrices.

Measuring linear correlation between random vectors / G. Puccetti. - In: INFORMATION SCIENCES. - ISSN 0020-0255. - 607:(2022 Aug), pp. 1328-1347. [10.1016/j.ins.2022.06.016]

Measuring linear correlation between random vectors

G. Puccetti
Primo
2022

Abstract

We introduce a new scalar coefficient to measure linear correlation between random vec- tors which preserves all the relevant properties of Pearson’s correlation in arbitrarily large dimensions. The new measure and its bounds are derived from a mass transportation approach in which the expected inner product of two random vectors is taken as a measure of their covariance and then standardized by the maximal attainable value given their mar- ginal covariance matrices. The new correlation is maximized when the average squared Euclidean distance between the random vectors is minimal and attains value one when, additionally, it is possible to establish an affine relationship between the vectors. In several simulative studies we show the limiting distribution of the empirical estimator of the newly defined index and of the corresponding rank correlation. A comparative study based on financial data shows that our proposed correlation, though derived from a novel approach, behaves similarly to some of the multivariate dependence notions recently introduced in the literature. Throughout the paper, we also give some auxiliary results of independent interest in matrix analysis and mass transportation theory, including an improvement to the Cauchy–Schwarz inequality for positive definite covariance matrices.
Multivariate correlation; Cyclical monotonicity; Financial time series; Dissimilarity measure; Multivariate Cauchy–Schwarz inequality
Settore SECS-S/06 - Metodi mat. dell'economia e Scienze Attuariali e Finanziarie
Settore SECS-S/01 - Statistica
Settore MAT/06 - Probabilita' e Statistica Matematica
ago-2022
15-giu-2022
Article (author)
File in questo prodotto:
File Dimensione Formato  
22INFSCPUCCETTI.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 3.36 MB
Formato Adobe PDF
3.36 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
gP22_IS.pdf

accesso aperto

Tipologia: Pre-print (manoscritto inviato all'editore)
Dimensione 2.34 MB
Formato Adobe PDF
2.34 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/933546
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 4
social impact