The main purpose of the paper is to improve research on school effectiveness by applying a new strategy for uncovering subpopulations of schools that differ in terms of the distribution of student outcomes. We propose a semiparametric mixed effects model, with an expectation–maximization algorithm to estimate its parameters, and we apply it to the Italian Institute for the Educational Evaluation of Instruction and Training data of 2013–2014 as a tool for identifying latent subpopulations of schools. Under the semiparametric assumption, the random effects of the mixed effects model follow a discrete distribution with an (a priori) unknown number of support points. This modelling induces an automatic clustering of schools (the higher level of the hierarchy), where schools within the same cluster share the same random effects. The latent subpopulations of schools identified may then be characterized through multinomial models that include school level features. The novelties introduced by this paper are twofold: first, the semiparametric expectation–maximization algorithm is an innovative method that could be used in many classification problems; second, its application to education data represents a new approach to studying school effectiveness.
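The clustering mechanism described above can be illustrated with a small sketch. It is not the authors' algorithm: it assumes a single random intercept per school, one student-level covariate, Gaussian errors, and a number of support points fixed in advance, whereas the paper treats that number as unknown. The function name `fit_discrete_random_intercepts`, its parameters, and the simulated data are hypothetical, and the parameter updates are done one at a time (a conditional-maximization variant of EM).

```python
import numpy as np


def fit_discrete_random_intercepts(y, x, school, n_points=2, n_iter=200):
    """Illustrative EM for y_ij = c_l + beta * x_ij + eps_ij, eps ~ N(0, sigma2).

    School i takes intercept c_l with probability w_l (discrete random effect).
    Assumption: n_points is fixed here; the paper estimates it from the data.
    """
    _, idx = np.unique(school, return_inverse=True)
    n_schools = idx.max() + 1

    # Initialise mass points from quantiles of the raw school means.
    school_means = np.bincount(idx, weights=y) / np.bincount(idx)
    c = np.quantile(school_means, (np.arange(n_points) + 0.5) / n_points)
    w = np.full(n_points, 1.0 / n_points)
    beta, sigma2 = 0.0, np.var(y)

    for _ in range(n_iter):
        # E-step: posterior probability that each school sits on each mass point.
        resid = y[:, None] - beta * x[:, None] - c[None, :]
        loglik_school = np.zeros((n_schools, n_points))
        np.add.at(loglik_school, idx, -0.5 * resid**2 / sigma2)
        logpost = np.log(np.maximum(w, 1e-300))[None, :] + loglik_school
        logpost -= logpost.max(axis=1, keepdims=True)  # numerical stability
        post = np.exp(logpost)
        post /= post.sum(axis=1, keepdims=True)

        # M-step (conditional updates): weights, mass points, slope, variance.
        w = post.mean(axis=0)
        obs_post = post[idx]  # posterior weights repeated for every student
        c = (obs_post * (y - beta * x)[:, None]).sum(axis=0) / np.maximum(
            obs_post.sum(axis=0), 1e-12
        )
        beta = x @ (y - obs_post @ c) / (x @ x)
        resid = y[:, None] - beta * x[:, None] - c[None, :]
        sigma2 = (obs_post * resid**2).sum() / y.size

    return {"c": c, "w": w, "beta": beta, "sigma2": sigma2, "post": post}


# Synthetic data (not the paper's data): 40 schools, 30 students each,
# two latent subpopulations with intercepts -2 and 2, and slope 1.
rng = np.random.default_rng(0)
true_cluster = np.repeat([0, 1], 20)
school = np.repeat(np.arange(40), 30)
x = rng.normal(size=school.size)
y = np.array([-2.0, 2.0])[true_cluster][school] + 1.0 * x
y = y + rng.normal(scale=0.5, size=school.size)

fit = fit_discrete_random_intercepts(y, x, school, n_points=2)
labels = fit["post"].argmax(axis=1)  # cluster assignment per school
```

Each school is assigned to the support point with the highest posterior probability, so schools in the same cluster share the same intercept; this is the sense in which a discrete random-effects distribution yields a clustering of the higher level of the hierarchy.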
Semiparametric mixed-effects models for unsupervised classification of Italian schools / C. Masci, A. Paganoni, F. Ieva. - In: JOURNAL OF THE ROYAL STATISTICAL SOCIETY. SERIES A. STATISTICS IN SOCIETY. - ISSN 0964-1998. - 182:4(2019 Oct), pp. 1313-1342. [10.1111/rssa.12449]
Semiparametric mixed-effects models for unsupervised classification of Italian schools
C. Masci
2019




