Learning Bayesian networks with heterogeneous agronomic data sets via mixed-effect models and hierarchical clustering

Valleggi, L.; Scutari, M.; Stefanini, F.M.

doi:10.1016/j.engappai.2024.107867

Maize, a crucial crop globally cultivated across vast regions, especially in sub-Saharan Africa, Asia, and Latin America, occupies 197 million hectares as of 2021. Various statistical and machine learning models, including mixed-effect models, random coefficients models, random forests, and deep learning architectures, have been devised to predict maize yield. These models consider factors such as genotype, environment, genotype-environment interaction, and field management. However, the existing models often fall short of fully exploiting the complex network of causal relationships among these factors and the hierarchical structure inherent in agronomic data. This study introduces an innovative approach integrating random effects into Bayesian networks (BNs), leveraging their capacity to model causal and probabilistic relationships through directed acyclic graphs. Rooted in the linear mixed-effects models framework and tailored for hierarchical data, this novel approach demonstrates enhanced BN learning. Application to a real-world agronomic trial produces a model with improved interpretability, unveiling new causal connections. Notably, the proposed method significantly reduces the error rate in maize yield prediction from 28% to 17%. These results advocate for the preference of BNs in constructing practical decision support tools for hierarchical agronomic data, facilitating causal inference.

Learning Bayesian networks with heterogeneous agronomic data sets via mixed-effect models and hierarchical clustering / L. Valleggi, M. Scutari, F.M. Stefanini. - In: ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE. - ISSN 0952-1976. - 131:(2024 Jan), pp. 107867.1-107867.12. [10.1016/j.engappai.2024.107867]

Learning Bayesian networks with heterogeneous agronomic data sets via mixed-effect models and hierarchical clustering

Valleggi, Lorenzo;Scutari, Marco;F.M. Stefanini^Ultimo

2024

Abstract

Maize, a crucial crop globally cultivated across vast regions, especially in sub-Saharan Africa, Asia, and Latin America, occupies 197 million hectares as of 2021. Various statistical and machine learning models, including mixed-effect models, random coefficients models, random forests, and deep learning architectures, have been devised to predict maize yield. These models consider factors such as genotype, environment, genotype-environment interaction, and field management. However, the existing models often fall short of fully exploiting the complex network of causal relationships among these factors and the hierarchical structure inherent in agronomic data. This study introduces an innovative approach integrating random effects into Bayesian networks (BNs), leveraging their capacity to model causal and probabilistic relationships through directed acyclic graphs. Rooted in the linear mixed-effects models framework and tailored for hierarchical data, this novel approach demonstrates enhanced BN learning. Application to a real-world agronomic trial produces a model with improved interpretability, unveiling new causal connections. Notably, the proposed method significantly reduces the error rate in maize yield prediction from 28% to 17%. These results advocate for the preference of BNs in constructing practical decision support tools for hierarchical agronomic data, facilitating causal inference.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				Hierarchical data sets; Bayesian networks; Causal networks; Structure learning; Prediction of maize yield;
			
	Settori scientifico-disciplinari dell'articolo (sola visualizzazione)
	
				Settore SECS-S/01 - Statistica
			
	Data di pubblicazione
	
				gen-2024
			
	Rivista in ANCE
	
				ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE
			
	DOI
	
				https://dx.doi.org/10.1016/j.engappai.2024.107867
			
	Tipologia
	
				Article (author)
			
	Appare nelle tipologie:
	
				01 - Articolo su periodico

File in questo prodotto:

File	Dimensione	Formato
1-s2.0-S0952197624000253-main.pdf accesso aperto Descrizione: Article Tipologia: Publisher's version/PDF Dimensione 955.18 kB Formato Adobe PDF Visualizza/Apri	955.18 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1024851

Citazioni

ND

7

5

ND

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca