DNA methylation is an epigenetic regulator of gene expression and cell identity, which can be shaped by both physiological and pathological factors, including environmental exposure. The identification of sites with high methylation variability can be computationally challenging, especially in large-scale studies. To address this, we propose a framework based on the integrated nested Laplace approximation (INLA) to model methylation with Bayesian generalized linear mixed models (GLMMs), accounting for subject covariates, genomic annotations, and cell composition. To validate the methodology, we sequenced 158 healthy subjects with nanopore and analyzed a panel of 13 genes related to inflammation and stress response. We identified a set of hypervariable CpG sites whose genomic context and methylation levels were consistent with a regulatory role, making them potential candidates for epigenomic association studies. In our comparison, INLA results were concordant with those obtained with MCMC-based methods, with runtimes shorter by orders of magnitude. The computational efficiency of the framework allows for fast exploratory data analysis, model testing, and iterative prototyping, making it viable for large-scale studies that otherwise would be computationally prohibitive.
Application of integrated nested Laplace approximation to identify hot spots of methylation heterogeneity in healthy individuals from the MAMELI cohort / T. Nardi, E. Dariol, R. Matsagani, D. Zojaji, S. Gustincich, L. Pandolfini, E. Biganzoli, V. Bollati. - In: FRONTIERS IN GENETICS. - ISSN 1664-8021. - 17:(2026 Apr 29), pp. 1-10. [10.3389/fgene.2026.1787544]
Application of integrated nested Laplace approximation to identify hot spots of methylation heterogeneity in healthy individuals from the MAMELI cohort
T. Nardi
Primo
;E. DariolSecondo
;R. Matsagani;D. Zojaji;E. BiganzoliPenultimo
;V. BollatiUltimo
2026
Abstract
DNA methylation is an epigenetic regulator of gene expression and cell identity, which can be shaped by both physiological and pathological factors, including environmental exposure. The identification of sites with high methylation variability can be computationally challenging, especially in large-scale studies. To address this, we propose a framework based on the integrated nested Laplace approximation (INLA) to model methylation with Bayesian generalized linear mixed models (GLMMs), accounting for subject covariates, genomic annotations, and cell composition. To validate the methodology, we sequenced 158 healthy subjects with nanopore and analyzed a panel of 13 genes related to inflammation and stress response. We identified a set of hypervariable CpG sites whose genomic context and methylation levels were consistent with a regulatory role, making them potential candidates for epigenomic association studies. In our comparison, INLA results were concordant with those obtained with MCMC-based methods, with runtimes shorter by orders of magnitude. The computational efficiency of the framework allows for fast exploratory data analysis, model testing, and iterative prototyping, making it viable for large-scale studies that otherwise would be computationally prohibitive.| File | Dimensione | Formato | |
|---|---|---|---|
|
fgene-17-1787544.pdf
accesso aperto
Descrizione: Articolo pubblicato
Tipologia:
Publisher's version/PDF
Licenza:
Creative commons
Dimensione
2.9 MB
Formato
Adobe PDF
|
2.9 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




