Lignocellulosic biomass pyrolysis for bio-oil production stands as a promising route for renewable energy, yet predicting bio-oil yield remains challenging due to the complex interplay of biomass properties and process conditions. Conventional Machine Learning (ML) approaches, while effective, require labor-intensive manual algorithm selection and hyperparameter tuning, hindering their scalability and reproducibility. To address this point, we present a systematic comparison of four state-of-the-art Automated Machine Learning (AutoML) frameworks—AutoGluon, Auto-Sklearn, FLAML, and TPOT—for automating bio-oil yield prediction. Relying on a dataset of 329 experimental samples from 34 biomass types and seven input features (cellulose, hemicellulose, lignin content, nitrogen flow, heating rate, temperature, and particle size), we demonstrate that FLAML coupled with XGBoost achieves superior predictive performance (, ), thus significantly outperforming both traditional ML models and other AutoML tools. Statistical validation via ANOVA and Tukey’s post-hoc test confirms the robustness of these findings. Our study highlights AutoML’s ability to generate accurate and efficient models for complex pyrolysis systems, substantially reducing reliance on expert knowledge and manual configuration. The developed work establishes AutoML as a scalable solution for optimizing bio-oil production, facilitating more sustainable and data-driven biomass conversion strategies.

Automated machine learning for bio-oil yield prediction from lignocellulosic biomass pyrolysis / D. Campos, R. Ervilha, B. Da Silva Macêdo, M. Bodini, C.M. Saporetti, L. Goliatt. - In: FUEL. - ISSN 0016-2361. - 405:Part A(2026 Feb 01), pp. 136480.1-136480.14. [10.1016/j.fuel.2025.136480]

Automated machine learning for bio-oil yield prediction from lignocellulosic biomass pyrolysis

M. Bodini;
2026

Abstract

Lignocellulosic biomass pyrolysis for bio-oil production stands as a promising route for renewable energy, yet predicting bio-oil yield remains challenging due to the complex interplay of biomass properties and process conditions. Conventional Machine Learning (ML) approaches, while effective, require labor-intensive manual algorithm selection and hyperparameter tuning, hindering their scalability and reproducibility. To address this point, we present a systematic comparison of four state-of-the-art Automated Machine Learning (AutoML) frameworks—AutoGluon, Auto-Sklearn, FLAML, and TPOT—for automating bio-oil yield prediction. Relying on a dataset of 329 experimental samples from 34 biomass types and seven input features (cellulose, hemicellulose, lignin content, nitrogen flow, heating rate, temperature, and particle size), we demonstrate that FLAML coupled with XGBoost achieves superior predictive performance (, ), thus significantly outperforming both traditional ML models and other AutoML tools. Statistical validation via ANOVA and Tukey’s post-hoc test confirms the robustness of these findings. Our study highlights AutoML’s ability to generate accurate and efficient models for complex pyrolysis systems, substantially reducing reliance on expert knowledge and manual configuration. The developed work establishes AutoML as a scalable solution for optimizing bio-oil production, facilitating more sustainable and data-driven biomass conversion strategies.
Lignocellulosic biomass pyrolysis; Automated machine learning (AutoML); Bio-oil yield prediction; Predictive modeling; Process optimization;
Settore IINF-05/A - Sistemi di elaborazione delle informazioni
Settore INFO-01/A - Informatica
Settore CHEM-01/A - Chimica analitica
1-feb-2026
13-ago-2025
Article (author)
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S0016236125022057-main.pdf

accesso riservato

Descrizione: Versione disponibile online
Tipologia: Publisher's version/PDF
Licenza: Nessuna licenza
Dimensione 7.53 MB
Formato Adobe PDF
7.53 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
preprint.pdf

accesso aperto

Descrizione: Preprint
Tipologia: Pre-print (manoscritto inviato all'editore)
Licenza: Creative commons
Dimensione 3.51 MB
Formato Adobe PDF
3.51 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1179575
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact