Lignocellulosic biomass pyrolysis for bio-oil production stands as a promising route for renewable energy, yet predicting bio-oil yield remains challenging due to the complex interplay of biomass properties and process conditions. Conventional Machine Learning (ML) approaches, while effective, require labor-intensive manual algorithm selection and hyperparameter tuning, hindering their scalability and reproducibility. To address this point, we present a systematic comparison of four state-of-the-art Automated Machine Learning (AutoML) frameworks—AutoGluon, Auto-Sklearn, FLAML, and TPOT—for automating bio-oil yield prediction. Relying on a dataset of 329 experimental samples from 34 biomass types and seven input features (cellulose, hemicellulose, lignin content, nitrogen flow, heating rate, temperature, and particle size), we demonstrate that FLAML coupled with XGBoost achieves superior predictive performance (, ), thus significantly outperforming both traditional ML models and other AutoML tools. Statistical validation via ANOVA and Tukey’s post-hoc test confirms the robustness of these findings. Our study highlights AutoML’s ability to generate accurate and efficient models for complex pyrolysis systems, substantially reducing reliance on expert knowledge and manual configuration. The developed work establishes AutoML as a scalable solution for optimizing bio-oil production, facilitating more sustainable and data-driven biomass conversion strategies.
Automated machine learning for bio-oil yield prediction from lignocellulosic biomass pyrolysis / D. Campos, R. Ervilha, B. Da Silva Macêdo, M. Bodini, C.M. Saporetti, L. Goliatt. - In: FUEL. - ISSN 0016-2361. - 405:Part A(2026 Feb 01), pp. 136480.1-136480.14. [10.1016/j.fuel.2025.136480]
Automated machine learning for bio-oil yield prediction from lignocellulosic biomass pyrolysis
M. Bodini;
2026
Abstract
Lignocellulosic biomass pyrolysis for bio-oil production stands as a promising route for renewable energy, yet predicting bio-oil yield remains challenging due to the complex interplay of biomass properties and process conditions. Conventional Machine Learning (ML) approaches, while effective, require labor-intensive manual algorithm selection and hyperparameter tuning, hindering their scalability and reproducibility. To address this point, we present a systematic comparison of four state-of-the-art Automated Machine Learning (AutoML) frameworks—AutoGluon, Auto-Sklearn, FLAML, and TPOT—for automating bio-oil yield prediction. Relying on a dataset of 329 experimental samples from 34 biomass types and seven input features (cellulose, hemicellulose, lignin content, nitrogen flow, heating rate, temperature, and particle size), we demonstrate that FLAML coupled with XGBoost achieves superior predictive performance (, ), thus significantly outperforming both traditional ML models and other AutoML tools. Statistical validation via ANOVA and Tukey’s post-hoc test confirms the robustness of these findings. Our study highlights AutoML’s ability to generate accurate and efficient models for complex pyrolysis systems, substantially reducing reliance on expert knowledge and manual configuration. The developed work establishes AutoML as a scalable solution for optimizing bio-oil production, facilitating more sustainable and data-driven biomass conversion strategies.| File | Dimensione | Formato | |
|---|---|---|---|
|
1-s2.0-S0016236125022057-main.pdf
accesso riservato
Descrizione: Versione disponibile online
Tipologia:
Publisher's version/PDF
Licenza:
Nessuna licenza
Dimensione
7.53 MB
Formato
Adobe PDF
|
7.53 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
|
preprint.pdf
accesso aperto
Descrizione: Preprint
Tipologia:
Pre-print (manoscritto inviato all'editore)
Licenza:
Creative commons
Dimensione
3.51 MB
Formato
Adobe PDF
|
3.51 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




