Bagging and boosting machine learning algorithms for modelling sensory perception from simple chemical variables: Wine mouthfeel as a case study
2025
Sáenz-Navajas, María-Pilar | Ferreira, Chelo | Bastian, Susan E. P. | Jeffery, David W. | Agencia Estatal de Investigación (España) | Ministerio de Ciencia e Innovación (España) | European Commission | Ministerio de Universidades (España) | Instituto de Estudios Riojanos | Sáenz-Navajas, María-Pilar [0000-0001-7225-2272] | Consejo Superior de Investigaciones Científicas [https://ror.org/02gfc7t72]
Aiming to predict sensory properties from chemical data, the application of bagging and boosting machine learning (ML) algorithms was comprehensively investigated and applied to modelling of red wine mouthfeel from simple chemical measurements. A panel of 15 Australian winemakers described the mouthfeel properties of a total of 30 commercial red wines from Australia and Spain using rate-all-that-apply sensory methodology. In parallel, linear sweep voltammetry signals and excitation-emission matrix (EEM) and absorbance data were acquired for the wines. Data were analysed following unsupervised statistical strategies including principal component analysis (PCA with varimax rotation) to simplify the interpretation of sensory variables, along with supervised regression models based on ML, namely random forest (RF) and extreme gradient boosting (XGBoost). PCA results showed that four independent and uncorrelated sensory dimensions mainly related to perceptions of ‘drying’, ‘full body’, ‘velvety’, and ‘gummy’ differentiated among the wines. The RF and XGBoost algorithms yielded superior validated regression models compared to classical PLS modelling. The ML algorithms exhibited strong predictive performance on test data, with an average value exceeding 80 % accuracy for any of the three sets of chemical variables employed. Although XGBoost provided slightly better models, the low computational effort required by RF is advantageous. Important variables included in the models are discussed along with the importance of controlling overfitting. Overall, absorbance, voltammetric or EEM signals coupled with RF or XGBoost algorithms are presented as cheap, easy-to-use, and rapid approaches to predicting sensory properties from chemical signals in complex matrices such as wine.
Mostrar más [+] Menos [-]This research was funded by the I + D + i project from the Spanish Ministry of Science and Innovation, the Spanish Research Agency and FEDER with grant number PID2021-126031OB-C22 (call: Generación de Conocimiento) funded by MCIN/ AEI/10.13039/501100011033/, and by “ERDF A way of making Europe”, the José Castillejo program through the Ministerio de Universidades: Programa Estatal de Promoción del Talento y su Empleabilidad en I + D + i. Subprograma Estatal de Movilidad. del Plan Estatal de Investigación Científica y Técnica y de Innovación 2017–2020 (CAS21/00221), and Instituto de Estudios Riojanos (ayudas a la investigación 2023).
Mostrar más [+] Menos [-]Peer reviewed
Mostrar más [+] Menos [-]Palabras clave de AGROVOC
Información bibliográfica
Este registro bibliográfico ha sido proporcionado por Instituto de Ciencias de la Vid y del Vino