Rapid and non-invasive analysis of food products is essential in the agrifood sector for ensuring quality, safety and authenticity. In this context, Volatile Organic Compound (VOC) analysis plays a key role, and direct injection mass spectrometry, Proton Transfer Reaction Mass Spectrometry (PTR-ToF-MS) in particular, offers an optimal tool due to its speed and high sensitivity. The resulting datasets from these analyses are typically modeled using classification, regression, and peak selection methods. In these tasks, gradient boosting methods, and XGBoost in particular, have demonstrated outstanding performance, often surpassing classical machine learning techniques and deep learning approaches. In this work, we investigate the applicability of XGBoost to PTR-ToF-MS datasets of food VOCs in detail. We show that XGBoost requires careful (and time-consuming) optimization to achieve competitive results in this specific domain. Our results indicate that the performance of XGBoost on food products is better in classification than in other analysis tasks, and is comparable on regression and peak selection to that of other state-of-the-art methods, when all methods are appropriately tuned. Given the inherent difficulty of modeling small and noisy real world datasets, our work highlights the importance of carefully evaluating methods within each specific domain, rather than extrapolating their performance as a given.
Granitto, P.M.; Mazzucotelli, M.; Pedrotti, M.; Khomenko, I.; Biasioli, F. (2026). Gradient boosting applied to PTR-ToF-MS analysis of agrifood samples. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 273: 105702. doi: 10.1016/j.chemolab.2026.105702 handle: https://hdl.handle.net/10449/95575
Gradient boosting applied to PTR-ToF-MS analysis of agrifood samples
Granitto, P. M.
Primo
;Mazzucotelli, M.;Pedrotti, M.;Khomenko, I.;Biasioli, F.Ultimo
2026-01-01
Abstract
Rapid and non-invasive analysis of food products is essential in the agrifood sector for ensuring quality, safety and authenticity. In this context, Volatile Organic Compound (VOC) analysis plays a key role, and direct injection mass spectrometry, Proton Transfer Reaction Mass Spectrometry (PTR-ToF-MS) in particular, offers an optimal tool due to its speed and high sensitivity. The resulting datasets from these analyses are typically modeled using classification, regression, and peak selection methods. In these tasks, gradient boosting methods, and XGBoost in particular, have demonstrated outstanding performance, often surpassing classical machine learning techniques and deep learning approaches. In this work, we investigate the applicability of XGBoost to PTR-ToF-MS datasets of food VOCs in detail. We show that XGBoost requires careful (and time-consuming) optimization to achieve competitive results in this specific domain. Our results indicate that the performance of XGBoost on food products is better in classification than in other analysis tasks, and is comparable on regression and peak selection to that of other state-of-the-art methods, when all methods are appropriately tuned. Given the inherent difficulty of modeling small and noisy real world datasets, our work highlights the importance of carefully evaluating methods within each specific domain, rather than extrapolating their performance as a given.| File | Dimensione | Formato | |
|---|---|---|---|
|
2026 CILS Mazzucotelli.pdf
accesso aperto
Tipologia:
Versione editoriale (Publisher’s layout)
Licenza:
Creative commons
Dimensione
1.3 MB
Formato
Adobe PDF
|
1.3 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.



