We recently introduced the Random Forest - Recursive Feature Elimination (RF-RFE) algorithm for feature selection. In this paper we apply it to the identification of relevant features in the spectra (fingerprints) produced by Proton Transfer Reaction - Mass Spectrometry (PTR-MS) analysis of four agro-industrial products (two datasets with cultivars of Berries and other two with typical cheeses, all from North Italy). The method is compared with the more traditional Support Vector Machine - Recursive Feature Elimination (SVM-RFE), extended to allow multiclass problems. Using replicated experiments we estimate unbiased generalization errors for both methods. We analyze the stability of the two methods and find that RF-RFE is more stable than SVM-RFE in selecting small subsets of features. Our results also show that RF-RFE outperforms SVM-RFE on the task of finding small subsets of features with high discrimination levels on PTR-MS datasets
Granitto, P.M.; Biasioli, F.; Furlanello, C.; Gasperi, F. (2008). Efficient feature selection for PTR-MS fingerprinting of agroindustrial products. In: Artificial Neural Networks - ICANN 2008: 18th International Conference, Prague, Czech Republic, September 3-6, 2008 (editor(s) Kůrková, V., Neruda, R., Koutník, J.). Heidelberg: Springer. (LECTURE NOTES IN COMPUTER SCIENCE): 42-51. ISBN: 9783540875581 doi: 10.1007/978-3-540-87559-8_5. handle: https://hdl.handle.net/10449/89056
Efficient feature selection for PTR-MS fingerprinting of agroindustrial products
Biasioli, F.;Gasperi, F.Ultimo
2008-01-01
Abstract
We recently introduced the Random Forest - Recursive Feature Elimination (RF-RFE) algorithm for feature selection. In this paper we apply it to the identification of relevant features in the spectra (fingerprints) produced by Proton Transfer Reaction - Mass Spectrometry (PTR-MS) analysis of four agro-industrial products (two datasets with cultivars of Berries and other two with typical cheeses, all from North Italy). The method is compared with the more traditional Support Vector Machine - Recursive Feature Elimination (SVM-RFE), extended to allow multiclass problems. Using replicated experiments we estimate unbiased generalization errors for both methods. We analyze the stability of the two methods and find that RF-RFE is more stable than SVM-RFE in selecting small subsets of features. Our results also show that RF-RFE outperforms SVM-RFE on the task of finding small subsets of features with high discrimination levels on PTR-MS datasetsFile | Dimensione | Formato | |
---|---|---|---|
2008 Biasioli.pdf
non disponibili
Tipologia:
Versione editoriale (Publisher’s layout)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
2.26 MB
Formato
Adobe PDF
|
2.26 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.