Data from modern -omics technologies provide an holistic representation of the state of biological systems. However, considering the inherent complexity of the datasets, it is necessary to develop filters able to highlight only relevant information to be used for the biological interpretation. Biomarker identification represents a paradigmatic example of the situation faced in data filtering. Biomarkers are variables (metabolites, proteins, genes, ...) which can be used to characterize specific subgroups in the data; in a two-class setting, for example, the biomarkers are those variables that allow discrimination between the classes. A class tag can be used to distinguish many situations: it can be used to discriminate treated vs. non-treated, to mark different varieties of the same organism, etcetera. Most methods for biomarker identification formulate the problem in a classification setting, selecting as important the variables which give good predictive performance. In all the biomarker selection strategies, it is necessary to define a cutoff value which identifies the subset of the features to be considered biomarkers, as with the rather arbitrary alpha level in statistical testing. The choice of a reasonable and robust cutoff has important practical implications for the development of a biological model. Biomarkers identification, indeed, can be a long and expensive process so it is of paramount importance to be able of focus only on reliable biomarkers. In the majority of cases the optimal cutoff point depends on the specific dataset under study, so there is a definitive interest in developing and comparing general strategies able to identify good cutoff points. In this contribution, several strategies to face the problem will proposed and evaluated using spiked -omics datasets. Among them, particular focus will be on Higher Criticism (Donoho; 2008) and on the recently proposed Stability Based Biomarkers Selection (Wehrens; 2011).

Franceschi, P.; Wehrens, H.R.M.J. (2011). Assessing cutoff points for biomarker selection in -omics technologies. In: ICRM 2011: Fifth International Chemometrics Research Meeting: september 25-29, 2011, Berg en Dal The Netherlands: 19. url: http://www.icrm2011.org/program/program.html handle: http://hdl.handle.net/10449/20438

Assessing cutoff points for biomarker selection in -omics technologies

Franceschi, Pietro;Wehrens, Herman Ronald Maria Johan
2011-01-01

Abstract

Data from modern -omics technologies provide an holistic representation of the state of biological systems. However, considering the inherent complexity of the datasets, it is necessary to develop filters able to highlight only relevant information to be used for the biological interpretation. Biomarker identification represents a paradigmatic example of the situation faced in data filtering. Biomarkers are variables (metabolites, proteins, genes, ...) which can be used to characterize specific subgroups in the data; in a two-class setting, for example, the biomarkers are those variables that allow discrimination between the classes. A class tag can be used to distinguish many situations: it can be used to discriminate treated vs. non-treated, to mark different varieties of the same organism, etcetera. Most methods for biomarker identification formulate the problem in a classification setting, selecting as important the variables which give good predictive performance. In all the biomarker selection strategies, it is necessary to define a cutoff value which identifies the subset of the features to be considered biomarkers, as with the rather arbitrary alpha level in statistical testing. The choice of a reasonable and robust cutoff has important practical implications for the development of a biological model. Biomarkers identification, indeed, can be a long and expensive process so it is of paramount importance to be able of focus only on reliable biomarkers. In the majority of cases the optimal cutoff point depends on the specific dataset under study, so there is a definitive interest in developing and comparing general strategies able to identify good cutoff points. In this contribution, several strategies to face the problem will proposed and evaluated using spiked -omics datasets. Among them, particular focus will be on Higher Criticism (Donoho; 2008) and on the recently proposed Stability Based Biomarkers Selection (Wehrens; 2011).
Biomarker Selection
-omics technologies
Selezione di Biomarkers
Tecnologie -omiche
2011
Franceschi, P.; Wehrens, H.R.M.J. (2011). Assessing cutoff points for biomarker selection in -omics technologies. In: ICRM 2011: Fifth International Chemometrics Research Meeting: september 25-29, 2011, Berg en Dal The Netherlands: 19. url: http://www.icrm2011.org/program/program.html handle: http://hdl.handle.net/10449/20438
File in questo prodotto:
File Dimensione Formato  
2011 Oral_Franceschi.pdf

accesso aperto

Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 120.3 kB
Formato Adobe PDF
120.3 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10449/20438
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact