Biomarker identification is of fundamental importance in many fields of biology and medicine. Modern analytical techniques provide a wealth of information, often leading to a holistic view of the state of biological systems. Out of this huge mass of data those variables that are indicative of class differences, the biomarkers, need to be identified in order to obtain either better predictive models, to obtain a better understanding of the complexity of living systems and to prioritize experimental follow-up work. Many statistical approaches focus on creating predictive models, and inspecting the model coefficients to see which variables could be termed putative biomarkers. In practice, however, the applicability of this approach is hindered by the unfavourable sample-to-variable ratio, making it virtually impossible to both build and validate predictive models. We will discuss two principled approaches to obtain good biomarker estimates anyway, not relying on accurate error estimations, and show using experimental spike-in data that also unpredictive models may be used to good effect in the quest for biomarkers.
|Citation:||Wehrens, R. (2011). Statistical biomarker identification in the omics sciences. In: BIT's 2nd Annual Congress of Biomarkers (WCB-2011): making direct bioanalysis nimble: November 7-9, 2011, Beijing, China. url: http://www.bitlifesciences.com/wcb2011/ handle: http://hdl.handle.net/10449/20873|
|Organization unit:||Computational Biology # CRI_2011-JAN2016|
|Title:||Statistical biomarker identification in the omics sciences|
|Keywords ENG:||Data analysis|
|Appears in Collections:||03 - Conference object|