The Gene Promoter Expression Prediction challenge consisted of predicting gene expression from promoter sequences in a previously unknown experimentally generated data set. The challenge was presented to the community in the framework of the sixth Dialogue for Reverse Engineering Assessments and Methods (DREAM6), a community effort to evaluate the status of systems biology modeling methodologies. Nucleotide-specific promoter activity was obtained by measuring fluorescence from promoter sequences fused upstream of a gene for yellow fluorescence protein and inserted in the same genomic site of yeast Saccharomyces cerevisiae. Twenty-one teams submitted results predicting the expression levels of 53 different promoters from yeast ribosomal protein genes. Analysis of participant predictions shows that accurate values for low-expressed and mutated promoters were difficult to obtain, although in the latter case, only when the mutation induced a large change in promoter activity compared to the wild-type sequence. As in previous DREAM challenges, we found that aggregation of participant predictions provided robust results, but did not fare better than the three best algorithms. Finally, this study not only provides a benchmark for the assessment of methods predicting activity of a specific set of promoters from their sequence, but it also shows that the top performing algorithm, which used machine-learning approaches, can be improved by the addition of biological features such as transcription factor binding sites

Meyer, P.; Siwo, G.; Zeevi, D.; Sharon, E.; Norel, R.; Siwo, G.; Rider, A.K.; Tan, A.; Pinapati, R.S.; Emrich, S.; Chawla, N.; Ferdig, M.T.; Tung, Y.A.; Chen, Y.S.; Chen, M.J.; Chen, C.Y.; Knight, J.M.; Sahraeian, S.M.; Esfahani, M.S.; Dreos, R.; Bucher, P.; Maier, E.; Saeys, Y.; Szczurek, E.; Mysicková, A.; Vingron, M.; Klein, H.; Kielbasa, S.M.; Knisley, J.; Bonnell, J.; Knisley, D.; Kursa, M.B.; Rudnicki, W.R.; Bhattacharjee, M.; Sillanpää, M.J.; Yeung, J.; Meysman, P.; Rodríguez, A.S.; Engelen, K.A.; Marchal, K.; Huang, Y.; Mordelet, F.; Hartemink, A.; Pinello, L.; Yuan, G.C.; Segal, E.; Stolovitzky, G. (2013). Inferring gene expression from ribosomal promoter sequences, a crowdsourcing approach. GENOME RESEARCH, 23: 1928-1937. doi: 10.1101/gr.157420.113 handle: http://hdl.handle.net/10449/23202

Inferring gene expression from ribosomal promoter sequences, a crowdsourcing approach

Engelen, Kristof Arthur;
2013-01-01

Abstract

The Gene Promoter Expression Prediction challenge consisted of predicting gene expression from promoter sequences in a previously unknown experimentally generated data set. The challenge was presented to the community in the framework of the sixth Dialogue for Reverse Engineering Assessments and Methods (DREAM6), a community effort to evaluate the status of systems biology modeling methodologies. Nucleotide-specific promoter activity was obtained by measuring fluorescence from promoter sequences fused upstream of a gene for yellow fluorescence protein and inserted in the same genomic site of yeast Saccharomyces cerevisiae. Twenty-one teams submitted results predicting the expression levels of 53 different promoters from yeast ribosomal protein genes. Analysis of participant predictions shows that accurate values for low-expressed and mutated promoters were difficult to obtain, although in the latter case, only when the mutation induced a large change in promoter activity compared to the wild-type sequence. As in previous DREAM challenges, we found that aggregation of participant predictions provided robust results, but did not fare better than the three best algorithms. Finally, this study not only provides a benchmark for the assessment of methods predicting activity of a specific set of promoters from their sequence, but it also shows that the top performing algorithm, which used machine-learning approaches, can be improved by the addition of biological features such as transcription factor binding sites
Settore BIO/18 - GENETICA
2013
Meyer, P.; Siwo, G.; Zeevi, D.; Sharon, E.; Norel, R.; Siwo, G.; Rider, A.K.; Tan, A.; Pinapati, R.S.; Emrich, S.; Chawla, N.; Ferdig, M.T.; Tung, Y.A.; Chen, Y.S.; Chen, M.J.; Chen, C.Y.; Knight, J.M.; Sahraeian, S.M.; Esfahani, M.S.; Dreos, R.; Bucher, P.; Maier, E.; Saeys, Y.; Szczurek, E.; Mysicková, A.; Vingron, M.; Klein, H.; Kielbasa, S.M.; Knisley, J.; Bonnell, J.; Knisley, D.; Kursa, M.B.; Rudnicki, W.R.; Bhattacharjee, M.; Sillanpää, M.J.; Yeung, J.; Meysman, P.; Rodríguez, A.S.; Engelen, K.A.; Marchal, K.; Huang, Y.; Mordelet, F.; Hartemink, A.; Pinello, L.; Yuan, G.C.; Segal, E.; Stolovitzky, G. (2013). Inferring gene expression from ribosomal promoter sequences, a crowdsourcing approach. GENOME RESEARCH, 23: 1928-1937. doi: 10.1101/gr.157420.113 handle: http://hdl.handle.net/10449/23202
File in questo prodotto:
File Dimensione Formato  
Genome Res.-2013-Meyer-1928-37.pdf

accesso aperto

Licenza: Creative commons
Dimensione 1.44 MB
Formato Adobe PDF
1.44 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10449/23202
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 10
  • ???jsp.display-item.citation.isi??? 6
social impact