VESPUCCI, the Vitis Expression Studies Platform Using COLOMBOS Compendia Instances, is an on-line gene expression compendium that integrates publicly available transcriptomic data for grapevine measured using both microarray and RNA-seq. It was originally developed in 2014 and has since grown from the initial ~1500 samples to the current ~2000 samples. Each sample has been manually annotated using a controlled vocabulary developed ad hoc to ensure both human readability and computational tractability to precisely describe which parameters have changed across different experimental conditions. VESPUCCI was created based on an approach for dealing with the large heterogeneity of data formats present in public databases, and to integrate cross-platform gene expression experiments in one dedicated, coherent database. The expression data in the compendium can be visually explored using several tools provided by the web interface, an exploratory tool meant to assist more dedicated research in grapevine genomics, biology, and physiology. The technology underlying VESPUCCI has recently been completely overhauled in order to simplify the data management. This update has involved both the compendium creation step and the data query step. Concerning the former, the new technology supports different normalization strategies within the same compendium, and thus the legacy normalization data in which samples were treated disregarding the fact that they were measuring the same biological condition will be available together with normalized data in which replicated measurement are taken into consideration. Then, the sample annotation has been revisited to better exploit existing ontologies such as the Plant Ontology, the Plant Trait Ontology, the Plant Experimental Conditions Ontology and the Unit of Measurement Ontology. Finally, the programmatic access to the compendium is going to be easier and more powerful, simplifying the integration of VESPUCCI with other services. All these improvements will be implemented in the new release of the VESPUCCI, which will include all publicly available expression experiment up to 2019, raising the number of collected samples to ~3000.
Moretto, M.; Sonego, P.; Pilati, S.; Malacarne, G.; Costantini, L.; Moser, C.; Engelen, K. (2019). VESPUCCI: the integrated gene expression database for grapevine. In: The First Annual Meeting of COST Action CA17111 "INTEGRAPE 2019 - Data Integration as a key step for future grapevine research", Chania, Crete, Greece, March 25-28, 2019: 13. url: http://www.integrape.eu/images/pdf/CA17111-integrape2019.pdf handle: http://hdl.handle.net/10449/53834
VESPUCCI: the integrated gene expression database for grapevine
Moretto, M.
Primo
;Sonego, P.;Pilati, S.;Malacarne, G.;Costantini, L.;Moser, C.;Engelen, K.Ultimo
2019-01-01
Abstract
VESPUCCI, the Vitis Expression Studies Platform Using COLOMBOS Compendia Instances, is an on-line gene expression compendium that integrates publicly available transcriptomic data for grapevine measured using both microarray and RNA-seq. It was originally developed in 2014 and has since grown from the initial ~1500 samples to the current ~2000 samples. Each sample has been manually annotated using a controlled vocabulary developed ad hoc to ensure both human readability and computational tractability to precisely describe which parameters have changed across different experimental conditions. VESPUCCI was created based on an approach for dealing with the large heterogeneity of data formats present in public databases, and to integrate cross-platform gene expression experiments in one dedicated, coherent database. The expression data in the compendium can be visually explored using several tools provided by the web interface, an exploratory tool meant to assist more dedicated research in grapevine genomics, biology, and physiology. The technology underlying VESPUCCI has recently been completely overhauled in order to simplify the data management. This update has involved both the compendium creation step and the data query step. Concerning the former, the new technology supports different normalization strategies within the same compendium, and thus the legacy normalization data in which samples were treated disregarding the fact that they were measuring the same biological condition will be available together with normalized data in which replicated measurement are taken into consideration. Then, the sample annotation has been revisited to better exploit existing ontologies such as the Plant Ontology, the Plant Trait Ontology, the Plant Experimental Conditions Ontology and the Unit of Measurement Ontology. Finally, the programmatic access to the compendium is going to be easier and more powerful, simplifying the integration of VESPUCCI with other services. All these improvements will be implemented in the new release of the VESPUCCI, which will include all publicly available expression experiment up to 2019, raising the number of collected samples to ~3000.File | Dimensione | Formato | |
---|---|---|---|
2019 Moretto COST.pdf
accesso aperto
Tipologia:
Abstract
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
253.86 kB
Formato
Adobe PDF
|
253.86 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.