Cargando…

Evidence classification of high-throughput protocols and confidence integration in RegulonDB

RegulonDB provides curated information on the transcriptional regulatory network of Escherichia coli and contains both experimental data and computationally predicted objects. To account for the heterogeneity of these data, we introduced in version 6.0, a two-tier rating system for the strength of e...

Descripción completa

Detalles Bibliográficos
Autores principales: Weiss, Verena, Medina-Rivera, Alejandra, Huerta, Araceli M., Santos-Zavaleta, Alberto, Salgado, Heladia, Morett, Enrique, Collado-Vides, Julio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3548332/
https://www.ncbi.nlm.nih.gov/pubmed/23327937
http://dx.doi.org/10.1093/database/bas059
_version_ 1782256308088471552
author Weiss, Verena
Medina-Rivera, Alejandra
Huerta, Araceli M.
Santos-Zavaleta, Alberto
Salgado, Heladia
Morett, Enrique
Collado-Vides, Julio
author_facet Weiss, Verena
Medina-Rivera, Alejandra
Huerta, Araceli M.
Santos-Zavaleta, Alberto
Salgado, Heladia
Morett, Enrique
Collado-Vides, Julio
author_sort Weiss, Verena
collection PubMed
description RegulonDB provides curated information on the transcriptional regulatory network of Escherichia coli and contains both experimental data and computationally predicted objects. To account for the heterogeneity of these data, we introduced in version 6.0, a two-tier rating system for the strength of evidence, classifying evidence as either ‘weak’ or ‘strong’ (Gama-Castro,S., Jimenez-Jacinto,V., Peralta-Gil,M. et al. RegulonDB (Version 6.0): gene regulation model of Escherichia Coli K-12 beyond transcription, active (experimental) annotated promoters and textpresso navigation. Nucleic Acids Res., 2008;36:D120–D124.). We now add to our classification scheme the classification of high-throughput evidence, including chromatin immunoprecipitation (ChIP) and RNA-seq technologies. To integrate these data into RegulonDB, we present two strategies for the evaluation of confidence, statistical validation and independent cross-validation. Statistical validation involves verification of ChIP data for transcription factor-binding sites, using tools for motif discovery and quality assessment of the discovered matrices. Independent cross-validation combines independent evidence with the intention to mutually exclude false positives. Both statistical validation and cross-validation allow to upgrade subsets of data that are supported by weak evidence to a higher confidence level. Likewise, cross-validation of strong confidence data extends our two-tier rating system to a three-tier system by introducing a third confidence score ‘confirmed’. Database URL: http://regulondb.ccg.unam.mx/
format Online
Article
Text
id pubmed-3548332
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-35483322013-01-18 Evidence classification of high-throughput protocols and confidence integration in RegulonDB Weiss, Verena Medina-Rivera, Alejandra Huerta, Araceli M. Santos-Zavaleta, Alberto Salgado, Heladia Morett, Enrique Collado-Vides, Julio Database (Oxford) Original Article RegulonDB provides curated information on the transcriptional regulatory network of Escherichia coli and contains both experimental data and computationally predicted objects. To account for the heterogeneity of these data, we introduced in version 6.0, a two-tier rating system for the strength of evidence, classifying evidence as either ‘weak’ or ‘strong’ (Gama-Castro,S., Jimenez-Jacinto,V., Peralta-Gil,M. et al. RegulonDB (Version 6.0): gene regulation model of Escherichia Coli K-12 beyond transcription, active (experimental) annotated promoters and textpresso navigation. Nucleic Acids Res., 2008;36:D120–D124.). We now add to our classification scheme the classification of high-throughput evidence, including chromatin immunoprecipitation (ChIP) and RNA-seq technologies. To integrate these data into RegulonDB, we present two strategies for the evaluation of confidence, statistical validation and independent cross-validation. Statistical validation involves verification of ChIP data for transcription factor-binding sites, using tools for motif discovery and quality assessment of the discovered matrices. Independent cross-validation combines independent evidence with the intention to mutually exclude false positives. Both statistical validation and cross-validation allow to upgrade subsets of data that are supported by weak evidence to a higher confidence level. Likewise, cross-validation of strong confidence data extends our two-tier rating system to a three-tier system by introducing a third confidence score ‘confirmed’. Database URL: http://regulondb.ccg.unam.mx/ Oxford University Press 2013-01-17 /pmc/articles/PMC3548332/ /pubmed/23327937 http://dx.doi.org/10.1093/database/bas059 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Weiss, Verena
Medina-Rivera, Alejandra
Huerta, Araceli M.
Santos-Zavaleta, Alberto
Salgado, Heladia
Morett, Enrique
Collado-Vides, Julio
Evidence classification of high-throughput protocols and confidence integration in RegulonDB
title Evidence classification of high-throughput protocols and confidence integration in RegulonDB
title_full Evidence classification of high-throughput protocols and confidence integration in RegulonDB
title_fullStr Evidence classification of high-throughput protocols and confidence integration in RegulonDB
title_full_unstemmed Evidence classification of high-throughput protocols and confidence integration in RegulonDB
title_short Evidence classification of high-throughput protocols and confidence integration in RegulonDB
title_sort evidence classification of high-throughput protocols and confidence integration in regulondb
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3548332/
https://www.ncbi.nlm.nih.gov/pubmed/23327937
http://dx.doi.org/10.1093/database/bas059
work_keys_str_mv AT weissverena evidenceclassificationofhighthroughputprotocolsandconfidenceintegrationinregulondb
AT medinariveraalejandra evidenceclassificationofhighthroughputprotocolsandconfidenceintegrationinregulondb
AT huertaaracelim evidenceclassificationofhighthroughputprotocolsandconfidenceintegrationinregulondb
AT santoszavaletaalberto evidenceclassificationofhighthroughputprotocolsandconfidenceintegrationinregulondb
AT salgadoheladia evidenceclassificationofhighthroughputprotocolsandconfidenceintegrationinregulondb
AT morettenrique evidenceclassificationofhighthroughputprotocolsandconfidenceintegrationinregulondb
AT colladovidesjulio evidenceclassificationofhighthroughputprotocolsandconfidenceintegrationinregulondb