Evidence classification of high-throughput protocols and confidence integration in RegulonDB
RegulonDB provides curated information on the transcriptional regulatory network of Escherichia coli and contains both experimental data and computationally predicted objects. To account for the heterogeneity of these data, we introduced in version 6.0, a two-tier rating system for the strength of e...
Main Authors: | Weiss, Verena; Medina-Rivera, Alejandra; Huerta, Araceli M.; Santos-Zavaleta, Alberto; Salgado, Heladia; Morett, Enrique; Collado-Vides, Julio |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press 2013 |
Subjects: | Original Article |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3548332/ https://www.ncbi.nlm.nih.gov/pubmed/23327937 http://dx.doi.org/10.1093/database/bas059 |
_version_ | 1782256308088471552 |
---|---|
author | Weiss, Verena Medina-Rivera, Alejandra Huerta, Araceli M. Santos-Zavaleta, Alberto Salgado, Heladia Morett, Enrique Collado-Vides, Julio |
collection | PubMed |
description | RegulonDB provides curated information on the transcriptional regulatory network of Escherichia coli and contains both experimental data and computationally predicted objects. To account for the heterogeneity of these data, we introduced in version 6.0, a two-tier rating system for the strength of evidence, classifying evidence as either ‘weak’ or ‘strong’ (Gama-Castro,S., Jimenez-Jacinto,V., Peralta-Gil,M. et al. RegulonDB (Version 6.0): gene regulation model of Escherichia Coli K-12 beyond transcription, active (experimental) annotated promoters and textpresso navigation. Nucleic Acids Res., 2008;36:D120–D124.). We now add to our classification scheme the classification of high-throughput evidence, including chromatin immunoprecipitation (ChIP) and RNA-seq technologies. To integrate these data into RegulonDB, we present two strategies for the evaluation of confidence, statistical validation and independent cross-validation. Statistical validation involves verification of ChIP data for transcription factor-binding sites, using tools for motif discovery and quality assessment of the discovered matrices. Independent cross-validation combines independent evidence with the intention to mutually exclude false positives. Both statistical validation and cross-validation allow to upgrade subsets of data that are supported by weak evidence to a higher confidence level. Likewise, cross-validation of strong confidence data extends our two-tier rating system to a three-tier system by introducing a third confidence score ‘confirmed’. Database URL: http://regulondb.ccg.unam.mx/ |
format | Online Article Text |
id | pubmed-3548332 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-3548332 2013-01-18 Database (Oxford) Original Article Oxford University Press 2013-01-17 /pmc/articles/PMC3548332/ /pubmed/23327937 http://dx.doi.org/10.1093/database/bas059 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
title | Evidence classification of high-throughput protocols and confidence integration in RegulonDB |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3548332/ https://www.ncbi.nlm.nih.gov/pubmed/23327937 http://dx.doi.org/10.1093/database/bas059 |