Cargando…

A negative selection heuristic to predict new transcriptional targets

BACKGROUND: Supervised machine learning approaches have been recently adopted in the inference of transcriptional targets from high throughput trascriptomic and proteomic data showing major improvements from with respect to the state of the art of reverse gene regulatory network methods. Beside trad...

Descripción completa

Detalles Bibliográficos
Autores principales:	Cerulo, Luigi, Paduano, Vincenzo, Zoppoli, Pietro, Ceccarelli, Michele
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2013
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3548675/ https://www.ncbi.nlm.nih.gov/pubmed/23368951 http://dx.doi.org/10.1186/1471-2105-14-S1-S3

_version_	1782256344118591488
author	Cerulo, Luigi Paduano, Vincenzo Zoppoli, Pietro Ceccarelli, Michele
author_facet	Cerulo, Luigi Paduano, Vincenzo Zoppoli, Pietro Ceccarelli, Michele
author_sort	Cerulo, Luigi
collection	PubMed
description	BACKGROUND: Supervised machine learning approaches have been recently adopted in the inference of transcriptional targets from high throughput trascriptomic and proteomic data showing major improvements from with respect to the state of the art of reverse gene regulatory network methods. Beside traditional unsupervised techniques, a supervised classifier learns, from known examples, a function that is able to recognize new relationships for new data. In the context of gene regulatory inference a supervised classifier is coerced to learn from positive and unlabeled examples, as the counter negative examples are unavailable or hard to collect. Such a condition could limit the performance of the classifier especially when the amount of training examples is low. RESULTS: In this paper we improve the supervised identification of transcriptional targets by selecting reliable counter negative examples from the unlabeled set. We introduce an heuristic based on the known topology of transcriptional networks that in fact restores the conventional positive/negative training condition and shows a significant improvement of the classification performance. We empirically evaluate the proposed heuristic with the experimental datasets of Escherichia coli and show an example of application in the prediction of BCL6 direct core targets in normal germinal center human B cells obtaining a precision of 60%. CONCLUSIONS: The availability of only positive examples in learning transcriptional relationships negatively affects the performance of supervised classifiers. We show that the selection of reliable negative examples, a practice adopted in text mining approaches, improves the performance of such classifiers opening new perspectives in the identification of new transcriptional targets.
format	Online Article Text
id	pubmed-3548675
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-35486752013-02-04 A negative selection heuristic to predict new transcriptional targets Cerulo, Luigi Paduano, Vincenzo Zoppoli, Pietro Ceccarelli, Michele BMC Bioinformatics Research BACKGROUND: Supervised machine learning approaches have been recently adopted in the inference of transcriptional targets from high throughput trascriptomic and proteomic data showing major improvements from with respect to the state of the art of reverse gene regulatory network methods. Beside traditional unsupervised techniques, a supervised classifier learns, from known examples, a function that is able to recognize new relationships for new data. In the context of gene regulatory inference a supervised classifier is coerced to learn from positive and unlabeled examples, as the counter negative examples are unavailable or hard to collect. Such a condition could limit the performance of the classifier especially when the amount of training examples is low. RESULTS: In this paper we improve the supervised identification of transcriptional targets by selecting reliable counter negative examples from the unlabeled set. We introduce an heuristic based on the known topology of transcriptional networks that in fact restores the conventional positive/negative training condition and shows a significant improvement of the classification performance. We empirically evaluate the proposed heuristic with the experimental datasets of Escherichia coli and show an example of application in the prediction of BCL6 direct core targets in normal germinal center human B cells obtaining a precision of 60%. CONCLUSIONS: The availability of only positive examples in learning transcriptional relationships negatively affects the performance of supervised classifiers. We show that the selection of reliable negative examples, a practice adopted in text mining approaches, improves the performance of such classifiers opening new perspectives in the identification of new transcriptional targets. BioMed Central 2013-01-14 /pmc/articles/PMC3548675/ /pubmed/23368951 http://dx.doi.org/10.1186/1471-2105-14-S1-S3 Text en Copyright ©2013 Cerulo et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Cerulo, Luigi Paduano, Vincenzo Zoppoli, Pietro Ceccarelli, Michele A negative selection heuristic to predict new transcriptional targets
title	A negative selection heuristic to predict new transcriptional targets
title_full	A negative selection heuristic to predict new transcriptional targets
title_fullStr	A negative selection heuristic to predict new transcriptional targets
title_full_unstemmed	A negative selection heuristic to predict new transcriptional targets
title_short	A negative selection heuristic to predict new transcriptional targets
title_sort	negative selection heuristic to predict new transcriptional targets
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3548675/ https://www.ncbi.nlm.nih.gov/pubmed/23368951 http://dx.doi.org/10.1186/1471-2105-14-S1-S3
work_keys_str_mv	AT ceruloluigi anegativeselectionheuristictopredictnewtranscriptionaltargets AT paduanovincenzo anegativeselectionheuristictopredictnewtranscriptionaltargets AT zoppolipietro anegativeselectionheuristictopredictnewtranscriptionaltargets AT ceccarellimichele anegativeselectionheuristictopredictnewtranscriptionaltargets AT ceruloluigi negativeselectionheuristictopredictnewtranscriptionaltargets AT paduanovincenzo negativeselectionheuristictopredictnewtranscriptionaltargets AT zoppolipietro negativeselectionheuristictopredictnewtranscriptionaltargets AT ceccarellimichele negativeselectionheuristictopredictnewtranscriptionaltargets

A negative selection heuristic to predict new transcriptional targets

Ejemplares similares