Cargando…

Unrealistic phylogenetic trees may improve phylogenetic footprinting

MOTIVATION: The computational investigation of DNA binding motifs from binding sites is one of the classic tasks in bioinformatics and a prerequisite for understanding gene regulation as a whole. Due to the development of sequencing technologies and the increasing number of available genomes, approa...

Descripción completa

Detalles Bibliográficos
Autores principales: Nettling, Martin, Treutler, Hendrik, Cerquides, Jesus, Grosse, Ivo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5447242/
https://www.ncbi.nlm.nih.gov/pubmed/28130227
http://dx.doi.org/10.1093/bioinformatics/btx033
_version_ 1783239290212319232
author Nettling, Martin
Treutler, Hendrik
Cerquides, Jesus
Grosse, Ivo
author_facet Nettling, Martin
Treutler, Hendrik
Cerquides, Jesus
Grosse, Ivo
author_sort Nettling, Martin
collection PubMed
description MOTIVATION: The computational investigation of DNA binding motifs from binding sites is one of the classic tasks in bioinformatics and a prerequisite for understanding gene regulation as a whole. Due to the development of sequencing technologies and the increasing number of available genomes, approaches based on phylogenetic footprinting become increasingly attractive. Phylogenetic footprinting requires phylogenetic trees with attached substitution probabilities for quantifying the evolution of binding sites, but these trees and substitution probabilities are typically not known and cannot be estimated easily. RESULTS: Here, we investigate the influence of phylogenetic trees with different substitution probabilities on the classification performance of phylogenetic footprinting using synthetic and real data. For synthetic data we find that the classification performance is highest when the substitution probability used for phylogenetic footprinting is similar to that used for data generation. For real data, however, we typically find that the classification performance of phylogenetic footprinting surprisingly increases with increasing substitution probabilities and is often highest for unrealistically high substitution probabilities close to one. This finding suggests that choosing realistic model assumptions might not always yield optimal predictions in general and that choosing unrealistically high substitution probabilities close to one might actually improve the classification performance of phylogenetic footprinting. AVAILABILITY AND IMPLEMENTATION: The proposed PF is implemented in JAVA and can be downloaded from https://github.com/mgledi/PhyFoo SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5447242
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-54472422017-05-31 Unrealistic phylogenetic trees may improve phylogenetic footprinting Nettling, Martin Treutler, Hendrik Cerquides, Jesus Grosse, Ivo Bioinformatics Original Papers MOTIVATION: The computational investigation of DNA binding motifs from binding sites is one of the classic tasks in bioinformatics and a prerequisite for understanding gene regulation as a whole. Due to the development of sequencing technologies and the increasing number of available genomes, approaches based on phylogenetic footprinting become increasingly attractive. Phylogenetic footprinting requires phylogenetic trees with attached substitution probabilities for quantifying the evolution of binding sites, but these trees and substitution probabilities are typically not known and cannot be estimated easily. RESULTS: Here, we investigate the influence of phylogenetic trees with different substitution probabilities on the classification performance of phylogenetic footprinting using synthetic and real data. For synthetic data we find that the classification performance is highest when the substitution probability used for phylogenetic footprinting is similar to that used for data generation. For real data, however, we typically find that the classification performance of phylogenetic footprinting surprisingly increases with increasing substitution probabilities and is often highest for unrealistically high substitution probabilities close to one. This finding suggests that choosing realistic model assumptions might not always yield optimal predictions in general and that choosing unrealistically high substitution probabilities close to one might actually improve the classification performance of phylogenetic footprinting. AVAILABILITY AND IMPLEMENTATION: The proposed PF is implemented in JAVA and can be downloaded from https://github.com/mgledi/PhyFoo SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-06-01 2017-01-27 /pmc/articles/PMC5447242/ /pubmed/28130227 http://dx.doi.org/10.1093/bioinformatics/btx033 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Nettling, Martin
Treutler, Hendrik
Cerquides, Jesus
Grosse, Ivo
Unrealistic phylogenetic trees may improve phylogenetic footprinting
title Unrealistic phylogenetic trees may improve phylogenetic footprinting
title_full Unrealistic phylogenetic trees may improve phylogenetic footprinting
title_fullStr Unrealistic phylogenetic trees may improve phylogenetic footprinting
title_full_unstemmed Unrealistic phylogenetic trees may improve phylogenetic footprinting
title_short Unrealistic phylogenetic trees may improve phylogenetic footprinting
title_sort unrealistic phylogenetic trees may improve phylogenetic footprinting
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5447242/
https://www.ncbi.nlm.nih.gov/pubmed/28130227
http://dx.doi.org/10.1093/bioinformatics/btx033
work_keys_str_mv AT nettlingmartin unrealisticphylogenetictreesmayimprovephylogeneticfootprinting
AT treutlerhendrik unrealisticphylogenetictreesmayimprovephylogeneticfootprinting
AT cerquidesjesus unrealisticphylogenetictreesmayimprovephylogeneticfootprinting
AT grosseivo unrealisticphylogenetictreesmayimprovephylogeneticfootprinting