Cargando…
Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish
Many important model organisms for biomedical and evolutionary research have sequenced genomes, but occupy a phylogenetically isolated position, evolutionarily distant from other sequenced genomes. This phylogenetic isolation is exemplified for zebrafish, a vertebrate model for cis-regulation, devel...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3753653/ https://www.ncbi.nlm.nih.gov/pubmed/23814184 http://dx.doi.org/10.1093/nar/gkt557 |
_version_ | 1782281874466406400 |
---|---|
author | Hiller, Michael Agarwal, Saatvik Notwell, James H. Parikh, Ravi Guturu, Harendra Wenger, Aaron M. Bejerano, Gill |
author_facet | Hiller, Michael Agarwal, Saatvik Notwell, James H. Parikh, Ravi Guturu, Harendra Wenger, Aaron M. Bejerano, Gill |
author_sort | Hiller, Michael |
collection | PubMed |
description | Many important model organisms for biomedical and evolutionary research have sequenced genomes, but occupy a phylogenetically isolated position, evolutionarily distant from other sequenced genomes. This phylogenetic isolation is exemplified for zebrafish, a vertebrate model for cis-regulation, development and human disease, whose evolutionary distance to all other currently sequenced fish exceeds the distance between human and chicken. Such large distances make it difficult to align genomes and use them for comparative analysis beyond gene-focused questions. In particular, detecting conserved non-genic elements (CNEs) as promising cis-regulatory elements with biological importance is challenging. Here, we develop a general comparative genomics framework to align isolated genomes and to comprehensively detect CNEs. Our approach integrates highly sensitive and quality-controlled local alignments and uses alignment transitivity and ancestral reconstruction to bridge large evolutionary distances. We apply our framework to zebrafish and demonstrate substantially improved CNE detection and quality compared with previous sets. Our zebrafish CNE set comprises 54 533 CNEs, of which 11 792 (22%) are conserved to human or mouse. Our zebrafish CNEs (http://zebrafish.stanford.edu) are highly enriched in known enhancers and extend existing experimental (ChIP-Seq) sets. The same framework can now be applied to the isolated genomes of frog, amphioxus, Caenorhabditis elegans and many others. |
format | Online Article Text |
id | pubmed-3753653 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-37536532013-08-27 Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish Hiller, Michael Agarwal, Saatvik Notwell, James H. Parikh, Ravi Guturu, Harendra Wenger, Aaron M. Bejerano, Gill Nucleic Acids Res Methods Online Many important model organisms for biomedical and evolutionary research have sequenced genomes, but occupy a phylogenetically isolated position, evolutionarily distant from other sequenced genomes. This phylogenetic isolation is exemplified for zebrafish, a vertebrate model for cis-regulation, development and human disease, whose evolutionary distance to all other currently sequenced fish exceeds the distance between human and chicken. Such large distances make it difficult to align genomes and use them for comparative analysis beyond gene-focused questions. In particular, detecting conserved non-genic elements (CNEs) as promising cis-regulatory elements with biological importance is challenging. Here, we develop a general comparative genomics framework to align isolated genomes and to comprehensively detect CNEs. Our approach integrates highly sensitive and quality-controlled local alignments and uses alignment transitivity and ancestral reconstruction to bridge large evolutionary distances. We apply our framework to zebrafish and demonstrate substantially improved CNE detection and quality compared with previous sets. Our zebrafish CNE set comprises 54 533 CNEs, of which 11 792 (22%) are conserved to human or mouse. Our zebrafish CNEs (http://zebrafish.stanford.edu) are highly enriched in known enhancers and extend existing experimental (ChIP-Seq) sets. The same framework can now be applied to the isolated genomes of frog, amphioxus, Caenorhabditis elegans and many others. Oxford University Press 2013-08 2013-06-27 /pmc/articles/PMC3753653/ /pubmed/23814184 http://dx.doi.org/10.1093/nar/gkt557 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online Hiller, Michael Agarwal, Saatvik Notwell, James H. Parikh, Ravi Guturu, Harendra Wenger, Aaron M. Bejerano, Gill Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish |
title | Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish |
title_full | Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish |
title_fullStr | Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish |
title_full_unstemmed | Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish |
title_short | Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish |
title_sort | computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3753653/ https://www.ncbi.nlm.nih.gov/pubmed/23814184 http://dx.doi.org/10.1093/nar/gkt557 |
work_keys_str_mv | AT hillermichael computationalmethodstodetectconservednongenicelementsinphylogeneticallyisolatedgenomesapplicationtozebrafish AT agarwalsaatvik computationalmethodstodetectconservednongenicelementsinphylogeneticallyisolatedgenomesapplicationtozebrafish AT notwelljamesh computationalmethodstodetectconservednongenicelementsinphylogeneticallyisolatedgenomesapplicationtozebrafish AT parikhravi computationalmethodstodetectconservednongenicelementsinphylogeneticallyisolatedgenomesapplicationtozebrafish AT guturuharendra computationalmethodstodetectconservednongenicelementsinphylogeneticallyisolatedgenomesapplicationtozebrafish AT wengeraaronm computationalmethodstodetectconservednongenicelementsinphylogeneticallyisolatedgenomesapplicationtozebrafish AT bejeranogill computationalmethodstodetectconservednongenicelementsinphylogeneticallyisolatedgenomesapplicationtozebrafish |