Cargando…

Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish

Many important model organisms for biomedical and evolutionary research have sequenced genomes, but occupy a phylogenetically isolated position, evolutionarily distant from other sequenced genomes. This phylogenetic isolation is exemplified for zebrafish, a vertebrate model for cis-regulation, devel...

Descripción completa

Detalles Bibliográficos
Autores principales: Hiller, Michael, Agarwal, Saatvik, Notwell, James H., Parikh, Ravi, Guturu, Harendra, Wenger, Aaron M., Bejerano, Gill
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3753653/
https://www.ncbi.nlm.nih.gov/pubmed/23814184
http://dx.doi.org/10.1093/nar/gkt557
_version_ 1782281874466406400
author Hiller, Michael
Agarwal, Saatvik
Notwell, James H.
Parikh, Ravi
Guturu, Harendra
Wenger, Aaron M.
Bejerano, Gill
author_facet Hiller, Michael
Agarwal, Saatvik
Notwell, James H.
Parikh, Ravi
Guturu, Harendra
Wenger, Aaron M.
Bejerano, Gill
author_sort Hiller, Michael
collection PubMed
description Many important model organisms for biomedical and evolutionary research have sequenced genomes, but occupy a phylogenetically isolated position, evolutionarily distant from other sequenced genomes. This phylogenetic isolation is exemplified for zebrafish, a vertebrate model for cis-regulation, development and human disease, whose evolutionary distance to all other currently sequenced fish exceeds the distance between human and chicken. Such large distances make it difficult to align genomes and use them for comparative analysis beyond gene-focused questions. In particular, detecting conserved non-genic elements (CNEs) as promising cis-regulatory elements with biological importance is challenging. Here, we develop a general comparative genomics framework to align isolated genomes and to comprehensively detect CNEs. Our approach integrates highly sensitive and quality-controlled local alignments and uses alignment transitivity and ancestral reconstruction to bridge large evolutionary distances. We apply our framework to zebrafish and demonstrate substantially improved CNE detection and quality compared with previous sets. Our zebrafish CNE set comprises 54 533 CNEs, of which 11 792 (22%) are conserved to human or mouse. Our zebrafish CNEs (http://zebrafish.stanford.edu) are highly enriched in known enhancers and extend existing experimental (ChIP-Seq) sets. The same framework can now be applied to the isolated genomes of frog, amphioxus, Caenorhabditis elegans and many others.
format Online
Article
Text
id pubmed-3753653
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-37536532013-08-27 Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish Hiller, Michael Agarwal, Saatvik Notwell, James H. Parikh, Ravi Guturu, Harendra Wenger, Aaron M. Bejerano, Gill Nucleic Acids Res Methods Online Many important model organisms for biomedical and evolutionary research have sequenced genomes, but occupy a phylogenetically isolated position, evolutionarily distant from other sequenced genomes. This phylogenetic isolation is exemplified for zebrafish, a vertebrate model for cis-regulation, development and human disease, whose evolutionary distance to all other currently sequenced fish exceeds the distance between human and chicken. Such large distances make it difficult to align genomes and use them for comparative analysis beyond gene-focused questions. In particular, detecting conserved non-genic elements (CNEs) as promising cis-regulatory elements with biological importance is challenging. Here, we develop a general comparative genomics framework to align isolated genomes and to comprehensively detect CNEs. Our approach integrates highly sensitive and quality-controlled local alignments and uses alignment transitivity and ancestral reconstruction to bridge large evolutionary distances. We apply our framework to zebrafish and demonstrate substantially improved CNE detection and quality compared with previous sets. Our zebrafish CNE set comprises 54 533 CNEs, of which 11 792 (22%) are conserved to human or mouse. Our zebrafish CNEs (http://zebrafish.stanford.edu) are highly enriched in known enhancers and extend existing experimental (ChIP-Seq) sets. The same framework can now be applied to the isolated genomes of frog, amphioxus, Caenorhabditis elegans and many others. Oxford University Press 2013-08 2013-06-27 /pmc/articles/PMC3753653/ /pubmed/23814184 http://dx.doi.org/10.1093/nar/gkt557 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Hiller, Michael
Agarwal, Saatvik
Notwell, James H.
Parikh, Ravi
Guturu, Harendra
Wenger, Aaron M.
Bejerano, Gill
Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish
title Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish
title_full Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish
title_fullStr Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish
title_full_unstemmed Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish
title_short Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish
title_sort computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3753653/
https://www.ncbi.nlm.nih.gov/pubmed/23814184
http://dx.doi.org/10.1093/nar/gkt557
work_keys_str_mv AT hillermichael computationalmethodstodetectconservednongenicelementsinphylogeneticallyisolatedgenomesapplicationtozebrafish
AT agarwalsaatvik computationalmethodstodetectconservednongenicelementsinphylogeneticallyisolatedgenomesapplicationtozebrafish
AT notwelljamesh computationalmethodstodetectconservednongenicelementsinphylogeneticallyisolatedgenomesapplicationtozebrafish
AT parikhravi computationalmethodstodetectconservednongenicelementsinphylogeneticallyisolatedgenomesapplicationtozebrafish
AT guturuharendra computationalmethodstodetectconservednongenicelementsinphylogeneticallyisolatedgenomesapplicationtozebrafish
AT wengeraaronm computationalmethodstodetectconservednongenicelementsinphylogeneticallyisolatedgenomesapplicationtozebrafish
AT bejeranogill computationalmethodstodetectconservednongenicelementsinphylogeneticallyisolatedgenomesapplicationtozebrafish