Cargando…

Telescoper: de novo assembly of highly repetitive regions

Motivation: With advances in sequencing technology, it has become faster and cheaper to obtain short-read data from which to assemble genomes. Although there has been considerable progress in the field of genome assembly, producing high-quality de novo assemblies from short-reads remains challenging...

Descripción completa

Detalles Bibliográficos
Autores principales: Bresler, Ma'ayan, Sheehan, Sara, Chan, Andrew H., Song, Yun S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3436826/
https://www.ncbi.nlm.nih.gov/pubmed/22962446
http://dx.doi.org/10.1093/bioinformatics/bts399
_version_ 1782242706649513984
author Bresler, Ma'ayan
Sheehan, Sara
Chan, Andrew H.
Song, Yun S.
author_facet Bresler, Ma'ayan
Sheehan, Sara
Chan, Andrew H.
Song, Yun S.
author_sort Bresler, Ma'ayan
collection PubMed
description Motivation: With advances in sequencing technology, it has become faster and cheaper to obtain short-read data from which to assemble genomes. Although there has been considerable progress in the field of genome assembly, producing high-quality de novo assemblies from short-reads remains challenging, primarily because of the complex repeat structures found in the genomes of most higher organisms. The telomeric regions of many genomes are particularly difficult to assemble, though much could be gained from the study of these regions, as their evolution has not been fully characterized and they have been linked to aging. Results: In this article, we tackle the problem of assembling highly repetitive regions by developing a novel algorithm that iteratively extends long paths through a series of read-overlap graphs and evaluates them based on a statistical framework. Our algorithm, Telescoper, uses short- and long-insert libraries in an integrated way throughout the assembly process. Results on real and simulated data demonstrate that our approach can effectively resolve much of the complex repeat structures found in the telomeres of yeast genomes, especially when longer long-insert libraries are used. Availability: Telescoper is publicly available for download at sourceforge.net/p/telescoper. Contact: yss@eecs.berkeley.edu Supplementary Information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3436826
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-34368262012-12-12 Telescoper: de novo assembly of highly repetitive regions Bresler, Ma'ayan Sheehan, Sara Chan, Andrew H. Song, Yun S. Bioinformatics Original Papers Motivation: With advances in sequencing technology, it has become faster and cheaper to obtain short-read data from which to assemble genomes. Although there has been considerable progress in the field of genome assembly, producing high-quality de novo assemblies from short-reads remains challenging, primarily because of the complex repeat structures found in the genomes of most higher organisms. The telomeric regions of many genomes are particularly difficult to assemble, though much could be gained from the study of these regions, as their evolution has not been fully characterized and they have been linked to aging. Results: In this article, we tackle the problem of assembling highly repetitive regions by developing a novel algorithm that iteratively extends long paths through a series of read-overlap graphs and evaluates them based on a statistical framework. Our algorithm, Telescoper, uses short- and long-insert libraries in an integrated way throughout the assembly process. Results on real and simulated data demonstrate that our approach can effectively resolve much of the complex repeat structures found in the telomeres of yeast genomes, especially when longer long-insert libraries are used. Availability: Telescoper is publicly available for download at sourceforge.net/p/telescoper. Contact: yss@eecs.berkeley.edu Supplementary Information: Supplementary data are available at Bioinformatics online. Oxford University Press 2012-09-15 2012-09-03 /pmc/articles/PMC3436826/ /pubmed/22962446 http://dx.doi.org/10.1093/bioinformatics/bts399 Text en © The Author(s) (2012). Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Bresler, Ma'ayan
Sheehan, Sara
Chan, Andrew H.
Song, Yun S.
Telescoper: de novo assembly of highly repetitive regions
title Telescoper: de novo assembly of highly repetitive regions
title_full Telescoper: de novo assembly of highly repetitive regions
title_fullStr Telescoper: de novo assembly of highly repetitive regions
title_full_unstemmed Telescoper: de novo assembly of highly repetitive regions
title_short Telescoper: de novo assembly of highly repetitive regions
title_sort telescoper: de novo assembly of highly repetitive regions
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3436826/
https://www.ncbi.nlm.nih.gov/pubmed/22962446
http://dx.doi.org/10.1093/bioinformatics/bts399
work_keys_str_mv AT breslermaayan telescoperdenovoassemblyofhighlyrepetitiveregions
AT sheehansara telescoperdenovoassemblyofhighlyrepetitiveregions
AT chanandrewh telescoperdenovoassemblyofhighlyrepetitiveregions
AT songyuns telescoperdenovoassemblyofhighlyrepetitiveregions