Cargando…

DACCOR–Detection, characterization, and reconstruction of repetitive regions in bacterial genomes

The reconstruction of genomes using mapping-based approaches with short reads experiences difficulties when resolving repetitive regions. These repetitive regions in genomes result in low mapping qualities of the respective reads, which in turn lead to many unresolved bases. Currently, the reconstru...

Descripción completa

Detalles Bibliográficos
Autores principales: Seitz, Alexander, Hanssen, Friederike, Nieselt, Kay
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5983011/
https://www.ncbi.nlm.nih.gov/pubmed/29868249
http://dx.doi.org/10.7717/peerj.4742
_version_ 1783328357800214528
author Seitz, Alexander
Hanssen, Friederike
Nieselt, Kay
author_facet Seitz, Alexander
Hanssen, Friederike
Nieselt, Kay
author_sort Seitz, Alexander
collection PubMed
description The reconstruction of genomes using mapping-based approaches with short reads experiences difficulties when resolving repetitive regions. These repetitive regions in genomes result in low mapping qualities of the respective reads, which in turn lead to many unresolved bases. Currently, the reconstruction of these regions is often based on modified references in which the repetitive regions are masked. However, for many references, such masked genomes are not available or are based on repetitive regions of other genomes. Our idea is to identify repetitive regions in the reference genome de novo. These regions can then be used to reconstruct them separately using short read sequencing data. Afterward, the reconstructed repetitive sequence can be inserted into the reconstructed genome. We present the program detection, characterization, and reconstruction of repetitive regions, which performs these steps automatically. Our results show an increased base pair resolution of the repetitive regions in the reconstruction of Treponema pallidum samples, resulting in fewer unresolved bases.
format Online
Article
Text
id pubmed-5983011
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-59830112018-06-04 DACCOR–Detection, characterization, and reconstruction of repetitive regions in bacterial genomes Seitz, Alexander Hanssen, Friederike Nieselt, Kay PeerJ Bioinformatics The reconstruction of genomes using mapping-based approaches with short reads experiences difficulties when resolving repetitive regions. These repetitive regions in genomes result in low mapping qualities of the respective reads, which in turn lead to many unresolved bases. Currently, the reconstruction of these regions is often based on modified references in which the repetitive regions are masked. However, for many references, such masked genomes are not available or are based on repetitive regions of other genomes. Our idea is to identify repetitive regions in the reference genome de novo. These regions can then be used to reconstruct them separately using short read sequencing data. Afterward, the reconstructed repetitive sequence can be inserted into the reconstructed genome. We present the program detection, characterization, and reconstruction of repetitive regions, which performs these steps automatically. Our results show an increased base pair resolution of the repetitive regions in the reconstruction of Treponema pallidum samples, resulting in fewer unresolved bases. PeerJ Inc. 2018-05-29 /pmc/articles/PMC5983011/ /pubmed/29868249 http://dx.doi.org/10.7717/peerj.4742 Text en © 2018 Seitz et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Seitz, Alexander
Hanssen, Friederike
Nieselt, Kay
DACCOR–Detection, characterization, and reconstruction of repetitive regions in bacterial genomes
title DACCOR–Detection, characterization, and reconstruction of repetitive regions in bacterial genomes
title_full DACCOR–Detection, characterization, and reconstruction of repetitive regions in bacterial genomes
title_fullStr DACCOR–Detection, characterization, and reconstruction of repetitive regions in bacterial genomes
title_full_unstemmed DACCOR–Detection, characterization, and reconstruction of repetitive regions in bacterial genomes
title_short DACCOR–Detection, characterization, and reconstruction of repetitive regions in bacterial genomes
title_sort daccor–detection, characterization, and reconstruction of repetitive regions in bacterial genomes
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5983011/
https://www.ncbi.nlm.nih.gov/pubmed/29868249
http://dx.doi.org/10.7717/peerj.4742
work_keys_str_mv AT seitzalexander daccordetectioncharacterizationandreconstructionofrepetitiveregionsinbacterialgenomes
AT hanssenfriederike daccordetectioncharacterizationandreconstructionofrepetitiveregionsinbacterialgenomes
AT nieseltkay daccordetectioncharacterizationandreconstructionofrepetitiveregionsinbacterialgenomes