Cargando…
A survey of localized sequence rearrangements in human DNA
Genomes mutate and evolve in ways simple (substitution or deletion of bases) and complex (e.g. chromosome shattering). We do not fully understand what types of complex mutation occur, and we cannot routinely characterize arbitrarily-complex mutations in a high-throughput, genome-wide manner. Long-re...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5829575/ https://www.ncbi.nlm.nih.gov/pubmed/29272440 http://dx.doi.org/10.1093/nar/gkx1266 |
_version_ | 1783302832962666496 |
---|---|
author | Frith, Martin C Khan, Sofia |
author_facet | Frith, Martin C Khan, Sofia |
author_sort | Frith, Martin C |
collection | PubMed |
description | Genomes mutate and evolve in ways simple (substitution or deletion of bases) and complex (e.g. chromosome shattering). We do not fully understand what types of complex mutation occur, and we cannot routinely characterize arbitrarily-complex mutations in a high-throughput, genome-wide manner. Long-read DNA sequencing methods (e.g. PacBio, nanopore) are promising for this task, because one read may encompass a whole complex mutation. We describe an analysis pipeline to characterize arbitrarily-complex ‘local’ mutations, i.e. intrachromosomal mutations encompassed by one DNA read. We apply it to nanopore and PacBio reads from one human cell line (NA12878), and survey sequence rearrangements, both real and artifactual. Almost all the real rearrangements belong to recurring patterns or motifs: the most common is tandem multiplication (e.g. heptuplication), but there are also complex patterns such as localized shattering, which resembles DNA damage by radiation. Gene conversions are identified, including one between hemoglobin gamma genes. This study demonstrates a way to find intricate rearrangements with any number of duplications, deletions, and repositionings. It demonstrates a probability-based method to resolve ambiguous rearrangements involving highly similar sequences, as occurs in gene conversion. We present a catalog of local rearrangements in one human cell line, and show which rearrangement patterns occur. |
format | Online Article Text |
id | pubmed-5829575 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-58295752018-03-06 A survey of localized sequence rearrangements in human DNA Frith, Martin C Khan, Sofia Nucleic Acids Res Computational Biology Genomes mutate and evolve in ways simple (substitution or deletion of bases) and complex (e.g. chromosome shattering). We do not fully understand what types of complex mutation occur, and we cannot routinely characterize arbitrarily-complex mutations in a high-throughput, genome-wide manner. Long-read DNA sequencing methods (e.g. PacBio, nanopore) are promising for this task, because one read may encompass a whole complex mutation. We describe an analysis pipeline to characterize arbitrarily-complex ‘local’ mutations, i.e. intrachromosomal mutations encompassed by one DNA read. We apply it to nanopore and PacBio reads from one human cell line (NA12878), and survey sequence rearrangements, both real and artifactual. Almost all the real rearrangements belong to recurring patterns or motifs: the most common is tandem multiplication (e.g. heptuplication), but there are also complex patterns such as localized shattering, which resembles DNA damage by radiation. Gene conversions are identified, including one between hemoglobin gamma genes. This study demonstrates a way to find intricate rearrangements with any number of duplications, deletions, and repositionings. It demonstrates a probability-based method to resolve ambiguous rearrangements involving highly similar sequences, as occurs in gene conversion. We present a catalog of local rearrangements in one human cell line, and show which rearrangement patterns occur. Oxford University Press 2018-02-28 2017-12-19 /pmc/articles/PMC5829575/ /pubmed/29272440 http://dx.doi.org/10.1093/nar/gkx1266 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Computational Biology Frith, Martin C Khan, Sofia A survey of localized sequence rearrangements in human DNA |
title | A survey of localized sequence rearrangements in human DNA |
title_full | A survey of localized sequence rearrangements in human DNA |
title_fullStr | A survey of localized sequence rearrangements in human DNA |
title_full_unstemmed | A survey of localized sequence rearrangements in human DNA |
title_short | A survey of localized sequence rearrangements in human DNA |
title_sort | survey of localized sequence rearrangements in human dna |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5829575/ https://www.ncbi.nlm.nih.gov/pubmed/29272440 http://dx.doi.org/10.1093/nar/gkx1266 |
work_keys_str_mv | AT frithmartinc asurveyoflocalizedsequencerearrangementsinhumandna AT khansofia asurveyoflocalizedsequencerearrangementsinhumandna AT frithmartinc surveyoflocalizedsequencerearrangementsinhumandna AT khansofia surveyoflocalizedsequencerearrangementsinhumandna |