Cargando…

A survey of localized sequence rearrangements in human DNA

Genomes mutate and evolve in ways simple (substitution or deletion of bases) and complex (e.g. chromosome shattering). We do not fully understand what types of complex mutation occur, and we cannot routinely characterize arbitrarily-complex mutations in a high-throughput, genome-wide manner. Long-re...

Descripción completa

Detalles Bibliográficos
Autores principales: Frith, Martin C, Khan, Sofia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5829575/
https://www.ncbi.nlm.nih.gov/pubmed/29272440
http://dx.doi.org/10.1093/nar/gkx1266
_version_ 1783302832962666496
author Frith, Martin C
Khan, Sofia
author_facet Frith, Martin C
Khan, Sofia
author_sort Frith, Martin C
collection PubMed
description Genomes mutate and evolve in ways simple (substitution or deletion of bases) and complex (e.g. chromosome shattering). We do not fully understand what types of complex mutation occur, and we cannot routinely characterize arbitrarily-complex mutations in a high-throughput, genome-wide manner. Long-read DNA sequencing methods (e.g. PacBio, nanopore) are promising for this task, because one read may encompass a whole complex mutation. We describe an analysis pipeline to characterize arbitrarily-complex ‘local’ mutations, i.e. intrachromosomal mutations encompassed by one DNA read. We apply it to nanopore and PacBio reads from one human cell line (NA12878), and survey sequence rearrangements, both real and artifactual. Almost all the real rearrangements belong to recurring patterns or motifs: the most common is tandem multiplication (e.g. heptuplication), but there are also complex patterns such as localized shattering, which resembles DNA damage by radiation. Gene conversions are identified, including one between hemoglobin gamma genes. This study demonstrates a way to find intricate rearrangements with any number of duplications, deletions, and repositionings. It demonstrates a probability-based method to resolve ambiguous rearrangements involving highly similar sequences, as occurs in gene conversion. We present a catalog of local rearrangements in one human cell line, and show which rearrangement patterns occur.
format Online
Article
Text
id pubmed-5829575
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58295752018-03-06 A survey of localized sequence rearrangements in human DNA Frith, Martin C Khan, Sofia Nucleic Acids Res Computational Biology Genomes mutate and evolve in ways simple (substitution or deletion of bases) and complex (e.g. chromosome shattering). We do not fully understand what types of complex mutation occur, and we cannot routinely characterize arbitrarily-complex mutations in a high-throughput, genome-wide manner. Long-read DNA sequencing methods (e.g. PacBio, nanopore) are promising for this task, because one read may encompass a whole complex mutation. We describe an analysis pipeline to characterize arbitrarily-complex ‘local’ mutations, i.e. intrachromosomal mutations encompassed by one DNA read. We apply it to nanopore and PacBio reads from one human cell line (NA12878), and survey sequence rearrangements, both real and artifactual. Almost all the real rearrangements belong to recurring patterns or motifs: the most common is tandem multiplication (e.g. heptuplication), but there are also complex patterns such as localized shattering, which resembles DNA damage by radiation. Gene conversions are identified, including one between hemoglobin gamma genes. This study demonstrates a way to find intricate rearrangements with any number of duplications, deletions, and repositionings. It demonstrates a probability-based method to resolve ambiguous rearrangements involving highly similar sequences, as occurs in gene conversion. We present a catalog of local rearrangements in one human cell line, and show which rearrangement patterns occur. Oxford University Press 2018-02-28 2017-12-19 /pmc/articles/PMC5829575/ /pubmed/29272440 http://dx.doi.org/10.1093/nar/gkx1266 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Biology
Frith, Martin C
Khan, Sofia
A survey of localized sequence rearrangements in human DNA
title A survey of localized sequence rearrangements in human DNA
title_full A survey of localized sequence rearrangements in human DNA
title_fullStr A survey of localized sequence rearrangements in human DNA
title_full_unstemmed A survey of localized sequence rearrangements in human DNA
title_short A survey of localized sequence rearrangements in human DNA
title_sort survey of localized sequence rearrangements in human dna
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5829575/
https://www.ncbi.nlm.nih.gov/pubmed/29272440
http://dx.doi.org/10.1093/nar/gkx1266
work_keys_str_mv AT frithmartinc asurveyoflocalizedsequencerearrangementsinhumandna
AT khansofia asurveyoflocalizedsequencerearrangementsinhumandna
AT frithmartinc surveyoflocalizedsequencerearrangementsinhumandna
AT khansofia surveyoflocalizedsequencerearrangementsinhumandna