Cargando…

A pipeline for complete characterization of complex germline rearrangements from long DNA reads

BACKGROUND: Many genetic/genomic disorders are caused by genomic rearrangements. Standard methods can often characterize these variations only partly, e.g., copy number changes or breakpoints. It is important to fully understand the order and orientation of rearranged fragments, with precise breakpo...

Descripción completa

Detalles Bibliográficos
Autores principales: Mitsuhashi, Satomi, Ohori, Sachiko, Katoh, Kazutaka, Frith, Martin C., Matsumoto, Naomichi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7393826/
https://www.ncbi.nlm.nih.gov/pubmed/32731881
http://dx.doi.org/10.1186/s13073-020-00762-1
_version_ 1783565111197171712
author Mitsuhashi, Satomi
Ohori, Sachiko
Katoh, Kazutaka
Frith, Martin C.
Matsumoto, Naomichi
author_facet Mitsuhashi, Satomi
Ohori, Sachiko
Katoh, Kazutaka
Frith, Martin C.
Matsumoto, Naomichi
author_sort Mitsuhashi, Satomi
collection PubMed
description BACKGROUND: Many genetic/genomic disorders are caused by genomic rearrangements. Standard methods can often characterize these variations only partly, e.g., copy number changes or breakpoints. It is important to fully understand the order and orientation of rearranged fragments, with precise breakpoints, to know the pathogenicity of the rearrangements. METHODS: We performed whole-genome-coverage nanopore sequencing of long DNA reads from four patients with chromosomal translocations. We identified rearrangements relative to a reference human genome, subtracted rearrangements shared by any of 33 control individuals, and determined the order and orientation of rearranged fragments, with our newly developed analysis pipeline. RESULTS: We describe the full characterization of complex chromosomal rearrangements, by filtering out genomic rearrangements seen in controls without the same disease, reducing the number of loci per patient from a few thousand to a few dozen. Breakpoint detection was very accurate; we usually see ~ 0 ± 1 base difference from Sanger sequencing-confirmed breakpoints. For one patient with two reciprocal chromosomal translocations, we find that the translocation points have complex rearrangements of multiple DNA fragments involving 5 chromosomes, which we could order and orient by an automatic algorithm, thereby fully reconstructing the rearrangement. A rearrangement is more than the sum of its parts: some properties, such as sequence loss, can be inferred only after reconstructing the whole rearrangement. In this patient, the rearrangements were evidently caused by shattering of the chromosomes into multiple fragments, which rejoined in a different order and orientation with loss of some fragments. CONCLUSIONS: We developed an effective analytic pipeline to find chromosomal aberration in congenital diseases by filtering benign changes, only from long read sequencing. Our algorithm for reconstruction of complex rearrangements is useful to interpret rearrangements with many breakpoints, e.g., chromothripsis. Our approach promises to fully characterize many congenital germline rearrangements, provided they do not involve poorly understood loci such as centromeric repeats.
format Online
Article
Text
id pubmed-7393826
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73938262020-08-04 A pipeline for complete characterization of complex germline rearrangements from long DNA reads Mitsuhashi, Satomi Ohori, Sachiko Katoh, Kazutaka Frith, Martin C. Matsumoto, Naomichi Genome Med Research BACKGROUND: Many genetic/genomic disorders are caused by genomic rearrangements. Standard methods can often characterize these variations only partly, e.g., copy number changes or breakpoints. It is important to fully understand the order and orientation of rearranged fragments, with precise breakpoints, to know the pathogenicity of the rearrangements. METHODS: We performed whole-genome-coverage nanopore sequencing of long DNA reads from four patients with chromosomal translocations. We identified rearrangements relative to a reference human genome, subtracted rearrangements shared by any of 33 control individuals, and determined the order and orientation of rearranged fragments, with our newly developed analysis pipeline. RESULTS: We describe the full characterization of complex chromosomal rearrangements, by filtering out genomic rearrangements seen in controls without the same disease, reducing the number of loci per patient from a few thousand to a few dozen. Breakpoint detection was very accurate; we usually see ~ 0 ± 1 base difference from Sanger sequencing-confirmed breakpoints. For one patient with two reciprocal chromosomal translocations, we find that the translocation points have complex rearrangements of multiple DNA fragments involving 5 chromosomes, which we could order and orient by an automatic algorithm, thereby fully reconstructing the rearrangement. A rearrangement is more than the sum of its parts: some properties, such as sequence loss, can be inferred only after reconstructing the whole rearrangement. In this patient, the rearrangements were evidently caused by shattering of the chromosomes into multiple fragments, which rejoined in a different order and orientation with loss of some fragments. CONCLUSIONS: We developed an effective analytic pipeline to find chromosomal aberration in congenital diseases by filtering benign changes, only from long read sequencing. Our algorithm for reconstruction of complex rearrangements is useful to interpret rearrangements with many breakpoints, e.g., chromothripsis. Our approach promises to fully characterize many congenital germline rearrangements, provided they do not involve poorly understood loci such as centromeric repeats. BioMed Central 2020-07-31 /pmc/articles/PMC7393826/ /pubmed/32731881 http://dx.doi.org/10.1186/s13073-020-00762-1 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Mitsuhashi, Satomi
Ohori, Sachiko
Katoh, Kazutaka
Frith, Martin C.
Matsumoto, Naomichi
A pipeline for complete characterization of complex germline rearrangements from long DNA reads
title A pipeline for complete characterization of complex germline rearrangements from long DNA reads
title_full A pipeline for complete characterization of complex germline rearrangements from long DNA reads
title_fullStr A pipeline for complete characterization of complex germline rearrangements from long DNA reads
title_full_unstemmed A pipeline for complete characterization of complex germline rearrangements from long DNA reads
title_short A pipeline for complete characterization of complex germline rearrangements from long DNA reads
title_sort pipeline for complete characterization of complex germline rearrangements from long dna reads
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7393826/
https://www.ncbi.nlm.nih.gov/pubmed/32731881
http://dx.doi.org/10.1186/s13073-020-00762-1
work_keys_str_mv AT mitsuhashisatomi apipelineforcompletecharacterizationofcomplexgermlinerearrangementsfromlongdnareads
AT ohorisachiko apipelineforcompletecharacterizationofcomplexgermlinerearrangementsfromlongdnareads
AT katohkazutaka apipelineforcompletecharacterizationofcomplexgermlinerearrangementsfromlongdnareads
AT frithmartinc apipelineforcompletecharacterizationofcomplexgermlinerearrangementsfromlongdnareads
AT matsumotonaomichi apipelineforcompletecharacterizationofcomplexgermlinerearrangementsfromlongdnareads
AT mitsuhashisatomi pipelineforcompletecharacterizationofcomplexgermlinerearrangementsfromlongdnareads
AT ohorisachiko pipelineforcompletecharacterizationofcomplexgermlinerearrangementsfromlongdnareads
AT katohkazutaka pipelineforcompletecharacterizationofcomplexgermlinerearrangementsfromlongdnareads
AT frithmartinc pipelineforcompletecharacterizationofcomplexgermlinerearrangementsfromlongdnareads
AT matsumotonaomichi pipelineforcompletecharacterizationofcomplexgermlinerearrangementsfromlongdnareads