Cargando…

Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale

BACKGROUND: Comprehensive recognition of genomic variation in one individual is important for understanding disease and developing personalized medication and treatment. Many tools based on DNA re-sequencing exist for identification of single nucleotide polymorphisms, small insertions and deletions...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Siyang, Huang, Shujia, Rao, Junhua, Ye, Weijian, Krogh, Anders, Wang, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4690232/
https://www.ncbi.nlm.nih.gov/pubmed/26705468
http://dx.doi.org/10.1186/s13742-015-0103-4
_version_ 1782406973835182080
author Liu, Siyang
Huang, Shujia
Rao, Junhua
Ye, Weijian
Krogh, Anders
Wang, Jun
author_facet Liu, Siyang
Huang, Shujia
Rao, Junhua
Ye, Weijian
Krogh, Anders
Wang, Jun
author_sort Liu, Siyang
collection PubMed
description BACKGROUND: Comprehensive recognition of genomic variation in one individual is important for understanding disease and developing personalized medication and treatment. Many tools based on DNA re-sequencing exist for identification of single nucleotide polymorphisms, small insertions and deletions (indels) as well as large deletions. However, these approaches consistently display a substantial bias against the recovery of complex structural variants and novel sequence in individual genomes and do not provide interpretation information such as the annotation of ancestral state and formation mechanism. FINDINGS: We present a novel approach implemented in a single software package, AsmVar, to discover, genotype and characterize different forms of structural variation and novel sequence from population-scale de novo genome assemblies up to nucleotide resolution. Application of AsmVar to several human de novo genome assemblies captures a wide spectrum of structural variants and novel sequences present in the human population in high sensitivity and specificity. CONCLUSIONS: Our method provides a direct solution for investigating structural variants and novel sequences from de novo genome assemblies, facilitating the construction of population-scale pan-genomes. Our study also highlights the usefulness of the de novo assembly strategy for definition of genome structure. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13742-015-0103-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4690232
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46902322015-12-25 Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale Liu, Siyang Huang, Shujia Rao, Junhua Ye, Weijian Krogh, Anders Wang, Jun Gigascience Technical Note BACKGROUND: Comprehensive recognition of genomic variation in one individual is important for understanding disease and developing personalized medication and treatment. Many tools based on DNA re-sequencing exist for identification of single nucleotide polymorphisms, small insertions and deletions (indels) as well as large deletions. However, these approaches consistently display a substantial bias against the recovery of complex structural variants and novel sequence in individual genomes and do not provide interpretation information such as the annotation of ancestral state and formation mechanism. FINDINGS: We present a novel approach implemented in a single software package, AsmVar, to discover, genotype and characterize different forms of structural variation and novel sequence from population-scale de novo genome assemblies up to nucleotide resolution. Application of AsmVar to several human de novo genome assemblies captures a wide spectrum of structural variants and novel sequences present in the human population in high sensitivity and specificity. CONCLUSIONS: Our method provides a direct solution for investigating structural variants and novel sequences from de novo genome assemblies, facilitating the construction of population-scale pan-genomes. Our study also highlights the usefulness of the de novo assembly strategy for definition of genome structure. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13742-015-0103-4) contains supplementary material, which is available to authorized users. BioMed Central 2015-12-24 /pmc/articles/PMC4690232/ /pubmed/26705468 http://dx.doi.org/10.1186/s13742-015-0103-4 Text en © Liu et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Technical Note
Liu, Siyang
Huang, Shujia
Rao, Junhua
Ye, Weijian
Krogh, Anders
Wang, Jun
Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale
title Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale
title_full Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale
title_fullStr Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale
title_full_unstemmed Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale
title_short Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale
title_sort discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4690232/
https://www.ncbi.nlm.nih.gov/pubmed/26705468
http://dx.doi.org/10.1186/s13742-015-0103-4
work_keys_str_mv AT liusiyang discoverygenotypingandcharacterizationofstructuralvariationandnovelsequenceatsinglenucleotideresolutionfromdenovogenomeassembliesonapopulationscale
AT huangshujia discoverygenotypingandcharacterizationofstructuralvariationandnovelsequenceatsinglenucleotideresolutionfromdenovogenomeassembliesonapopulationscale
AT raojunhua discoverygenotypingandcharacterizationofstructuralvariationandnovelsequenceatsinglenucleotideresolutionfromdenovogenomeassembliesonapopulationscale
AT yeweijian discoverygenotypingandcharacterizationofstructuralvariationandnovelsequenceatsinglenucleotideresolutionfromdenovogenomeassembliesonapopulationscale
AT discoverygenotypingandcharacterizationofstructuralvariationandnovelsequenceatsinglenucleotideresolutionfromdenovogenomeassembliesonapopulationscale
AT kroghanders discoverygenotypingandcharacterizationofstructuralvariationandnovelsequenceatsinglenucleotideresolutionfromdenovogenomeassembliesonapopulationscale
AT wangjun discoverygenotypingandcharacterizationofstructuralvariationandnovelsequenceatsinglenucleotideresolutionfromdenovogenomeassembliesonapopulationscale