Cargando…

Characterizing microsatellite polymorphisms using assembly-based and mapping-based tools

Microsatellite polymorphism has always been a challenge for genome assembly and sequence alignment due to sequencing errors, short read lengths, and high incidence of polymerase slippage in microsatellite regions. Despite the information they carry being very valuable, microsatellite variations have...

Descripción completa

Detalles Bibliográficos
Autores principales: DEMİR, Gülfem, ALKAN, Can
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Scientific and Technological Research Council of Turkey 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6710001/
https://www.ncbi.nlm.nih.gov/pubmed/31496881
http://dx.doi.org/10.3906/biy-1903-16
_version_ 1783446275535929344
author DEMİR, Gülfem
ALKAN, Can
author_facet DEMİR, Gülfem
ALKAN, Can
author_sort DEMİR, Gülfem
collection PubMed
description Microsatellite polymorphism has always been a challenge for genome assembly and sequence alignment due to sequencing errors, short read lengths, and high incidence of polymerase slippage in microsatellite regions. Despite the information they carry being very valuable, microsatellite variations have not gained enough attention to be a routine step in genome sequence analysis pipelines. After the completion of the 1000 Genomes Project, which aimed to establish the most detailed genetic variation catalog for humans, the consortium released only two microsatellite prediction sets generated by two tools. Many other large research efforts have failed to shed light on microsatellite variations. We evaluated the performance of three different local assembly methods on three different experimental settings, focusing on genotype-based performance, coverage impact, and preprocessing including flanking regions. All these experiments supported our initial expectations on assembly. We also demonstrate that overlap-layout-consensus (OLC)-basedassembly methods show higher sensitivity to microsatellite variant calling when compared to a de Bruijn graph-based approach. We conclude that assembly with OLC is the better method for genotyping microsatellites. Our pipeline is available at https://github.com/gulfemd/STRAssembly.
format Online
Article
Text
id pubmed-6710001
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher The Scientific and Technological Research Council of Turkey
record_format MEDLINE/PubMed
spelling pubmed-67100012019-09-06 Characterizing microsatellite polymorphisms using assembly-based and mapping-based tools DEMİR, Gülfem ALKAN, Can Turk J Biol Article Microsatellite polymorphism has always been a challenge for genome assembly and sequence alignment due to sequencing errors, short read lengths, and high incidence of polymerase slippage in microsatellite regions. Despite the information they carry being very valuable, microsatellite variations have not gained enough attention to be a routine step in genome sequence analysis pipelines. After the completion of the 1000 Genomes Project, which aimed to establish the most detailed genetic variation catalog for humans, the consortium released only two microsatellite prediction sets generated by two tools. Many other large research efforts have failed to shed light on microsatellite variations. We evaluated the performance of three different local assembly methods on three different experimental settings, focusing on genotype-based performance, coverage impact, and preprocessing including flanking regions. All these experiments supported our initial expectations on assembly. We also demonstrate that overlap-layout-consensus (OLC)-basedassembly methods show higher sensitivity to microsatellite variant calling when compared to a de Bruijn graph-based approach. We conclude that assembly with OLC is the better method for genotyping microsatellites. Our pipeline is available at https://github.com/gulfemd/STRAssembly. The Scientific and Technological Research Council of Turkey 2019-08-05 /pmc/articles/PMC6710001/ /pubmed/31496881 http://dx.doi.org/10.3906/biy-1903-16 Text en Copyright © 2019 The Author(s) This article is distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use and redistribution provided that the original author and source are credited.
spellingShingle Article
DEMİR, Gülfem
ALKAN, Can
Characterizing microsatellite polymorphisms using assembly-based and mapping-based tools
title Characterizing microsatellite polymorphisms using assembly-based and mapping-based tools
title_full Characterizing microsatellite polymorphisms using assembly-based and mapping-based tools
title_fullStr Characterizing microsatellite polymorphisms using assembly-based and mapping-based tools
title_full_unstemmed Characterizing microsatellite polymorphisms using assembly-based and mapping-based tools
title_short Characterizing microsatellite polymorphisms using assembly-based and mapping-based tools
title_sort characterizing microsatellite polymorphisms using assembly-based and mapping-based tools
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6710001/
https://www.ncbi.nlm.nih.gov/pubmed/31496881
http://dx.doi.org/10.3906/biy-1903-16
work_keys_str_mv AT demirgulfem characterizingmicrosatellitepolymorphismsusingassemblybasedandmappingbasedtools
AT alkancan characterizingmicrosatellitepolymorphismsusingassemblybasedandmappingbasedtools