Cargando…

Optimization of sequence alignment for simple sequence repeat regions

BACKGROUND: Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jighly, Abdulqader, Hamwieh, Aladdin, Ogbonnaya, Francis C
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Technical Note
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3160389/ https://www.ncbi.nlm.nih.gov/pubmed/21774810 http://dx.doi.org/10.1186/1756-0500-4-239

_version_	1782210550686547968
author	Jighly, Abdulqader Hamwieh, Aladdin Ogbonnaya, Francis C
author_facet	Jighly, Abdulqader Hamwieh, Aladdin Ogbonnaya, Francis C
author_sort	Jighly, Abdulqader
collection	PubMed
description	BACKGROUND: Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs) mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs). SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. FINDINGS: To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type. When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. CONCLUSIONS: The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic phylogenic relationship.
format	Online Article Text
id	pubmed-3160389
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-31603892011-08-24 Optimization of sequence alignment for simple sequence repeat regions Jighly, Abdulqader Hamwieh, Aladdin Ogbonnaya, Francis C BMC Res Notes Technical Note BACKGROUND: Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs) mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs). SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. FINDINGS: To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type. When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. CONCLUSIONS: The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic phylogenic relationship. BioMed Central 2011-07-20 /pmc/articles/PMC3160389/ /pubmed/21774810 http://dx.doi.org/10.1186/1756-0500-4-239 Text en Copyright ©2011 Jighly et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/2.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Technical Note Jighly, Abdulqader Hamwieh, Aladdin Ogbonnaya, Francis C Optimization of sequence alignment for simple sequence repeat regions
title	Optimization of sequence alignment for simple sequence repeat regions
title_full	Optimization of sequence alignment for simple sequence repeat regions
title_fullStr	Optimization of sequence alignment for simple sequence repeat regions
title_full_unstemmed	Optimization of sequence alignment for simple sequence repeat regions
title_short	Optimization of sequence alignment for simple sequence repeat regions
title_sort	optimization of sequence alignment for simple sequence repeat regions
topic	Technical Note
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3160389/ https://www.ncbi.nlm.nih.gov/pubmed/21774810 http://dx.doi.org/10.1186/1756-0500-4-239
work_keys_str_mv	AT jighlyabdulqader optimizationofsequencealignmentforsimplesequencerepeatregions AT hamwiehaladdin optimizationofsequencealignmentforsimplesequencerepeatregions AT ogbonnayafrancisc optimizationofsequencealignmentforsimplesequencerepeatregions

Optimization of sequence alignment for simple sequence repeat regions

Ejemplares similares