Cargando…

GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing

Simple sequence repeats (SSRs), also referred to as microsatellites, are highly variable tandem DNAs that are widely used as genetic markers. The increasing availability of whole-genome and transcript sequences provides information resources for SSR marker development. However, efficient software is...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xuewen, Wang, Le
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5020087/
https://www.ncbi.nlm.nih.gov/pubmed/27679641
http://dx.doi.org/10.3389/fpls.2016.01350
_version_ 1782453171063357440
author Wang, Xuewen
Wang, Le
author_facet Wang, Xuewen
Wang, Le
author_sort Wang, Xuewen
collection PubMed
description Simple sequence repeats (SSRs), also referred to as microsatellites, are highly variable tandem DNAs that are widely used as genetic markers. The increasing availability of whole-genome and transcript sequences provides information resources for SSR marker development. However, efficient software is required to efficiently identify and display SSR information along with other gene features at a genome scale. We developed novel software package Genome-wide Microsatellite Analyzing Tool Package (GMATA) integrating SSR mining, statistical analysis and plotting, marker design, polymorphism screening and marker transferability, and enabled simultaneously display SSR markers with other genome features. GMATA applies novel strategies for SSR analysis and primer design in large genomes, which allows GMATA to perform faster calculation and provides more accurate results than existing tools. Our package is also capable of processing DNA sequences of any size on a standard computer. GMATA is user friendly, only requires mouse clicks or types inputs on the command line, and is executable in multiple computing platforms. We demonstrated the application of GMATA in plants genomes and reveal a novel distribution pattern of SSRs in 15 grass genomes. The most abundant motifs are dimer GA/TC, the A/T monomer and the GCG/CGC trimer, rather than the rich G/C content in DNA sequence. We also revealed that SSR count is a linear to the chromosome length in fully assembled grass genomes. GMATA represents a powerful application tool that facilitates genomic sequence analyses. GAMTA is freely available at http://sourceforge.net/projects/gmata/?source=navbar.
format Online
Article
Text
id pubmed-5020087
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-50200872016-09-27 GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing Wang, Xuewen Wang, Le Front Plant Sci Plant Science Simple sequence repeats (SSRs), also referred to as microsatellites, are highly variable tandem DNAs that are widely used as genetic markers. The increasing availability of whole-genome and transcript sequences provides information resources for SSR marker development. However, efficient software is required to efficiently identify and display SSR information along with other gene features at a genome scale. We developed novel software package Genome-wide Microsatellite Analyzing Tool Package (GMATA) integrating SSR mining, statistical analysis and plotting, marker design, polymorphism screening and marker transferability, and enabled simultaneously display SSR markers with other genome features. GMATA applies novel strategies for SSR analysis and primer design in large genomes, which allows GMATA to perform faster calculation and provides more accurate results than existing tools. Our package is also capable of processing DNA sequences of any size on a standard computer. GMATA is user friendly, only requires mouse clicks or types inputs on the command line, and is executable in multiple computing platforms. We demonstrated the application of GMATA in plants genomes and reveal a novel distribution pattern of SSRs in 15 grass genomes. The most abundant motifs are dimer GA/TC, the A/T monomer and the GCG/CGC trimer, rather than the rich G/C content in DNA sequence. We also revealed that SSR count is a linear to the chromosome length in fully assembled grass genomes. GMATA represents a powerful application tool that facilitates genomic sequence analyses. GAMTA is freely available at http://sourceforge.net/projects/gmata/?source=navbar. Frontiers Media S.A. 2016-09-13 /pmc/articles/PMC5020087/ /pubmed/27679641 http://dx.doi.org/10.3389/fpls.2016.01350 Text en Copyright © 2016 Wang and Wang. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Wang, Xuewen
Wang, Le
GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing
title GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing
title_full GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing
title_fullStr GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing
title_full_unstemmed GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing
title_short GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing
title_sort gmata: an integrated software package for genome-scale ssr mining, marker development and viewing
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5020087/
https://www.ncbi.nlm.nih.gov/pubmed/27679641
http://dx.doi.org/10.3389/fpls.2016.01350
work_keys_str_mv AT wangxuewen gmataanintegratedsoftwarepackageforgenomescalessrminingmarkerdevelopmentandviewing
AT wangle gmataanintegratedsoftwarepackageforgenomescalessrminingmarkerdevelopmentandviewing