Cargando…
GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing
Simple sequence repeats (SSRs), also referred to as microsatellites, are highly variable tandem DNAs that are widely used as genetic markers. The increasing availability of whole-genome and transcript sequences provides information resources for SSR marker development. However, efficient software is...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5020087/ https://www.ncbi.nlm.nih.gov/pubmed/27679641 http://dx.doi.org/10.3389/fpls.2016.01350 |
_version_ | 1782453171063357440 |
---|---|
author | Wang, Xuewen Wang, Le |
author_facet | Wang, Xuewen Wang, Le |
author_sort | Wang, Xuewen |
collection | PubMed |
description | Simple sequence repeats (SSRs), also referred to as microsatellites, are highly variable tandem DNAs that are widely used as genetic markers. The increasing availability of whole-genome and transcript sequences provides information resources for SSR marker development. However, efficient software is required to efficiently identify and display SSR information along with other gene features at a genome scale. We developed novel software package Genome-wide Microsatellite Analyzing Tool Package (GMATA) integrating SSR mining, statistical analysis and plotting, marker design, polymorphism screening and marker transferability, and enabled simultaneously display SSR markers with other genome features. GMATA applies novel strategies for SSR analysis and primer design in large genomes, which allows GMATA to perform faster calculation and provides more accurate results than existing tools. Our package is also capable of processing DNA sequences of any size on a standard computer. GMATA is user friendly, only requires mouse clicks or types inputs on the command line, and is executable in multiple computing platforms. We demonstrated the application of GMATA in plants genomes and reveal a novel distribution pattern of SSRs in 15 grass genomes. The most abundant motifs are dimer GA/TC, the A/T monomer and the GCG/CGC trimer, rather than the rich G/C content in DNA sequence. We also revealed that SSR count is a linear to the chromosome length in fully assembled grass genomes. GMATA represents a powerful application tool that facilitates genomic sequence analyses. GAMTA is freely available at http://sourceforge.net/projects/gmata/?source=navbar. |
format | Online Article Text |
id | pubmed-5020087 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-50200872016-09-27 GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing Wang, Xuewen Wang, Le Front Plant Sci Plant Science Simple sequence repeats (SSRs), also referred to as microsatellites, are highly variable tandem DNAs that are widely used as genetic markers. The increasing availability of whole-genome and transcript sequences provides information resources for SSR marker development. However, efficient software is required to efficiently identify and display SSR information along with other gene features at a genome scale. We developed novel software package Genome-wide Microsatellite Analyzing Tool Package (GMATA) integrating SSR mining, statistical analysis and plotting, marker design, polymorphism screening and marker transferability, and enabled simultaneously display SSR markers with other genome features. GMATA applies novel strategies for SSR analysis and primer design in large genomes, which allows GMATA to perform faster calculation and provides more accurate results than existing tools. Our package is also capable of processing DNA sequences of any size on a standard computer. GMATA is user friendly, only requires mouse clicks or types inputs on the command line, and is executable in multiple computing platforms. We demonstrated the application of GMATA in plants genomes and reveal a novel distribution pattern of SSRs in 15 grass genomes. The most abundant motifs are dimer GA/TC, the A/T monomer and the GCG/CGC trimer, rather than the rich G/C content in DNA sequence. We also revealed that SSR count is a linear to the chromosome length in fully assembled grass genomes. GMATA represents a powerful application tool that facilitates genomic sequence analyses. GAMTA is freely available at http://sourceforge.net/projects/gmata/?source=navbar. Frontiers Media S.A. 2016-09-13 /pmc/articles/PMC5020087/ /pubmed/27679641 http://dx.doi.org/10.3389/fpls.2016.01350 Text en Copyright © 2016 Wang and Wang. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Plant Science Wang, Xuewen Wang, Le GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing |
title | GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing |
title_full | GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing |
title_fullStr | GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing |
title_full_unstemmed | GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing |
title_short | GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing |
title_sort | gmata: an integrated software package for genome-scale ssr mining, marker development and viewing |
topic | Plant Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5020087/ https://www.ncbi.nlm.nih.gov/pubmed/27679641 http://dx.doi.org/10.3389/fpls.2016.01350 |
work_keys_str_mv | AT wangxuewen gmataanintegratedsoftwarepackageforgenomescalessrminingmarkerdevelopmentandviewing AT wangle gmataanintegratedsoftwarepackageforgenomescalessrminingmarkerdevelopmentandviewing |