
micRocounter: Microsatellite Characterization in Genome Assemblies

Microsatellites are repetitive DNA sequences usually found in non-coding regions of the genome. Their quantification and analysis have applications in fields from population genetics to evolutionary biology. As genome assemblies become commonplace, the need for software that can facilitate analyses...


Bibliographic Details
Main authors: Lo, Johnathan; Jonika, Michelle M.; Blackmon, Heath
Format: Online Article Text
Language: English
Published: Genetics Society of America 2019
Subjects: Software and Data Resources
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6778809/
https://www.ncbi.nlm.nih.gov/pubmed/31375475
http://dx.doi.org/10.1534/g3.119.400335
_version_ 1783456826781597696
author Lo, Johnathan
Jonika, Michelle M.
Blackmon, Heath
author_facet Lo, Johnathan
Jonika, Michelle M.
Blackmon, Heath
author_sort Lo, Johnathan
collection PubMed
description Microsatellites are repetitive DNA sequences usually found in non-coding regions of the genome. Their quantification and analysis have applications in fields from population genetics to evolutionary biology. As genome assemblies become commonplace, the need for software that can facilitate analyses has never been greater. In particular, R packages that can analyze genomic data are particularly important since this is one of the most popular software environments for biologists. We created an R package, micRocounter, to quantify microsatellites. We have optimized our package for speed, accessibility, and portability, making the automated analysis of large genomic data sets feasible. Computationally intensive algorithms were built in C++ to increase speed. Tests using benchmark datasets show a 200-fold improvement in speed over existing software. A moderately sized genome of 500 Mb can be processed in under 50 sec. Results are output as an object in R increasing accessibility and flexibility for practitioners.
format Online
Article
Text
id pubmed-6778809
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-67788092019-10-07 micRocounter: Microsatellite Characterization in Genome Assemblies Lo, Johnathan Jonika, Michelle M. Blackmon, Heath G3 (Bethesda) Software and Data Resources Microsatellites are repetitive DNA sequences usually found in non-coding regions of the genome. Their quantification and analysis have applications in fields from population genetics to evolutionary biology. As genome assemblies become commonplace, the need for software that can facilitate analyses has never been greater. In particular, R packages that can analyze genomic data are particularly important since this is one of the most popular software environments for biologists. We created an R package, micRocounter, to quantify microsatellites. We have optimized our package for speed, accessibility, and portability, making the automated analysis of large genomic data sets feasible. Computationally intensive algorithms were built in C++ to increase speed. Tests using benchmark datasets show a 200-fold improvement in speed over existing software. A moderately sized genome of 500 Mb can be processed in under 50 sec. Results are output as an object in R increasing accessibility and flexibility for practitioners. Genetics Society of America 2019-08-02 /pmc/articles/PMC6778809/ /pubmed/31375475 http://dx.doi.org/10.1534/g3.119.400335 Text en Copyright © 2019 Lo et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software and Data Resources
Lo, Johnathan
Jonika, Michelle M.
Blackmon, Heath
micRocounter: Microsatellite Characterization in Genome Assemblies
title micRocounter: Microsatellite Characterization in Genome Assemblies
title_full micRocounter: Microsatellite Characterization in Genome Assemblies
title_fullStr micRocounter: Microsatellite Characterization in Genome Assemblies
title_full_unstemmed micRocounter: Microsatellite Characterization in Genome Assemblies
title_short micRocounter: Microsatellite Characterization in Genome Assemblies
title_sort microcounter: microsatellite characterization in genome assemblies
topic Software and Data Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6778809/
https://www.ncbi.nlm.nih.gov/pubmed/31375475
http://dx.doi.org/10.1534/g3.119.400335
work_keys_str_mv AT lojohnathan microcountermicrosatellitecharacterizationingenomeassemblies
AT jonikamichellem microcountermicrosatellitecharacterizationingenomeassemblies
AT blackmonheath microcountermicrosatellitecharacterizationingenomeassemblies