RepeatCraft: a meta-pipeline for repetitive element de-fragmentation and annotation
SUMMARY: Repetitive elements comprise a large proportion of many genomes. They have an impact on both genome evolution and regulation. Their classification and the study of their evolutionary history is a major emerging field. Various software tools exist to date to classify and map repeats across genomes. The major...
Main authors: | Wong, Wai Yee; Simakov, Oleg |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press 2019 |
Subjects: | Applications Notes |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6419915/ https://www.ncbi.nlm.nih.gov/pubmed/30165587 http://dx.doi.org/10.1093/bioinformatics/bty745 |
_version_ | 1783404024777670656 |
---|---|
author | Wong, Wai Yee; Simakov, Oleg |
author_facet | Wong, Wai Yee; Simakov, Oleg |
author_sort | Wong, Wai Yee |
collection | PubMed |
description | SUMMARY: Repetitive elements comprise a large proportion of many genomes. They have an impact on both genome evolution and regulation. Their classification and the study of their evolutionary history is a major emerging field. Various software tools exist to date to classify and map repeats across genomes. The major unresolved drawback, however, is the fragmented nature of many identified repeat loci. This ultimately makes the classification of novel repeats and their evolutionary analysis difficult. To improve on this, we developed a pipeline (RepeatCraft) that integrates results from several repeat element classification tools based on both sequence similarity and structural features. The pipeline de-fragments closely spaced repeat loci in the genomes, reconstructing longer copies and thus allowing for better annotation and sequence comparisons. The pipeline also includes a user interface that runs in a web browser, allowing for easy access to and exploration of the repeat data. AVAILABILITY AND IMPLEMENTATION: RepeatCraft is implemented in Python and the web application is implemented in R. Downloads and documentation are freely available at https://github.com/niccw/repeatCraftp. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6419915 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-6419915 2019-03-20 RepeatCraft: a meta-pipeline for repetitive element de-fragmentation and annotation Wong, Wai Yee; Simakov, Oleg Bioinformatics Applications Notes SUMMARY: Repetitive elements comprise a large proportion of many genomes. They have an impact on both genome evolution and regulation. Their classification and the study of their evolutionary history is a major emerging field. Various software tools exist to date to classify and map repeats across genomes. The major unresolved drawback, however, is the fragmented nature of many identified repeat loci. This ultimately makes the classification of novel repeats and their evolutionary analysis difficult. To improve on this, we developed a pipeline (RepeatCraft) that integrates results from several repeat element classification tools based on both sequence similarity and structural features. The pipeline de-fragments closely spaced repeat loci in the genomes, reconstructing longer copies and thus allowing for better annotation and sequence comparisons. The pipeline also includes a user interface that runs in a web browser, allowing for easy access to and exploration of the repeat data. AVAILABILITY AND IMPLEMENTATION: RepeatCraft is implemented in Python and the web application is implemented in R. Downloads and documentation are freely available at https://github.com/niccw/repeatCraftp. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-03-15 2018-08-25 /pmc/articles/PMC6419915/ /pubmed/30165587 http://dx.doi.org/10.1093/bioinformatics/bty745 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
title | RepeatCraft: a meta-pipeline for repetitive element de-fragmentation and annotation |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6419915/ https://www.ncbi.nlm.nih.gov/pubmed/30165587 http://dx.doi.org/10.1093/bioinformatics/bty745 |
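
The description above says the pipeline de-fragments closely spaced repeat loci into longer copies. The general idea can be illustrated with a minimal Python sketch. This is not RepeatCraft's actual code or API: the `RepeatLocus` type, the `merge_close_loci` function, and the `max_gap` threshold of 150 bp are all hypothetical names and values chosen for illustration, and the sketch simply merges loci of the same family on the same sequence when the gap between them is small.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RepeatLocus:
    """One annotated repeat fragment (hypothetical type, 1-based inclusive coordinates)."""
    seqid: str
    start: int
    end: int
    family: str


def merge_close_loci(loci: List[RepeatLocus], max_gap: int = 150) -> List[RepeatLocus]:
    """Merge same-family loci on the same sequence separated by at most max_gap bases.

    max_gap is an illustrative assumption, not a value taken from the record.
    """
    merged: List[RepeatLocus] = []
    # Sorting by (seqid, family, start) places candidate fragments next to each other.
    for locus in sorted(loci, key=lambda l: (l.seqid, l.family, l.start)):
        prev = merged[-1] if merged else None
        if (
            prev is not None
            and prev.seqid == locus.seqid
            and prev.family == locus.family
            and locus.start - prev.end - 1 <= max_gap
        ):
            # Extend the previous locus instead of starting a new one.
            prev.end = max(prev.end, locus.end)
        else:
            # Copy so the caller's input objects are never mutated.
            merged.append(RepeatLocus(locus.seqid, locus.start, locus.end, locus.family))
    return merged


if __name__ == "__main__":
    fragments = [
        RepeatLocus("chr1", 100, 200, "L1"),
        RepeatLocus("chr1", 250, 400, "L1"),
        RepeatLocus("chr1", 1000, 1100, "L1"),
        RepeatLocus("chr1", 210, 240, "Alu"),
    ]
    for locus in merge_close_loci(fragments):
        print(locus)
```

Grouping by family before comparing positions means a short fragment of another family lying between two pieces does not block the merge. That is a simplification; a real pipeline would likely also weigh strand, consensus coordinates, and structural evidence such as terminal repeats.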