Cargando…
mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development
BACKGROUND: Rich in genetic information and cost-effective to genotype, the Insertion-Deletion (InDel) molecular marker system is an important tool for studies in genetics, genomics and for marker-assisted breeding. Advent of next-generation sequencing (NGS) revolutionized the speed and throughput o...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4832496/ https://www.ncbi.nlm.nih.gov/pubmed/27079510 http://dx.doi.org/10.1186/s12864-016-2614-5 |
_version_ | 1782427264567214080 |
---|---|
author | Lv, Yuanda Liu, Yuhe Zhao, Han |
author_facet | Lv, Yuanda Liu, Yuhe Zhao, Han |
author_sort | Lv, Yuanda |
collection | PubMed |
description | BACKGROUND: Rich in genetic information and cost-effective to genotype, the Insertion-Deletion (InDel) molecular marker system is an important tool for studies in genetics, genomics and for marker-assisted breeding. Advent of next-generation sequencing (NGS) revolutionized the speed and throughput of sequence data generation, and enabled genome-wide identification of insertion and deletion variation. However, current NGS-based InDel mining tools, such as Samtools, GATK and Atlas2, all rely on a reference genome for variant calling which hinders their application on unsequenced organisms and the output of short InDels compromised their use on gel-based genotyping platforms. To address these issues, an enhanced platform is needed to identify longer InDels and develop markers in absence of a reference genome. RESULTS: Here we present mInDel (multiple InDel), a next-generation variant calling tool specifically designed for InDel marker discovery. By taking in raw sequence reads and assembling them into contigs de novo, this software identifies InDel polymorphisms using a sliding window alignment from assembled contigs, rendering a unique advantage when a reference genome is unavailable. By providing an option of combining multiple discovered InDels as output, mInDel is amiable to gel-based genotyping platforms where markers with large polymorphisms are preferred. We demonstrated the usability and performance of this software through a case study using a set of maize NGS data, and experimentally validated the accuracy of markers generated from mInDel. CONCLUSIONS: mInDel is a novel and practical tool that enables rapid genome-wide InDel marker discovery. The features of being independent from a reference genome and the flexibility with downstream genotyping platforms will allow a broad range of applications across genetics research and plant breeding. The mInDel pipeline is freely available at www.github.com/lyd0527/mInDel. |
format | Online Article Text |
id | pubmed-4832496 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-48324962016-04-16 mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development Lv, Yuanda Liu, Yuhe Zhao, Han BMC Genomics Software BACKGROUND: Rich in genetic information and cost-effective to genotype, the Insertion-Deletion (InDel) molecular marker system is an important tool for studies in genetics, genomics and for marker-assisted breeding. Advent of next-generation sequencing (NGS) revolutionized the speed and throughput of sequence data generation, and enabled genome-wide identification of insertion and deletion variation. However, current NGS-based InDel mining tools, such as Samtools, GATK and Atlas2, all rely on a reference genome for variant calling which hinders their application on unsequenced organisms and the output of short InDels compromised their use on gel-based genotyping platforms. To address these issues, an enhanced platform is needed to identify longer InDels and develop markers in absence of a reference genome. RESULTS: Here we present mInDel (multiple InDel), a next-generation variant calling tool specifically designed for InDel marker discovery. By taking in raw sequence reads and assembling them into contigs de novo, this software identifies InDel polymorphisms using a sliding window alignment from assembled contigs, rendering a unique advantage when a reference genome is unavailable. By providing an option of combining multiple discovered InDels as output, mInDel is amiable to gel-based genotyping platforms where markers with large polymorphisms are preferred. We demonstrated the usability and performance of this software through a case study using a set of maize NGS data, and experimentally validated the accuracy of markers generated from mInDel. CONCLUSIONS: mInDel is a novel and practical tool that enables rapid genome-wide InDel marker discovery. The features of being independent from a reference genome and the flexibility with downstream genotyping platforms will allow a broad range of applications across genetics research and plant breeding. The mInDel pipeline is freely available at www.github.com/lyd0527/mInDel. BioMed Central 2016-04-14 /pmc/articles/PMC4832496/ /pubmed/27079510 http://dx.doi.org/10.1186/s12864-016-2614-5 Text en © Lv et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Lv, Yuanda Liu, Yuhe Zhao, Han mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development |
title | mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development |
title_full | mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development |
title_fullStr | mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development |
title_full_unstemmed | mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development |
title_short | mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development |
title_sort | mindel: a high-throughput and efficient pipeline for genome-wide indel marker development |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4832496/ https://www.ncbi.nlm.nih.gov/pubmed/27079510 http://dx.doi.org/10.1186/s12864-016-2614-5 |
work_keys_str_mv | AT lvyuanda mindelahighthroughputandefficientpipelineforgenomewideindelmarkerdevelopment AT liuyuhe mindelahighthroughputandefficientpipelineforgenomewideindelmarkerdevelopment AT zhaohan mindelahighthroughputandefficientpipelineforgenomewideindelmarkerdevelopment |