Cargando…

mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development

BACKGROUND: Rich in genetic information and cost-effective to genotype, the Insertion-Deletion (InDel) molecular marker system is an important tool for studies in genetics, genomics and for marker-assisted breeding. Advent of next-generation sequencing (NGS) revolutionized the speed and throughput o...

Descripción completa

Detalles Bibliográficos
Autores principales: Lv, Yuanda, Liu, Yuhe, Zhao, Han
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4832496/
https://www.ncbi.nlm.nih.gov/pubmed/27079510
http://dx.doi.org/10.1186/s12864-016-2614-5
_version_ 1782427264567214080
author Lv, Yuanda
Liu, Yuhe
Zhao, Han
author_facet Lv, Yuanda
Liu, Yuhe
Zhao, Han
author_sort Lv, Yuanda
collection PubMed
description BACKGROUND: Rich in genetic information and cost-effective to genotype, the Insertion-Deletion (InDel) molecular marker system is an important tool for studies in genetics, genomics and for marker-assisted breeding. Advent of next-generation sequencing (NGS) revolutionized the speed and throughput of sequence data generation, and enabled genome-wide identification of insertion and deletion variation. However, current NGS-based InDel mining tools, such as Samtools, GATK and Atlas2, all rely on a reference genome for variant calling which hinders their application on unsequenced organisms and the output of short InDels compromised their use on gel-based genotyping platforms. To address these issues, an enhanced platform is needed to identify longer InDels and develop markers in absence of a reference genome. RESULTS: Here we present mInDel (multiple InDel), a next-generation variant calling tool specifically designed for InDel marker discovery. By taking in raw sequence reads and assembling them into contigs de novo, this software identifies InDel polymorphisms using a sliding window alignment from assembled contigs, rendering a unique advantage when a reference genome is unavailable. By providing an option of combining multiple discovered InDels as output, mInDel is amiable to gel-based genotyping platforms where markers with large polymorphisms are preferred. We demonstrated the usability and performance of this software through a case study using a set of maize NGS data, and experimentally validated the accuracy of markers generated from mInDel. CONCLUSIONS: mInDel is a novel and practical tool that enables rapid genome-wide InDel marker discovery. The features of being independent from a reference genome and the flexibility with downstream genotyping platforms will allow a broad range of applications across genetics research and plant breeding. The mInDel pipeline is freely available at www.github.com/lyd0527/mInDel.
format Online
Article
Text
id pubmed-4832496
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48324962016-04-16 mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development Lv, Yuanda Liu, Yuhe Zhao, Han BMC Genomics Software BACKGROUND: Rich in genetic information and cost-effective to genotype, the Insertion-Deletion (InDel) molecular marker system is an important tool for studies in genetics, genomics and for marker-assisted breeding. Advent of next-generation sequencing (NGS) revolutionized the speed and throughput of sequence data generation, and enabled genome-wide identification of insertion and deletion variation. However, current NGS-based InDel mining tools, such as Samtools, GATK and Atlas2, all rely on a reference genome for variant calling which hinders their application on unsequenced organisms and the output of short InDels compromised their use on gel-based genotyping platforms. To address these issues, an enhanced platform is needed to identify longer InDels and develop markers in absence of a reference genome. RESULTS: Here we present mInDel (multiple InDel), a next-generation variant calling tool specifically designed for InDel marker discovery. By taking in raw sequence reads and assembling them into contigs de novo, this software identifies InDel polymorphisms using a sliding window alignment from assembled contigs, rendering a unique advantage when a reference genome is unavailable. By providing an option of combining multiple discovered InDels as output, mInDel is amiable to gel-based genotyping platforms where markers with large polymorphisms are preferred. We demonstrated the usability and performance of this software through a case study using a set of maize NGS data, and experimentally validated the accuracy of markers generated from mInDel. CONCLUSIONS: mInDel is a novel and practical tool that enables rapid genome-wide InDel marker discovery. The features of being independent from a reference genome and the flexibility with downstream genotyping platforms will allow a broad range of applications across genetics research and plant breeding. The mInDel pipeline is freely available at www.github.com/lyd0527/mInDel. BioMed Central 2016-04-14 /pmc/articles/PMC4832496/ /pubmed/27079510 http://dx.doi.org/10.1186/s12864-016-2614-5 Text en © Lv et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Lv, Yuanda
Liu, Yuhe
Zhao, Han
mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development
title mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development
title_full mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development
title_fullStr mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development
title_full_unstemmed mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development
title_short mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development
title_sort mindel: a high-throughput and efficient pipeline for genome-wide indel marker development
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4832496/
https://www.ncbi.nlm.nih.gov/pubmed/27079510
http://dx.doi.org/10.1186/s12864-016-2614-5
work_keys_str_mv AT lvyuanda mindelahighthroughputandefficientpipelineforgenomewideindelmarkerdevelopment
AT liuyuhe mindelahighthroughputandefficientpipelineforgenomewideindelmarkerdevelopment
AT zhaohan mindelahighthroughputandefficientpipelineforgenomewideindelmarkerdevelopment