Cargando…

Performance evaluation of indel calling tools using real short-read data

BACKGROUND: Insertion and deletion (indel), a common form of genetic variation, has been shown to cause or contribute to human genetic diseases and cancer. With the advance of next-generation sequencing technology, many indel calling tools have been developed; however, evaluation and comparison of t...

Descripción completa

Detalles Bibliográficos
Autores principales: Hasan, Mohammad Shabbir, Wu, Xiaowei, Zhang, Liqing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4545535/
https://www.ncbi.nlm.nih.gov/pubmed/26286629
http://dx.doi.org/10.1186/s40246-015-0042-2
Descripción
Sumario:BACKGROUND: Insertion and deletion (indel), a common form of genetic variation, has been shown to cause or contribute to human genetic diseases and cancer. With the advance of next-generation sequencing technology, many indel calling tools have been developed; however, evaluation and comparison of these tools using large-scale real data are still scant. Here we evaluated seven popular and publicly available indel calling tools, GATK Unified Genotyper, VarScan, Pindel, SAMtools, Dindel, GTAK HaplotypeCaller, and Platypus, using 78 human genome low-coverage data from the 1000 Genomes project. RESULTS: Comparing indels called by these tools with a known set of indels, we found that Platypus outperforms other tools. In addition, a high percentage of known indels still remain undetected and the number of common indels called by all seven tools is very low. CONCLUSION: All these findings indicate the necessity of improving the existing tools or developing new algorithms to achieve reliable and consistent indel calling results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s40246-015-0042-2) contains supplementary material, which is available to authorized users.