Cargando…

Mitochondrial genome sequence analysis: A custom bioinformatics pipeline substantially improves Affymetrix MitoChip v2.0 call rate and accuracy

BACKGROUND: Mitochondrial genome sequence analysis is critical to the diagnostic evaluation of mitochondrial disease. Existing methodologies differ widely in throughput, complexity, cost efficiency, and sensitivity of heteroplasmy detection. Affymetrix MitoChip v2.0, which uses a sequencing-by-genot...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Hongbo M, Perin, Juan C, Schurr, Theodore G, Dulik, Matthew C, Zhadanov, Sergey I, Baur, Joseph A, King, Michael P, Place, Emily, Clarke, Colleen, Grauer, Michael, Schug, Jonathan, Santani, Avni, Albano, Anthony, Kim, Cecilia, Procaccio, Vincent, Hakonarson, Hakon, Gai, Xiaowu, Falk, Marni J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3234255/
https://www.ncbi.nlm.nih.gov/pubmed/22011106
http://dx.doi.org/10.1186/1471-2105-12-402
_version_ 1782218494815764480
author Xie, Hongbo M
Perin, Juan C
Schurr, Theodore G
Dulik, Matthew C
Zhadanov, Sergey I
Baur, Joseph A
King, Michael P
Place, Emily
Clarke, Colleen
Grauer, Michael
Schug, Jonathan
Santani, Avni
Albano, Anthony
Kim, Cecilia
Procaccio, Vincent
Hakonarson, Hakon
Gai, Xiaowu
Falk, Marni J
author_facet Xie, Hongbo M
Perin, Juan C
Schurr, Theodore G
Dulik, Matthew C
Zhadanov, Sergey I
Baur, Joseph A
King, Michael P
Place, Emily
Clarke, Colleen
Grauer, Michael
Schug, Jonathan
Santani, Avni
Albano, Anthony
Kim, Cecilia
Procaccio, Vincent
Hakonarson, Hakon
Gai, Xiaowu
Falk, Marni J
author_sort Xie, Hongbo M
collection PubMed
description BACKGROUND: Mitochondrial genome sequence analysis is critical to the diagnostic evaluation of mitochondrial disease. Existing methodologies differ widely in throughput, complexity, cost efficiency, and sensitivity of heteroplasmy detection. Affymetrix MitoChip v2.0, which uses a sequencing-by-genotyping technology, allows potentially accurate and high-throughput sequencing of the entire human mitochondrial genome to be completed in a cost-effective fashion. However, the relatively low call rate achieved using existing software tools has limited the wide adoption of this platform for either clinical or research applications. Here, we report the design and development of a custom bioinformatics software pipeline that achieves a much improved call rate and accuracy for the Affymetrix MitoChip v2.0 platform. We used this custom pipeline to analyze MitoChip v2.0 data from 24 DNA samples representing a broad range of tissue types (18 whole blood, 3 skeletal muscle, 3 cell lines), mutations (a 5.8 kilobase pair deletion and 6 known heteroplasmic mutations), and haplogroup origins. All results were compared to those obtained by at least one other mitochondrial DNA sequence analysis method, including Sanger sequencing, denaturing HPLC-based heteroduplex analysis, and/or the Illumina Genome Analyzer II next generation sequencing platform. RESULTS: An average call rate of 99.75% was achieved across all samples with our custom pipeline. Comparison of calls for 15 samples characterized previously by Sanger sequencing revealed a total of 29 discordant calls, which translates to an estimated 0.012% for the base call error rate. We successfully identified 4 known heteroplasmic mutations and 24 other potential heteroplasmic mutations across 20 samples that passed quality control. CONCLUSIONS: Affymetrix MitoChip v2.0 analysis using our optimized MitoChip Filtering Protocol (MFP) bioinformatics pipeline now offers the high sensitivity and accuracy needed for reliable, high-throughput and cost-efficient whole mitochondrial genome sequencing. This approach provides a viable alternative of potential utility for both clinical diagnostic and research applications to traditional Sanger and other emerging sequencing technologies for whole mitochondrial genome analysis.
format Online
Article
Text
id pubmed-3234255
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32342552011-12-12 Mitochondrial genome sequence analysis: A custom bioinformatics pipeline substantially improves Affymetrix MitoChip v2.0 call rate and accuracy Xie, Hongbo M Perin, Juan C Schurr, Theodore G Dulik, Matthew C Zhadanov, Sergey I Baur, Joseph A King, Michael P Place, Emily Clarke, Colleen Grauer, Michael Schug, Jonathan Santani, Avni Albano, Anthony Kim, Cecilia Procaccio, Vincent Hakonarson, Hakon Gai, Xiaowu Falk, Marni J BMC Bioinformatics Software BACKGROUND: Mitochondrial genome sequence analysis is critical to the diagnostic evaluation of mitochondrial disease. Existing methodologies differ widely in throughput, complexity, cost efficiency, and sensitivity of heteroplasmy detection. Affymetrix MitoChip v2.0, which uses a sequencing-by-genotyping technology, allows potentially accurate and high-throughput sequencing of the entire human mitochondrial genome to be completed in a cost-effective fashion. However, the relatively low call rate achieved using existing software tools has limited the wide adoption of this platform for either clinical or research applications. Here, we report the design and development of a custom bioinformatics software pipeline that achieves a much improved call rate and accuracy for the Affymetrix MitoChip v2.0 platform. We used this custom pipeline to analyze MitoChip v2.0 data from 24 DNA samples representing a broad range of tissue types (18 whole blood, 3 skeletal muscle, 3 cell lines), mutations (a 5.8 kilobase pair deletion and 6 known heteroplasmic mutations), and haplogroup origins. All results were compared to those obtained by at least one other mitochondrial DNA sequence analysis method, including Sanger sequencing, denaturing HPLC-based heteroduplex analysis, and/or the Illumina Genome Analyzer II next generation sequencing platform. RESULTS: An average call rate of 99.75% was achieved across all samples with our custom pipeline. Comparison of calls for 15 samples characterized previously by Sanger sequencing revealed a total of 29 discordant calls, which translates to an estimated 0.012% for the base call error rate. We successfully identified 4 known heteroplasmic mutations and 24 other potential heteroplasmic mutations across 20 samples that passed quality control. CONCLUSIONS: Affymetrix MitoChip v2.0 analysis using our optimized MitoChip Filtering Protocol (MFP) bioinformatics pipeline now offers the high sensitivity and accuracy needed for reliable, high-throughput and cost-efficient whole mitochondrial genome sequencing. This approach provides a viable alternative of potential utility for both clinical diagnostic and research applications to traditional Sanger and other emerging sequencing technologies for whole mitochondrial genome analysis. BioMed Central 2011-10-19 /pmc/articles/PMC3234255/ /pubmed/22011106 http://dx.doi.org/10.1186/1471-2105-12-402 Text en Copyright © 2011 Xie et al; licensee BioMed Central Ltd. https://creativecommons.org/licenses/by/2.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0 (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Xie, Hongbo M
Perin, Juan C
Schurr, Theodore G
Dulik, Matthew C
Zhadanov, Sergey I
Baur, Joseph A
King, Michael P
Place, Emily
Clarke, Colleen
Grauer, Michael
Schug, Jonathan
Santani, Avni
Albano, Anthony
Kim, Cecilia
Procaccio, Vincent
Hakonarson, Hakon
Gai, Xiaowu
Falk, Marni J
Mitochondrial genome sequence analysis: A custom bioinformatics pipeline substantially improves Affymetrix MitoChip v2.0 call rate and accuracy
title Mitochondrial genome sequence analysis: A custom bioinformatics pipeline substantially improves Affymetrix MitoChip v2.0 call rate and accuracy
title_full Mitochondrial genome sequence analysis: A custom bioinformatics pipeline substantially improves Affymetrix MitoChip v2.0 call rate and accuracy
title_fullStr Mitochondrial genome sequence analysis: A custom bioinformatics pipeline substantially improves Affymetrix MitoChip v2.0 call rate and accuracy
title_full_unstemmed Mitochondrial genome sequence analysis: A custom bioinformatics pipeline substantially improves Affymetrix MitoChip v2.0 call rate and accuracy
title_short Mitochondrial genome sequence analysis: A custom bioinformatics pipeline substantially improves Affymetrix MitoChip v2.0 call rate and accuracy
title_sort mitochondrial genome sequence analysis: a custom bioinformatics pipeline substantially improves affymetrix mitochip v2.0 call rate and accuracy
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3234255/
https://www.ncbi.nlm.nih.gov/pubmed/22011106
http://dx.doi.org/10.1186/1471-2105-12-402
work_keys_str_mv AT xiehongbom mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT perinjuanc mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT schurrtheodoreg mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT dulikmatthewc mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT zhadanovsergeyi mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT baurjosepha mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT kingmichaelp mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT placeemily mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT clarkecolleen mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT grauermichael mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT schugjonathan mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT santaniavni mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT albanoanthony mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT kimcecilia mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT procacciovincent mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT hakonarsonhakon mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT gaixiaowu mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy
AT falkmarnij mitochondrialgenomesequenceanalysisacustombioinformaticspipelinesubstantiallyimprovesaffymetrixmitochipv20callrateandaccuracy