Cargando…

Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts?

BACKGROUND: Patient-derived tumor xenografts in mice are widely used in cancer research and have become important in developing personalized therapies. When these xenografts are subject to DNA sequencing, the samples could contain various amounts of mouse DNA. It has been unclear how the mouse reads...

Descripción completa

Detalles Bibliográficos
Autores principales: Tso, Kai-Yuen, Lee, Sau Dan, Lo, Kwok-Wai, Yip, Kevin Y
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4326289/
https://www.ncbi.nlm.nih.gov/pubmed/25539684
http://dx.doi.org/10.1186/1471-2164-15-1172
_version_ 1782356904270364672
author Tso, Kai-Yuen
Lee, Sau Dan
Lo, Kwok-Wai
Yip, Kevin Y
author_facet Tso, Kai-Yuen
Lee, Sau Dan
Lo, Kwok-Wai
Yip, Kevin Y
author_sort Tso, Kai-Yuen
collection PubMed
description BACKGROUND: Patient-derived tumor xenografts in mice are widely used in cancer research and have become important in developing personalized therapies. When these xenografts are subject to DNA sequencing, the samples could contain various amounts of mouse DNA. It has been unclear how the mouse reads would affect data analyses. We conducted comprehensive simulations to compare three alignment strategies at different mutation rates, read lengths, sequencing error rates, human-mouse mixing ratios and sequenced regions. We also sequenced a nasopharyngeal carcinoma xenograft and a cell line to test how the strategies work on real data. RESULTS: We found the "filtering" and "combined reference" strategies performed better than aligning reads directly to human reference in terms of alignment and variant calling accuracies. The combined reference strategy was particularly good at reducing false negative variants calls without significantly increasing the false positive rate. In some scenarios the performance gain of these two special handling strategies was too small for special handling to be cost-effective, but it was found crucial when false non-synonymous SNVs should be minimized, especially in exome sequencing. CONCLUSIONS: Our study systematically analyzes the effects of mouse contamination in the sequencing data of human-in-mouse xenografts. Our findings provide information for designing data analysis pipelines for these data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-1172) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4326289
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43262892015-02-14 Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts? Tso, Kai-Yuen Lee, Sau Dan Lo, Kwok-Wai Yip, Kevin Y BMC Genomics Methodology Article BACKGROUND: Patient-derived tumor xenografts in mice are widely used in cancer research and have become important in developing personalized therapies. When these xenografts are subject to DNA sequencing, the samples could contain various amounts of mouse DNA. It has been unclear how the mouse reads would affect data analyses. We conducted comprehensive simulations to compare three alignment strategies at different mutation rates, read lengths, sequencing error rates, human-mouse mixing ratios and sequenced regions. We also sequenced a nasopharyngeal carcinoma xenograft and a cell line to test how the strategies work on real data. RESULTS: We found the "filtering" and "combined reference" strategies performed better than aligning reads directly to human reference in terms of alignment and variant calling accuracies. The combined reference strategy was particularly good at reducing false negative variants calls without significantly increasing the false positive rate. In some scenarios the performance gain of these two special handling strategies was too small for special handling to be cost-effective, but it was found crucial when false non-synonymous SNVs should be minimized, especially in exome sequencing. CONCLUSIONS: Our study systematically analyzes the effects of mouse contamination in the sequencing data of human-in-mouse xenografts. Our findings provide information for designing data analysis pipelines for these data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-1172) contains supplementary material, which is available to authorized users. BioMed Central 2014-12-23 /pmc/articles/PMC4326289/ /pubmed/25539684 http://dx.doi.org/10.1186/1471-2164-15-1172 Text en © Tso et al.; licensee BioMed Central. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Tso, Kai-Yuen
Lee, Sau Dan
Lo, Kwok-Wai
Yip, Kevin Y
Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts?
title Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts?
title_full Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts?
title_fullStr Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts?
title_full_unstemmed Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts?
title_short Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts?
title_sort are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts?
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4326289/
https://www.ncbi.nlm.nih.gov/pubmed/25539684
http://dx.doi.org/10.1186/1471-2164-15-1172
work_keys_str_mv AT tsokaiyuen arespecialreadalignmentstrategiesnecessaryandcosteffectivewhenhandlingsequencingreadsfrompatientderivedtumorxenografts
AT leesaudan arespecialreadalignmentstrategiesnecessaryandcosteffectivewhenhandlingsequencingreadsfrompatientderivedtumorxenografts
AT lokwokwai arespecialreadalignmentstrategiesnecessaryandcosteffectivewhenhandlingsequencingreadsfrompatientderivedtumorxenografts
AT yipkeviny arespecialreadalignmentstrategiesnecessaryandcosteffectivewhenhandlingsequencingreadsfrompatientderivedtumorxenografts