Cargando…
Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts?
BACKGROUND: Patient-derived tumor xenografts in mice are widely used in cancer research and have become important in developing personalized therapies. When these xenografts are subject to DNA sequencing, the samples could contain various amounts of mouse DNA. It has been unclear how the mouse reads...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4326289/ https://www.ncbi.nlm.nih.gov/pubmed/25539684 http://dx.doi.org/10.1186/1471-2164-15-1172 |
_version_ | 1782356904270364672 |
---|---|
author | Tso, Kai-Yuen Lee, Sau Dan Lo, Kwok-Wai Yip, Kevin Y |
author_facet | Tso, Kai-Yuen Lee, Sau Dan Lo, Kwok-Wai Yip, Kevin Y |
author_sort | Tso, Kai-Yuen |
collection | PubMed |
description | BACKGROUND: Patient-derived tumor xenografts in mice are widely used in cancer research and have become important in developing personalized therapies. When these xenografts are subject to DNA sequencing, the samples could contain various amounts of mouse DNA. It has been unclear how the mouse reads would affect data analyses. We conducted comprehensive simulations to compare three alignment strategies at different mutation rates, read lengths, sequencing error rates, human-mouse mixing ratios and sequenced regions. We also sequenced a nasopharyngeal carcinoma xenograft and a cell line to test how the strategies work on real data. RESULTS: We found the "filtering" and "combined reference" strategies performed better than aligning reads directly to human reference in terms of alignment and variant calling accuracies. The combined reference strategy was particularly good at reducing false negative variants calls without significantly increasing the false positive rate. In some scenarios the performance gain of these two special handling strategies was too small for special handling to be cost-effective, but it was found crucial when false non-synonymous SNVs should be minimized, especially in exome sequencing. CONCLUSIONS: Our study systematically analyzes the effects of mouse contamination in the sequencing data of human-in-mouse xenografts. Our findings provide information for designing data analysis pipelines for these data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-1172) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4326289 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43262892015-02-14 Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts? Tso, Kai-Yuen Lee, Sau Dan Lo, Kwok-Wai Yip, Kevin Y BMC Genomics Methodology Article BACKGROUND: Patient-derived tumor xenografts in mice are widely used in cancer research and have become important in developing personalized therapies. When these xenografts are subject to DNA sequencing, the samples could contain various amounts of mouse DNA. It has been unclear how the mouse reads would affect data analyses. We conducted comprehensive simulations to compare three alignment strategies at different mutation rates, read lengths, sequencing error rates, human-mouse mixing ratios and sequenced regions. We also sequenced a nasopharyngeal carcinoma xenograft and a cell line to test how the strategies work on real data. RESULTS: We found the "filtering" and "combined reference" strategies performed better than aligning reads directly to human reference in terms of alignment and variant calling accuracies. The combined reference strategy was particularly good at reducing false negative variants calls without significantly increasing the false positive rate. In some scenarios the performance gain of these two special handling strategies was too small for special handling to be cost-effective, but it was found crucial when false non-synonymous SNVs should be minimized, especially in exome sequencing. CONCLUSIONS: Our study systematically analyzes the effects of mouse contamination in the sequencing data of human-in-mouse xenografts. Our findings provide information for designing data analysis pipelines for these data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-1172) contains supplementary material, which is available to authorized users. BioMed Central 2014-12-23 /pmc/articles/PMC4326289/ /pubmed/25539684 http://dx.doi.org/10.1186/1471-2164-15-1172 Text en © Tso et al.; licensee BioMed Central. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Tso, Kai-Yuen Lee, Sau Dan Lo, Kwok-Wai Yip, Kevin Y Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts? |
title | Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts? |
title_full | Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts? |
title_fullStr | Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts? |
title_full_unstemmed | Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts? |
title_short | Are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts? |
title_sort | are special read alignment strategies necessary and cost-effective when handling sequencing reads from patient-derived tumor xenografts? |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4326289/ https://www.ncbi.nlm.nih.gov/pubmed/25539684 http://dx.doi.org/10.1186/1471-2164-15-1172 |
work_keys_str_mv | AT tsokaiyuen arespecialreadalignmentstrategiesnecessaryandcosteffectivewhenhandlingsequencingreadsfrompatientderivedtumorxenografts AT leesaudan arespecialreadalignmentstrategiesnecessaryandcosteffectivewhenhandlingsequencingreadsfrompatientderivedtumorxenografts AT lokwokwai arespecialreadalignmentstrategiesnecessaryandcosteffectivewhenhandlingsequencingreadsfrompatientderivedtumorxenografts AT yipkeviny arespecialreadalignmentstrategiesnecessaryandcosteffectivewhenhandlingsequencingreadsfrompatientderivedtumorxenografts |