Cargando…
Computational approach to discriminate human and mouse sequences in patient-derived tumour xenografts
BACKGROUND: Patient-Derived Tumour Xenografts (PDTXs) have emerged as the pre-clinical models that best represent clinical tumour diversity and intra-tumour heterogeneity. The molecular characterization of PDTXs using High-Throughput Sequencing (HTS) is essential; however, the presence of mouse stro...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5755132/ https://www.ncbi.nlm.nih.gov/pubmed/29304755 http://dx.doi.org/10.1186/s12864-017-4414-y |
_version_ | 1783290548524679168 |
---|---|
author | Callari, Maurizio Batra, Ankita Sati Batra, Rajbir Nath Sammut, Stephen-John Greenwood, Wendy Clifford, Harry Hercus, Colin Chin, Suet-Feung Bruna, Alejandra Rueda, Oscar M. Caldas, Carlos |
author_facet | Callari, Maurizio Batra, Ankita Sati Batra, Rajbir Nath Sammut, Stephen-John Greenwood, Wendy Clifford, Harry Hercus, Colin Chin, Suet-Feung Bruna, Alejandra Rueda, Oscar M. Caldas, Carlos |
author_sort | Callari, Maurizio |
collection | PubMed |
description | BACKGROUND: Patient-Derived Tumour Xenografts (PDTXs) have emerged as the pre-clinical models that best represent clinical tumour diversity and intra-tumour heterogeneity. The molecular characterization of PDTXs using High-Throughput Sequencing (HTS) is essential; however, the presence of mouse stroma is challenging for HTS data analysis. Indeed, the high homology between the two genomes results in a proportion of mouse reads being mapped as human. RESULTS: In this study we generated Whole Exome Sequencing (WES), Reduced Representation Bisulfite Sequencing (RRBS) and RNA sequencing (RNA-seq) data from samples with known mixtures of mouse and human DNA or RNA and from a cohort of human breast cancers and their derived PDTXs. We show that using an In silico Combined human-mouse Reference Genome (ICRG) for alignment discriminates between human and mouse reads with up to 99.9% accuracy and decreases the number of false positive somatic mutations caused by misalignment by >99.9%. We also derived a model to estimate the human DNA content in independent PDTX samples. For RNA-seq and RRBS data analysis, the use of the ICRG allows dissecting computationally the transcriptome and methylome of human tumour cells and mouse stroma. In a direct comparison with previously reported approaches, our method showed similar or higher accuracy while requiring significantly less computing time. CONCLUSIONS: The computational pipeline we describe here is a valuable tool for the molecular analysis of PDTXs as well as any other mixture of DNA or RNA species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-017-4414-y) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5755132 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-57551322018-01-08 Computational approach to discriminate human and mouse sequences in patient-derived tumour xenografts Callari, Maurizio Batra, Ankita Sati Batra, Rajbir Nath Sammut, Stephen-John Greenwood, Wendy Clifford, Harry Hercus, Colin Chin, Suet-Feung Bruna, Alejandra Rueda, Oscar M. Caldas, Carlos BMC Genomics Methodology Article BACKGROUND: Patient-Derived Tumour Xenografts (PDTXs) have emerged as the pre-clinical models that best represent clinical tumour diversity and intra-tumour heterogeneity. The molecular characterization of PDTXs using High-Throughput Sequencing (HTS) is essential; however, the presence of mouse stroma is challenging for HTS data analysis. Indeed, the high homology between the two genomes results in a proportion of mouse reads being mapped as human. RESULTS: In this study we generated Whole Exome Sequencing (WES), Reduced Representation Bisulfite Sequencing (RRBS) and RNA sequencing (RNA-seq) data from samples with known mixtures of mouse and human DNA or RNA and from a cohort of human breast cancers and their derived PDTXs. We show that using an In silico Combined human-mouse Reference Genome (ICRG) for alignment discriminates between human and mouse reads with up to 99.9% accuracy and decreases the number of false positive somatic mutations caused by misalignment by >99.9%. We also derived a model to estimate the human DNA content in independent PDTX samples. For RNA-seq and RRBS data analysis, the use of the ICRG allows dissecting computationally the transcriptome and methylome of human tumour cells and mouse stroma. In a direct comparison with previously reported approaches, our method showed similar or higher accuracy while requiring significantly less computing time. CONCLUSIONS: The computational pipeline we describe here is a valuable tool for the molecular analysis of PDTXs as well as any other mixture of DNA or RNA species. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-017-4414-y) contains supplementary material, which is available to authorized users. BioMed Central 2018-01-05 /pmc/articles/PMC5755132/ /pubmed/29304755 http://dx.doi.org/10.1186/s12864-017-4414-y Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Callari, Maurizio Batra, Ankita Sati Batra, Rajbir Nath Sammut, Stephen-John Greenwood, Wendy Clifford, Harry Hercus, Colin Chin, Suet-Feung Bruna, Alejandra Rueda, Oscar M. Caldas, Carlos Computational approach to discriminate human and mouse sequences in patient-derived tumour xenografts |
title | Computational approach to discriminate human and mouse sequences in patient-derived tumour xenografts |
title_full | Computational approach to discriminate human and mouse sequences in patient-derived tumour xenografts |
title_fullStr | Computational approach to discriminate human and mouse sequences in patient-derived tumour xenografts |
title_full_unstemmed | Computational approach to discriminate human and mouse sequences in patient-derived tumour xenografts |
title_short | Computational approach to discriminate human and mouse sequences in patient-derived tumour xenografts |
title_sort | computational approach to discriminate human and mouse sequences in patient-derived tumour xenografts |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5755132/ https://www.ncbi.nlm.nih.gov/pubmed/29304755 http://dx.doi.org/10.1186/s12864-017-4414-y |
work_keys_str_mv | AT callarimaurizio computationalapproachtodiscriminatehumanandmousesequencesinpatientderivedtumourxenografts AT batraankitasati computationalapproachtodiscriminatehumanandmousesequencesinpatientderivedtumourxenografts AT batrarajbirnath computationalapproachtodiscriminatehumanandmousesequencesinpatientderivedtumourxenografts AT sammutstephenjohn computationalapproachtodiscriminatehumanandmousesequencesinpatientderivedtumourxenografts AT greenwoodwendy computationalapproachtodiscriminatehumanandmousesequencesinpatientderivedtumourxenografts AT cliffordharry computationalapproachtodiscriminatehumanandmousesequencesinpatientderivedtumourxenografts AT hercuscolin computationalapproachtodiscriminatehumanandmousesequencesinpatientderivedtumourxenografts AT chinsuetfeung computationalapproachtodiscriminatehumanandmousesequencesinpatientderivedtumourxenografts AT brunaalejandra computationalapproachtodiscriminatehumanandmousesequencesinpatientderivedtumourxenografts AT ruedaoscarm computationalapproachtodiscriminatehumanandmousesequencesinpatientderivedtumourxenografts AT caldascarlos computationalapproachtodiscriminatehumanandmousesequencesinpatientderivedtumourxenografts |