Cargando…

Cancer Gene Prioritization for Targeted Resequencing Using FitSNP Scores

BACKGROUND: Although the throughput of next generation sequencing is increasing and at the same time the cost is substantially reduced, for the majority of laboratories whole genome sequencing of large cohorts of cancer samples is still not feasible. In addition, the low number of genomes that are b...

Descripción completa

Detalles Bibliográficos
Autores principales: Fieuw, Annelies, De Wilde, Bram, Speleman, Frank, Vandesompele, Jo, De Preter, Katleen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3291573/
https://www.ncbi.nlm.nih.gov/pubmed/22396732
http://dx.doi.org/10.1371/journal.pone.0031333
_version_ 1782225155528851456
author Fieuw, Annelies
De Wilde, Bram
Speleman, Frank
Vandesompele, Jo
De Preter, Katleen
author_facet Fieuw, Annelies
De Wilde, Bram
Speleman, Frank
Vandesompele, Jo
De Preter, Katleen
author_sort Fieuw, Annelies
collection PubMed
description BACKGROUND: Although the throughput of next generation sequencing is increasing and at the same time the cost is substantially reduced, for the majority of laboratories whole genome sequencing of large cohorts of cancer samples is still not feasible. In addition, the low number of genomes that are being sequenced is often problematic for the downstream interpretation of the significance of the variants. Targeted resequencing can partially circumvent this problem; by focusing on a limited number of candidate cancer genes to sequence, more samples can be included in the screening, hence resulting in substantial improvement of the statistical power. In this study, a successful strategy for prioritizing candidate genes for targeted resequencing of cancer genomes is presented. RESULTS: Four prioritization strategies were evaluated on six different cancer types: genes were ranked using these strategies, and the positive predictive value (PPV) or mutation rate within the top-ranked genes was compared to the baseline mutation rate in each tumor type. Successful strategies generate gene lists in which the top is enriched for known mutated genes, as evidenced by an increase in PPV. A clear example of such an improvement is seen in colon cancer, where the PPV is increased by 2.3 fold compared to the baseline level when 100 top fitSNP genes are sequenced. CONCLUSIONS: A gene prioritization strategy based on the fitSNP scores appears to be most successful in identifying mutated cancer genes across different tumor entities, with variance of gene expression levels as a good second best.
format Online
Article
Text
id pubmed-3291573
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32915732012-03-06 Cancer Gene Prioritization for Targeted Resequencing Using FitSNP Scores Fieuw, Annelies De Wilde, Bram Speleman, Frank Vandesompele, Jo De Preter, Katleen PLoS One Research Article BACKGROUND: Although the throughput of next generation sequencing is increasing and at the same time the cost is substantially reduced, for the majority of laboratories whole genome sequencing of large cohorts of cancer samples is still not feasible. In addition, the low number of genomes that are being sequenced is often problematic for the downstream interpretation of the significance of the variants. Targeted resequencing can partially circumvent this problem; by focusing on a limited number of candidate cancer genes to sequence, more samples can be included in the screening, hence resulting in substantial improvement of the statistical power. In this study, a successful strategy for prioritizing candidate genes for targeted resequencing of cancer genomes is presented. RESULTS: Four prioritization strategies were evaluated on six different cancer types: genes were ranked using these strategies, and the positive predictive value (PPV) or mutation rate within the top-ranked genes was compared to the baseline mutation rate in each tumor type. Successful strategies generate gene lists in which the top is enriched for known mutated genes, as evidenced by an increase in PPV. A clear example of such an improvement is seen in colon cancer, where the PPV is increased by 2.3 fold compared to the baseline level when 100 top fitSNP genes are sequenced. CONCLUSIONS: A gene prioritization strategy based on the fitSNP scores appears to be most successful in identifying mutated cancer genes across different tumor entities, with variance of gene expression levels as a good second best. Public Library of Science 2012-03-01 /pmc/articles/PMC3291573/ /pubmed/22396732 http://dx.doi.org/10.1371/journal.pone.0031333 Text en Fieuw et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Fieuw, Annelies
De Wilde, Bram
Speleman, Frank
Vandesompele, Jo
De Preter, Katleen
Cancer Gene Prioritization for Targeted Resequencing Using FitSNP Scores
title Cancer Gene Prioritization for Targeted Resequencing Using FitSNP Scores
title_full Cancer Gene Prioritization for Targeted Resequencing Using FitSNP Scores
title_fullStr Cancer Gene Prioritization for Targeted Resequencing Using FitSNP Scores
title_full_unstemmed Cancer Gene Prioritization for Targeted Resequencing Using FitSNP Scores
title_short Cancer Gene Prioritization for Targeted Resequencing Using FitSNP Scores
title_sort cancer gene prioritization for targeted resequencing using fitsnp scores
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3291573/
https://www.ncbi.nlm.nih.gov/pubmed/22396732
http://dx.doi.org/10.1371/journal.pone.0031333
work_keys_str_mv AT fieuwannelies cancergeneprioritizationfortargetedresequencingusingfitsnpscores
AT dewildebram cancergeneprioritizationfortargetedresequencingusingfitsnpscores
AT spelemanfrank cancergeneprioritizationfortargetedresequencingusingfitsnpscores
AT vandesompelejo cancergeneprioritizationfortargetedresequencingusingfitsnpscores
AT depreterkatleen cancergeneprioritizationfortargetedresequencingusingfitsnpscores