Cargando…
Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA
The confluence of deep sequencing and powerful machine learning is providing an unprecedented peek at the darkest of the dark genomic matter, the non-coding genomic regions lacking any functional annotation. While deep sequencing uncovers rare tumor variants, the heterogeneity of the disease confoun...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6742441/ https://www.ncbi.nlm.nih.gov/pubmed/31469830 http://dx.doi.org/10.1371/journal.pcbi.1007332 |
_version_ | 1783451111761379328 |
---|---|
author | Parida, Laxmi Haferlach, Claudia Rhrissorrakrai, Kahn Utro, Filippo Levovitz, Chaya Kern, Wolfgang Nadarajah, Niroshan Twardziok, Sven Hutter, Stephan Meggendorfer, Manja Walter, Wencke Baer, Constance Haferlach, Torsten |
author_facet | Parida, Laxmi Haferlach, Claudia Rhrissorrakrai, Kahn Utro, Filippo Levovitz, Chaya Kern, Wolfgang Nadarajah, Niroshan Twardziok, Sven Hutter, Stephan Meggendorfer, Manja Walter, Wencke Baer, Constance Haferlach, Torsten |
author_sort | Parida, Laxmi |
collection | PubMed |
description | The confluence of deep sequencing and powerful machine learning is providing an unprecedented peek at the darkest of the dark genomic matter, the non-coding genomic regions lacking any functional annotation. While deep sequencing uncovers rare tumor variants, the heterogeneity of the disease confounds the best of machine learning (ML) algorithms. Here we set out to answer if the dark-matter of the genome encompass signals that can distinguish the fine subtypes of disease that are otherwise genomically indistinguishable. We introduce a novel stochastic regularization, ReVeaL, that empowers ML to discriminate subtle cancer subtypes even from the same ‘cell of origin’. Analogous to heritability, implicitly defined on whole genome, we use predictability (F(1) score) definable on portions of the genome. In an effort to distinguish cancer subtypes using dark-matter DNA, we applied ReVeaL to a new WGS dataset from 727 patient samples with seven forms of hematological cancers and assessed the predictivity over several genomic regions including genic, non-dark, non-coding, non-genic, and dark. ReVeaL enabled improved discrimination of cancer subtypes for all segments of the genome. The non-genic, non-coding and dark-matter had the highest F(1) scores, with dark-matter having the highest level of predictability. Based on ReVeaL’s predictability of different genomic regions, dark-matter contains enough signal to significantly discriminate fine subtypes of disease. Hence, the agglomeration of rare variants, even in the hitherto unannotated and ill-understood regions of the genome, may play a substantial role in the disease etiology and deserve much more attention. |
format | Online Article Text |
id | pubmed-6742441 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-67424412019-09-20 Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA Parida, Laxmi Haferlach, Claudia Rhrissorrakrai, Kahn Utro, Filippo Levovitz, Chaya Kern, Wolfgang Nadarajah, Niroshan Twardziok, Sven Hutter, Stephan Meggendorfer, Manja Walter, Wencke Baer, Constance Haferlach, Torsten PLoS Comput Biol Research Article The confluence of deep sequencing and powerful machine learning is providing an unprecedented peek at the darkest of the dark genomic matter, the non-coding genomic regions lacking any functional annotation. While deep sequencing uncovers rare tumor variants, the heterogeneity of the disease confounds the best of machine learning (ML) algorithms. Here we set out to answer if the dark-matter of the genome encompass signals that can distinguish the fine subtypes of disease that are otherwise genomically indistinguishable. We introduce a novel stochastic regularization, ReVeaL, that empowers ML to discriminate subtle cancer subtypes even from the same ‘cell of origin’. Analogous to heritability, implicitly defined on whole genome, we use predictability (F(1) score) definable on portions of the genome. In an effort to distinguish cancer subtypes using dark-matter DNA, we applied ReVeaL to a new WGS dataset from 727 patient samples with seven forms of hematological cancers and assessed the predictivity over several genomic regions including genic, non-dark, non-coding, non-genic, and dark. ReVeaL enabled improved discrimination of cancer subtypes for all segments of the genome. The non-genic, non-coding and dark-matter had the highest F(1) scores, with dark-matter having the highest level of predictability. Based on ReVeaL’s predictability of different genomic regions, dark-matter contains enough signal to significantly discriminate fine subtypes of disease. Hence, the agglomeration of rare variants, even in the hitherto unannotated and ill-understood regions of the genome, may play a substantial role in the disease etiology and deserve much more attention. Public Library of Science 2019-08-30 /pmc/articles/PMC6742441/ /pubmed/31469830 http://dx.doi.org/10.1371/journal.pcbi.1007332 Text en © 2019 Parida et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Parida, Laxmi Haferlach, Claudia Rhrissorrakrai, Kahn Utro, Filippo Levovitz, Chaya Kern, Wolfgang Nadarajah, Niroshan Twardziok, Sven Hutter, Stephan Meggendorfer, Manja Walter, Wencke Baer, Constance Haferlach, Torsten Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA |
title | Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA |
title_full | Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA |
title_fullStr | Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA |
title_full_unstemmed | Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA |
title_short | Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA |
title_sort | dark-matter matters: discriminating subtle blood cancers using the darkest dna |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6742441/ https://www.ncbi.nlm.nih.gov/pubmed/31469830 http://dx.doi.org/10.1371/journal.pcbi.1007332 |
work_keys_str_mv | AT paridalaxmi darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna AT haferlachclaudia darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna AT rhrissorrakraikahn darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna AT utrofilippo darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna AT levovitzchaya darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna AT kernwolfgang darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna AT nadarajahniroshan darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna AT twardzioksven darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna AT hutterstephan darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna AT meggendorfermanja darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna AT walterwencke darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna AT baerconstance darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna AT haferlachtorsten darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna |