Cargando…

Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA

The confluence of deep sequencing and powerful machine learning is providing an unprecedented peek at the darkest of the dark genomic matter, the non-coding genomic regions lacking any functional annotation. While deep sequencing uncovers rare tumor variants, the heterogeneity of the disease confoun...

Descripción completa

Detalles Bibliográficos
Autores principales: Parida, Laxmi, Haferlach, Claudia, Rhrissorrakrai, Kahn, Utro, Filippo, Levovitz, Chaya, Kern, Wolfgang, Nadarajah, Niroshan, Twardziok, Sven, Hutter, Stephan, Meggendorfer, Manja, Walter, Wencke, Baer, Constance, Haferlach, Torsten
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6742441/
https://www.ncbi.nlm.nih.gov/pubmed/31469830
http://dx.doi.org/10.1371/journal.pcbi.1007332
_version_ 1783451111761379328
author Parida, Laxmi
Haferlach, Claudia
Rhrissorrakrai, Kahn
Utro, Filippo
Levovitz, Chaya
Kern, Wolfgang
Nadarajah, Niroshan
Twardziok, Sven
Hutter, Stephan
Meggendorfer, Manja
Walter, Wencke
Baer, Constance
Haferlach, Torsten
author_facet Parida, Laxmi
Haferlach, Claudia
Rhrissorrakrai, Kahn
Utro, Filippo
Levovitz, Chaya
Kern, Wolfgang
Nadarajah, Niroshan
Twardziok, Sven
Hutter, Stephan
Meggendorfer, Manja
Walter, Wencke
Baer, Constance
Haferlach, Torsten
author_sort Parida, Laxmi
collection PubMed
description The confluence of deep sequencing and powerful machine learning is providing an unprecedented peek at the darkest of the dark genomic matter, the non-coding genomic regions lacking any functional annotation. While deep sequencing uncovers rare tumor variants, the heterogeneity of the disease confounds the best of machine learning (ML) algorithms. Here we set out to answer if the dark-matter of the genome encompass signals that can distinguish the fine subtypes of disease that are otherwise genomically indistinguishable. We introduce a novel stochastic regularization, ReVeaL, that empowers ML to discriminate subtle cancer subtypes even from the same ‘cell of origin’. Analogous to heritability, implicitly defined on whole genome, we use predictability (F(1) score) definable on portions of the genome. In an effort to distinguish cancer subtypes using dark-matter DNA, we applied ReVeaL to a new WGS dataset from 727 patient samples with seven forms of hematological cancers and assessed the predictivity over several genomic regions including genic, non-dark, non-coding, non-genic, and dark. ReVeaL enabled improved discrimination of cancer subtypes for all segments of the genome. The non-genic, non-coding and dark-matter had the highest F(1) scores, with dark-matter having the highest level of predictability. Based on ReVeaL’s predictability of different genomic regions, dark-matter contains enough signal to significantly discriminate fine subtypes of disease. Hence, the agglomeration of rare variants, even in the hitherto unannotated and ill-understood regions of the genome, may play a substantial role in the disease etiology and deserve much more attention.
format Online
Article
Text
id pubmed-6742441
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-67424412019-09-20 Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA Parida, Laxmi Haferlach, Claudia Rhrissorrakrai, Kahn Utro, Filippo Levovitz, Chaya Kern, Wolfgang Nadarajah, Niroshan Twardziok, Sven Hutter, Stephan Meggendorfer, Manja Walter, Wencke Baer, Constance Haferlach, Torsten PLoS Comput Biol Research Article The confluence of deep sequencing and powerful machine learning is providing an unprecedented peek at the darkest of the dark genomic matter, the non-coding genomic regions lacking any functional annotation. While deep sequencing uncovers rare tumor variants, the heterogeneity of the disease confounds the best of machine learning (ML) algorithms. Here we set out to answer if the dark-matter of the genome encompass signals that can distinguish the fine subtypes of disease that are otherwise genomically indistinguishable. We introduce a novel stochastic regularization, ReVeaL, that empowers ML to discriminate subtle cancer subtypes even from the same ‘cell of origin’. Analogous to heritability, implicitly defined on whole genome, we use predictability (F(1) score) definable on portions of the genome. In an effort to distinguish cancer subtypes using dark-matter DNA, we applied ReVeaL to a new WGS dataset from 727 patient samples with seven forms of hematological cancers and assessed the predictivity over several genomic regions including genic, non-dark, non-coding, non-genic, and dark. ReVeaL enabled improved discrimination of cancer subtypes for all segments of the genome. The non-genic, non-coding and dark-matter had the highest F(1) scores, with dark-matter having the highest level of predictability. Based on ReVeaL’s predictability of different genomic regions, dark-matter contains enough signal to significantly discriminate fine subtypes of disease. Hence, the agglomeration of rare variants, even in the hitherto unannotated and ill-understood regions of the genome, may play a substantial role in the disease etiology and deserve much more attention. Public Library of Science 2019-08-30 /pmc/articles/PMC6742441/ /pubmed/31469830 http://dx.doi.org/10.1371/journal.pcbi.1007332 Text en © 2019 Parida et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Parida, Laxmi
Haferlach, Claudia
Rhrissorrakrai, Kahn
Utro, Filippo
Levovitz, Chaya
Kern, Wolfgang
Nadarajah, Niroshan
Twardziok, Sven
Hutter, Stephan
Meggendorfer, Manja
Walter, Wencke
Baer, Constance
Haferlach, Torsten
Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA
title Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA
title_full Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA
title_fullStr Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA
title_full_unstemmed Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA
title_short Dark-matter matters: Discriminating subtle blood cancers using the darkest DNA
title_sort dark-matter matters: discriminating subtle blood cancers using the darkest dna
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6742441/
https://www.ncbi.nlm.nih.gov/pubmed/31469830
http://dx.doi.org/10.1371/journal.pcbi.1007332
work_keys_str_mv AT paridalaxmi darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna
AT haferlachclaudia darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna
AT rhrissorrakraikahn darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna
AT utrofilippo darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna
AT levovitzchaya darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna
AT kernwolfgang darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna
AT nadarajahniroshan darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna
AT twardzioksven darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna
AT hutterstephan darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna
AT meggendorfermanja darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna
AT walterwencke darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna
AT baerconstance darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna
AT haferlachtorsten darkmattermattersdiscriminatingsubtlebloodcancersusingthedarkestdna