Cargando…

Identification of 12 cancer types through genome deep learning

Cancer is a major cause of death worldwide, and an early diagnosis is required for a favorable prognosis. Histological examination is the gold standard for cancer identification; however, large amount of inter-observer variability exists in histological diagnosis. Numerous studies have shown cancer...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Yingshuai, Zhu, Sitao, Ma, Kailong, Liu, Weiqing, Yue, Yao, Hu, Gang, Lu, Huifang, Chen, Wenbin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6872744/
https://www.ncbi.nlm.nih.gov/pubmed/31754222
http://dx.doi.org/10.1038/s41598-019-53989-3
_version_ 1783472553436643328
author Sun, Yingshuai
Zhu, Sitao
Ma, Kailong
Liu, Weiqing
Yue, Yao
Hu, Gang
Lu, Huifang
Chen, Wenbin
author_facet Sun, Yingshuai
Zhu, Sitao
Ma, Kailong
Liu, Weiqing
Yue, Yao
Hu, Gang
Lu, Huifang
Chen, Wenbin
author_sort Sun, Yingshuai
collection PubMed
description Cancer is a major cause of death worldwide, and an early diagnosis is required for a favorable prognosis. Histological examination is the gold standard for cancer identification; however, large amount of inter-observer variability exists in histological diagnosis. Numerous studies have shown cancer genesis is accompanied by an accumulation of harmful mutations, potentiating the identification of cancer based on genomic information. We have proposed a method, GDL (genome deep learning), to study the relationship between genomic variations and traits based on deep neural networks. We analyzed 6,083 samples’ WES (Whole Exon Sequencing) mutations files from 12 cancer types obtained from the TCGA (The Cancer Genome Atlas) and 1,991 healthy samples’ WES data from the 1000 Genomes project. We constructed 12 specific models to distinguish between certain type of cancer and healthy tissues, a total-specific model that can identify healthy and cancer tissues, and a mixture model to distinguish between all 12 types of cancer based on GDL. We demonstrate that the accuracy of specific, mixture and total specific model are 97.47%, 70.08% and 94.70% for cancer identification. We developed an efficient method for the identification of cancer based on genomic information that offers a new direction for disease diagnosis.
format Online
Article
Text
id pubmed-6872744
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-68727442019-12-04 Identification of 12 cancer types through genome deep learning Sun, Yingshuai Zhu, Sitao Ma, Kailong Liu, Weiqing Yue, Yao Hu, Gang Lu, Huifang Chen, Wenbin Sci Rep Article Cancer is a major cause of death worldwide, and an early diagnosis is required for a favorable prognosis. Histological examination is the gold standard for cancer identification; however, large amount of inter-observer variability exists in histological diagnosis. Numerous studies have shown cancer genesis is accompanied by an accumulation of harmful mutations, potentiating the identification of cancer based on genomic information. We have proposed a method, GDL (genome deep learning), to study the relationship between genomic variations and traits based on deep neural networks. We analyzed 6,083 samples’ WES (Whole Exon Sequencing) mutations files from 12 cancer types obtained from the TCGA (The Cancer Genome Atlas) and 1,991 healthy samples’ WES data from the 1000 Genomes project. We constructed 12 specific models to distinguish between certain type of cancer and healthy tissues, a total-specific model that can identify healthy and cancer tissues, and a mixture model to distinguish between all 12 types of cancer based on GDL. We demonstrate that the accuracy of specific, mixture and total specific model are 97.47%, 70.08% and 94.70% for cancer identification. We developed an efficient method for the identification of cancer based on genomic information that offers a new direction for disease diagnosis. Nature Publishing Group UK 2019-11-21 /pmc/articles/PMC6872744/ /pubmed/31754222 http://dx.doi.org/10.1038/s41598-019-53989-3 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Sun, Yingshuai
Zhu, Sitao
Ma, Kailong
Liu, Weiqing
Yue, Yao
Hu, Gang
Lu, Huifang
Chen, Wenbin
Identification of 12 cancer types through genome deep learning
title Identification of 12 cancer types through genome deep learning
title_full Identification of 12 cancer types through genome deep learning
title_fullStr Identification of 12 cancer types through genome deep learning
title_full_unstemmed Identification of 12 cancer types through genome deep learning
title_short Identification of 12 cancer types through genome deep learning
title_sort identification of 12 cancer types through genome deep learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6872744/
https://www.ncbi.nlm.nih.gov/pubmed/31754222
http://dx.doi.org/10.1038/s41598-019-53989-3
work_keys_str_mv AT sunyingshuai identificationof12cancertypesthroughgenomedeeplearning
AT zhusitao identificationof12cancertypesthroughgenomedeeplearning
AT makailong identificationof12cancertypesthroughgenomedeeplearning
AT liuweiqing identificationof12cancertypesthroughgenomedeeplearning
AT yueyao identificationof12cancertypesthroughgenomedeeplearning
AT hugang identificationof12cancertypesthroughgenomedeeplearning
AT luhuifang identificationof12cancertypesthroughgenomedeeplearning
AT chenwenbin identificationof12cancertypesthroughgenomedeeplearning