Cargando…

Evaluation of deep learning techniques for identification of sarcoma-causing carcinogenic mutations

The abnormal growth of human healthy cells is called cancer. One of the major types of cancer is sarcoma, mostly found in human bones and soft tissue cells. It commonly occurs in children. According to a survey of the United States of America, there are more than 17,000 sarcoma patients registered e...

Descripción completa

Detalles Bibliográficos
Autores principales: Shah, Asghar Ali, Alturise, Fahad, Alkhalifah, Tamim, Khan, Yaser Daanial
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9597026/
https://www.ncbi.nlm.nih.gov/pubmed/36312852
http://dx.doi.org/10.1177/20552076221133703
_version_ 1784815999784058880
author Shah, Asghar Ali
Alturise, Fahad
Alkhalifah, Tamim
Khan, Yaser Daanial
author_facet Shah, Asghar Ali
Alturise, Fahad
Alkhalifah, Tamim
Khan, Yaser Daanial
author_sort Shah, Asghar Ali
collection PubMed
description The abnormal growth of human healthy cells is called cancer. One of the major types of cancer is sarcoma, mostly found in human bones and soft tissue cells. It commonly occurs in children. According to a survey of the United States of America, there are more than 17,000 sarcoma patients registered each year which is 15% of all cancer cases. Recognition of cancer at its early stage saves many lives. The proposed study developed a framework for the early detection of human sarcoma cancer using deep learning Recurrent Neural Network (RNN) algorithms. The DNA of a human cell is made up of 25,000 to 30,000 genes. Each gene is represented by sequences of nucleotides. The nucleotides in a sequence of a driver gene can change which is termed as mutations. Some mutations can cause cancer. There are seven types of a gene whose mutation causes sarcoma cancer. The study uses the dataset which has been taken from more than 134 samples and includes 141 mutations in 8 driver genes. On these gene sequences RNN algorithms Long and Short-Term Memory (LSTM), Gated Recurrent Units and Bi-directional LSTM (Bi-LSTM) are used for training. Rigorous testing techniques such as Self-consistency testing, independent set testing, 10-fold cross-validation test are applied for the validation of results. These validation techniques yield several metrics such as Area Under the Curve (AUC), sensitivity, specificity, Mathew's correlation coefficient, loss, and accuracy. The proposed algorithm exhibits an accuracy of 99.6% with an AUC value of 1.00.
format Online
Article
Text
id pubmed-9597026
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-95970262022-10-27 Evaluation of deep learning techniques for identification of sarcoma-causing carcinogenic mutations Shah, Asghar Ali Alturise, Fahad Alkhalifah, Tamim Khan, Yaser Daanial Digit Health Original Research The abnormal growth of human healthy cells is called cancer. One of the major types of cancer is sarcoma, mostly found in human bones and soft tissue cells. It commonly occurs in children. According to a survey of the United States of America, there are more than 17,000 sarcoma patients registered each year which is 15% of all cancer cases. Recognition of cancer at its early stage saves many lives. The proposed study developed a framework for the early detection of human sarcoma cancer using deep learning Recurrent Neural Network (RNN) algorithms. The DNA of a human cell is made up of 25,000 to 30,000 genes. Each gene is represented by sequences of nucleotides. The nucleotides in a sequence of a driver gene can change which is termed as mutations. Some mutations can cause cancer. There are seven types of a gene whose mutation causes sarcoma cancer. The study uses the dataset which has been taken from more than 134 samples and includes 141 mutations in 8 driver genes. On these gene sequences RNN algorithms Long and Short-Term Memory (LSTM), Gated Recurrent Units and Bi-directional LSTM (Bi-LSTM) are used for training. Rigorous testing techniques such as Self-consistency testing, independent set testing, 10-fold cross-validation test are applied for the validation of results. These validation techniques yield several metrics such as Area Under the Curve (AUC), sensitivity, specificity, Mathew's correlation coefficient, loss, and accuracy. The proposed algorithm exhibits an accuracy of 99.6% with an AUC value of 1.00. SAGE Publications 2022-10-22 /pmc/articles/PMC9597026/ /pubmed/36312852 http://dx.doi.org/10.1177/20552076221133703 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Shah, Asghar Ali
Alturise, Fahad
Alkhalifah, Tamim
Khan, Yaser Daanial
Evaluation of deep learning techniques for identification of sarcoma-causing carcinogenic mutations
title Evaluation of deep learning techniques for identification of sarcoma-causing carcinogenic mutations
title_full Evaluation of deep learning techniques for identification of sarcoma-causing carcinogenic mutations
title_fullStr Evaluation of deep learning techniques for identification of sarcoma-causing carcinogenic mutations
title_full_unstemmed Evaluation of deep learning techniques for identification of sarcoma-causing carcinogenic mutations
title_short Evaluation of deep learning techniques for identification of sarcoma-causing carcinogenic mutations
title_sort evaluation of deep learning techniques for identification of sarcoma-causing carcinogenic mutations
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9597026/
https://www.ncbi.nlm.nih.gov/pubmed/36312852
http://dx.doi.org/10.1177/20552076221133703
work_keys_str_mv AT shahasgharali evaluationofdeeplearningtechniquesforidentificationofsarcomacausingcarcinogenicmutations
AT alturisefahad evaluationofdeeplearningtechniquesforidentificationofsarcomacausingcarcinogenicmutations
AT alkhalifahtamim evaluationofdeeplearningtechniquesforidentificationofsarcomacausingcarcinogenicmutations
AT khanyaserdaanial evaluationofdeeplearningtechniquesforidentificationofsarcomacausingcarcinogenicmutations