Cargando…

Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications

Data mining and pattern recognition methods reveal interesting findings in genetic studies, especially on how the genetic makeup is associated with inherited diseases. Although researchers have proposed various data mining models for biomedical approaches, there remains a challenge in accurately pri...

Descripción completa

Detalles Bibliográficos
Autores principales: Iddamalgoda, Lahiru, Das, Partha S., Aponso, Achala, Sundararajan, Vijayaraghava S., Suravajhala, Prashanth, Valadi, Jayaraman K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4979376/
https://www.ncbi.nlm.nih.gov/pubmed/27559342
http://dx.doi.org/10.3389/fgene.2016.00136
_version_ 1782447311299805184
author Iddamalgoda, Lahiru
Das, Partha S.
Aponso, Achala
Sundararajan, Vijayaraghava S.
Suravajhala, Prashanth
Valadi, Jayaraman K.
author_facet Iddamalgoda, Lahiru
Das, Partha S.
Aponso, Achala
Sundararajan, Vijayaraghava S.
Suravajhala, Prashanth
Valadi, Jayaraman K.
author_sort Iddamalgoda, Lahiru
collection PubMed
description Data mining and pattern recognition methods reveal interesting findings in genetic studies, especially on how the genetic makeup is associated with inherited diseases. Although researchers have proposed various data mining models for biomedical approaches, there remains a challenge in accurately prioritizing the single nucleotide polymorphisms (SNP) associated with the disease. In this commentary, we review the state-of-art data mining and pattern recognition models for identifying inherited diseases and deliberate the need of binary classification- and scoring-based prioritization methods in determining causal variants. While we discuss the pros and cons associated with these methods known, we argue that the gene prioritization methods and the protein interaction (PPI) methods in conjunction with the K nearest neighbors' could be used in accurately categorizing the genetic factors in disease causation.
format Online
Article
Text
id pubmed-4979376
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-49793762016-08-24 Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications Iddamalgoda, Lahiru Das, Partha S. Aponso, Achala Sundararajan, Vijayaraghava S. Suravajhala, Prashanth Valadi, Jayaraman K. Front Genet Genetics Data mining and pattern recognition methods reveal interesting findings in genetic studies, especially on how the genetic makeup is associated with inherited diseases. Although researchers have proposed various data mining models for biomedical approaches, there remains a challenge in accurately prioritizing the single nucleotide polymorphisms (SNP) associated with the disease. In this commentary, we review the state-of-art data mining and pattern recognition models for identifying inherited diseases and deliberate the need of binary classification- and scoring-based prioritization methods in determining causal variants. While we discuss the pros and cons associated with these methods known, we argue that the gene prioritization methods and the protein interaction (PPI) methods in conjunction with the K nearest neighbors' could be used in accurately categorizing the genetic factors in disease causation. Frontiers Media S.A. 2016-08-10 /pmc/articles/PMC4979376/ /pubmed/27559342 http://dx.doi.org/10.3389/fgene.2016.00136 Text en Copyright © 2016 Iddamalgoda, Das, Aponso, Sundararajan, Suravajhala and Valadi. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Iddamalgoda, Lahiru
Das, Partha S.
Aponso, Achala
Sundararajan, Vijayaraghava S.
Suravajhala, Prashanth
Valadi, Jayaraman K.
Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications
title Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications
title_full Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications
title_fullStr Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications
title_full_unstemmed Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications
title_short Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications
title_sort data mining and pattern recognition models for identifying inherited diseases: challenges and implications
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4979376/
https://www.ncbi.nlm.nih.gov/pubmed/27559342
http://dx.doi.org/10.3389/fgene.2016.00136
work_keys_str_mv AT iddamalgodalahiru dataminingandpatternrecognitionmodelsforidentifyinginheriteddiseaseschallengesandimplications
AT dasparthas dataminingandpatternrecognitionmodelsforidentifyinginheriteddiseaseschallengesandimplications
AT aponsoachala dataminingandpatternrecognitionmodelsforidentifyinginheriteddiseaseschallengesandimplications
AT sundararajanvijayaraghavas dataminingandpatternrecognitionmodelsforidentifyinginheriteddiseaseschallengesandimplications
AT suravajhalaprashanth dataminingandpatternrecognitionmodelsforidentifyinginheriteddiseaseschallengesandimplications
AT valadijayaramank dataminingandpatternrecognitionmodelsforidentifyinginheriteddiseaseschallengesandimplications