Cargando…
Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications
Data mining and pattern recognition methods reveal interesting findings in genetic studies, especially on how the genetic makeup is associated with inherited diseases. Although researchers have proposed various data mining models for biomedical approaches, there remains a challenge in accurately pri...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4979376/ https://www.ncbi.nlm.nih.gov/pubmed/27559342 http://dx.doi.org/10.3389/fgene.2016.00136 |
_version_ | 1782447311299805184 |
---|---|
author | Iddamalgoda, Lahiru Das, Partha S. Aponso, Achala Sundararajan, Vijayaraghava S. Suravajhala, Prashanth Valadi, Jayaraman K. |
author_facet | Iddamalgoda, Lahiru Das, Partha S. Aponso, Achala Sundararajan, Vijayaraghava S. Suravajhala, Prashanth Valadi, Jayaraman K. |
author_sort | Iddamalgoda, Lahiru |
collection | PubMed |
description | Data mining and pattern recognition methods reveal interesting findings in genetic studies, especially on how the genetic makeup is associated with inherited diseases. Although researchers have proposed various data mining models for biomedical approaches, there remains a challenge in accurately prioritizing the single nucleotide polymorphisms (SNP) associated with the disease. In this commentary, we review the state-of-art data mining and pattern recognition models for identifying inherited diseases and deliberate the need of binary classification- and scoring-based prioritization methods in determining causal variants. While we discuss the pros and cons associated with these methods known, we argue that the gene prioritization methods and the protein interaction (PPI) methods in conjunction with the K nearest neighbors' could be used in accurately categorizing the genetic factors in disease causation. |
format | Online Article Text |
id | pubmed-4979376 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-49793762016-08-24 Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications Iddamalgoda, Lahiru Das, Partha S. Aponso, Achala Sundararajan, Vijayaraghava S. Suravajhala, Prashanth Valadi, Jayaraman K. Front Genet Genetics Data mining and pattern recognition methods reveal interesting findings in genetic studies, especially on how the genetic makeup is associated with inherited diseases. Although researchers have proposed various data mining models for biomedical approaches, there remains a challenge in accurately prioritizing the single nucleotide polymorphisms (SNP) associated with the disease. In this commentary, we review the state-of-art data mining and pattern recognition models for identifying inherited diseases and deliberate the need of binary classification- and scoring-based prioritization methods in determining causal variants. While we discuss the pros and cons associated with these methods known, we argue that the gene prioritization methods and the protein interaction (PPI) methods in conjunction with the K nearest neighbors' could be used in accurately categorizing the genetic factors in disease causation. Frontiers Media S.A. 2016-08-10 /pmc/articles/PMC4979376/ /pubmed/27559342 http://dx.doi.org/10.3389/fgene.2016.00136 Text en Copyright © 2016 Iddamalgoda, Das, Aponso, Sundararajan, Suravajhala and Valadi. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Iddamalgoda, Lahiru Das, Partha S. Aponso, Achala Sundararajan, Vijayaraghava S. Suravajhala, Prashanth Valadi, Jayaraman K. Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications |
title | Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications |
title_full | Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications |
title_fullStr | Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications |
title_full_unstemmed | Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications |
title_short | Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications |
title_sort | data mining and pattern recognition models for identifying inherited diseases: challenges and implications |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4979376/ https://www.ncbi.nlm.nih.gov/pubmed/27559342 http://dx.doi.org/10.3389/fgene.2016.00136 |
work_keys_str_mv | AT iddamalgodalahiru dataminingandpatternrecognitionmodelsforidentifyinginheriteddiseaseschallengesandimplications AT dasparthas dataminingandpatternrecognitionmodelsforidentifyinginheriteddiseaseschallengesandimplications AT aponsoachala dataminingandpatternrecognitionmodelsforidentifyinginheriteddiseaseschallengesandimplications AT sundararajanvijayaraghavas dataminingandpatternrecognitionmodelsforidentifyinginheriteddiseaseschallengesandimplications AT suravajhalaprashanth dataminingandpatternrecognitionmodelsforidentifyinginheriteddiseaseschallengesandimplications AT valadijayaramank dataminingandpatternrecognitionmodelsforidentifyinginheriteddiseaseschallengesandimplications |