Cargando…

Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications

Data mining and pattern recognition methods reveal interesting findings in genetic studies, especially on how the genetic makeup is associated with inherited diseases. Although researchers have proposed various data mining models for biomedical approaches, there remains a challenge in accurately pri...

Descripción completa

Detalles Bibliográficos
Autores principales: Iddamalgoda, Lahiru, Das, Partha S., Aponso, Achala, Sundararajan, Vijayaraghava S., Suravajhala, Prashanth, Valadi, Jayaraman K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4979376/
https://www.ncbi.nlm.nih.gov/pubmed/27559342
http://dx.doi.org/10.3389/fgene.2016.00136
Descripción
Sumario:Data mining and pattern recognition methods reveal interesting findings in genetic studies, especially on how the genetic makeup is associated with inherited diseases. Although researchers have proposed various data mining models for biomedical approaches, there remains a challenge in accurately prioritizing the single nucleotide polymorphisms (SNP) associated with the disease. In this commentary, we review the state-of-art data mining and pattern recognition models for identifying inherited diseases and deliberate the need of binary classification- and scoring-based prioritization methods in determining causal variants. While we discuss the pros and cons associated with these methods known, we argue that the gene prioritization methods and the protein interaction (PPI) methods in conjunction with the K nearest neighbors' could be used in accurately categorizing the genetic factors in disease causation.