
Alignment-free similarity analysis for protein sequences based on fuzzy integral


Bibliographic Details
Main authors: Saw, Ajay Kumar, Tripathy, Binod Chandra, Nandi, Soumyadeep
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6391537/
https://www.ncbi.nlm.nih.gov/pubmed/30808983
http://dx.doi.org/10.1038/s41598-019-39477-8
Description
Summary: Sequence comparison is an essential part of modern molecular biology research. In this study, we estimated the parameters of a Markov chain from the frequencies of occurrence of all possible amino acid pairs in each protein sequence, without requiring alignment. These estimated Markov chain parameters were then used to calculate the similarity between two protein sequences with a fuzzy integral algorithm. For validation, our results were compared with both alignment-based (ClustalW) and alignment-free methods on six benchmark datasets. The results indicate that our algorithm achieves better clustering performance for protein sequence comparison.
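
The first step in the summary, estimating Markov chain parameters from amino acid pair frequencies, can be illustrated with a minimal sketch. It assumes a first-order chain over the 20 standard amino acids and a simple row-normalized count of adjacent residue pairs. The function name `transition_matrix`, the alphabet constant, and the example sequence are hypothetical, and the paper's exact estimator may differ. The fuzzy integral similarity step is not specified in this record and is not reproduced here.

```python
from collections import Counter

# Assumption: the 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def transition_matrix(sequence):
    """Estimate first-order transition probabilities P(b | a) from adjacent pairs.

    Pairs that contain a non-standard symbol are skipped. Rows for residues
    that never occur as the first element of a pair are left as all zeros.
    """
    seq = sequence.upper()
    pair_counts = Counter(
        (a, b)
        for a, b in zip(seq, seq[1:])
        if a in AMINO_ACIDS and b in AMINO_ACIDS
    )
    matrix = {}
    for a in AMINO_ACIDS:
        row_total = sum(pair_counts[(a, b)] for b in AMINO_ACIDS)
        matrix[a] = {
            b: (pair_counts[(a, b)] / row_total if row_total else 0.0)
            for b in AMINO_ACIDS
        }
    return matrix


# Hypothetical toy sequence, used only to show the call.
probs = transition_matrix("MKVLAAGIVKLA")
print(probs["L"]["A"])  # estimated probability that A follows L
```

Each sequence yields a 20 x 20 matrix of this kind regardless of its length, which is what lets sequences be compared without aligning them.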