Cargando…

GANA—a genetic algorithm for NMR backbone resonance assignment

NMR data from different experiments often contain errors; thus, automated backbone resonance assignment is a very challenging issue. In this paper, we present a method called GANA that uses a genetic algorithm to automatically perform backbone resonance assignment with a high degree of precision and...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lin, Hsin-Nan, Wu, Kun-Pin, Chang, Jia-Ming, Sung, Ting-Yi, Hsu, Wen-Lian
Formato:	Texto
Lenguaje:	English
Publicado:	Oxford University Press 2005
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1184223/ https://www.ncbi.nlm.nih.gov/pubmed/16093550 http://dx.doi.org/10.1093/nar/gki768

_version_	1782124725126823936
author	Lin, Hsin-Nan Wu, Kun-Pin Chang, Jia-Ming Sung, Ting-Yi Hsu, Wen-Lian
author_facet	Lin, Hsin-Nan Wu, Kun-Pin Chang, Jia-Ming Sung, Ting-Yi Hsu, Wen-Lian
author_sort	Lin, Hsin-Nan
collection	PubMed
description	NMR data from different experiments often contain errors; thus, automated backbone resonance assignment is a very challenging issue. In this paper, we present a method called GANA that uses a genetic algorithm to automatically perform backbone resonance assignment with a high degree of precision and recall. Precision is the number of correctly assigned residues divided by the number of assigned residues, and recall is the number of correctly assigned residues divided by the number of residues with known human curated answers. GANA takes spin systems as input data and uses two data structures, candidate lists and adjacency lists, to assign the spin systems to each amino acid of a target protein. Using GANA, almost all spin systems can be mapped correctly onto a target protein, even if the data are noisy. We use the BioMagResBank (BMRB) dataset (901 proteins) to test the performance of GANA. To evaluate the robustness of GANA, we generate four additional datasets from the BMRB dataset to simulate data errors of false positives, false negatives and linking errors. We also use a combination of these three error types to examine the fault tolerance of our method. The average precision rates of GANA on BMRB and the four simulated test cases are 99.61, 99.55, 99.34, 99.35 and 98.60%, respectively. The average recall rates of GANA on BMRB and the four simulated test cases are 99.26, 99.19, 98.85, 98.87 and 97.78%, respectively. We also test GANA on two real wet-lab datasets, hbSBD and hbLBD. The precision and recall rates of GANA on hbSBD are 95.12 and 92.86%, respectively, and those of hbLBD are 100 and 97.40%, respectively.
format	Text
id	pubmed-1184223
institution	National Center for Biotechnology Information
language	English
publishDate	2005
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-11842232005-08-11 GANA—a genetic algorithm for NMR backbone resonance assignment Lin, Hsin-Nan Wu, Kun-Pin Chang, Jia-Ming Sung, Ting-Yi Hsu, Wen-Lian Nucleic Acids Res Article NMR data from different experiments often contain errors; thus, automated backbone resonance assignment is a very challenging issue. In this paper, we present a method called GANA that uses a genetic algorithm to automatically perform backbone resonance assignment with a high degree of precision and recall. Precision is the number of correctly assigned residues divided by the number of assigned residues, and recall is the number of correctly assigned residues divided by the number of residues with known human curated answers. GANA takes spin systems as input data and uses two data structures, candidate lists and adjacency lists, to assign the spin systems to each amino acid of a target protein. Using GANA, almost all spin systems can be mapped correctly onto a target protein, even if the data are noisy. We use the BioMagResBank (BMRB) dataset (901 proteins) to test the performance of GANA. To evaluate the robustness of GANA, we generate four additional datasets from the BMRB dataset to simulate data errors of false positives, false negatives and linking errors. We also use a combination of these three error types to examine the fault tolerance of our method. The average precision rates of GANA on BMRB and the four simulated test cases are 99.61, 99.55, 99.34, 99.35 and 98.60%, respectively. The average recall rates of GANA on BMRB and the four simulated test cases are 99.26, 99.19, 98.85, 98.87 and 97.78%, respectively. We also test GANA on two real wet-lab datasets, hbSBD and hbLBD. The precision and recall rates of GANA on hbSBD are 95.12 and 92.86%, respectively, and those of hbLBD are 100 and 97.40%, respectively. Oxford University Press 2005 2005-08-10 /pmc/articles/PMC1184223/ /pubmed/16093550 http://dx.doi.org/10.1093/nar/gki768 Text en © The Author 2005. Published by Oxford University Press. All rights reserved
spellingShingle	Article Lin, Hsin-Nan Wu, Kun-Pin Chang, Jia-Ming Sung, Ting-Yi Hsu, Wen-Lian GANA—a genetic algorithm for NMR backbone resonance assignment
title	GANA—a genetic algorithm for NMR backbone resonance assignment
title_full	GANA—a genetic algorithm for NMR backbone resonance assignment
title_fullStr	GANA—a genetic algorithm for NMR backbone resonance assignment
title_full_unstemmed	GANA—a genetic algorithm for NMR backbone resonance assignment
title_short	GANA—a genetic algorithm for NMR backbone resonance assignment
title_sort	gana—a genetic algorithm for nmr backbone resonance assignment
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1184223/ https://www.ncbi.nlm.nih.gov/pubmed/16093550 http://dx.doi.org/10.1093/nar/gki768
work_keys_str_mv	AT linhsinnan ganaageneticalgorithmfornmrbackboneresonanceassignment AT wukunpin ganaageneticalgorithmfornmrbackboneresonanceassignment AT changjiaming ganaageneticalgorithmfornmrbackboneresonanceassignment AT sungtingyi ganaageneticalgorithmfornmrbackboneresonanceassignment AT hsuwenlian ganaageneticalgorithmfornmrbackboneresonanceassignment

GANA—a genetic algorithm for NMR backbone resonance assignment

Ejemplares similares