Cargando…
GANA—a genetic algorithm for NMR backbone resonance assignment
NMR data from different experiments often contain errors; thus, automated backbone resonance assignment is a very challenging issue. In this paper, we present a method called GANA that uses a genetic algorithm to automatically perform backbone resonance assignment with a high degree of precision and...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2005
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1184223/ https://www.ncbi.nlm.nih.gov/pubmed/16093550 http://dx.doi.org/10.1093/nar/gki768 |
_version_ | 1782124725126823936 |
---|---|
author | Lin, Hsin-Nan Wu, Kun-Pin Chang, Jia-Ming Sung, Ting-Yi Hsu, Wen-Lian |
author_facet | Lin, Hsin-Nan Wu, Kun-Pin Chang, Jia-Ming Sung, Ting-Yi Hsu, Wen-Lian |
author_sort | Lin, Hsin-Nan |
collection | PubMed |
description | NMR data from different experiments often contain errors; thus, automated backbone resonance assignment is a very challenging issue. In this paper, we present a method called GANA that uses a genetic algorithm to automatically perform backbone resonance assignment with a high degree of precision and recall. Precision is the number of correctly assigned residues divided by the number of assigned residues, and recall is the number of correctly assigned residues divided by the number of residues with known human curated answers. GANA takes spin systems as input data and uses two data structures, candidate lists and adjacency lists, to assign the spin systems to each amino acid of a target protein. Using GANA, almost all spin systems can be mapped correctly onto a target protein, even if the data are noisy. We use the BioMagResBank (BMRB) dataset (901 proteins) to test the performance of GANA. To evaluate the robustness of GANA, we generate four additional datasets from the BMRB dataset to simulate data errors of false positives, false negatives and linking errors. We also use a combination of these three error types to examine the fault tolerance of our method. The average precision rates of GANA on BMRB and the four simulated test cases are 99.61, 99.55, 99.34, 99.35 and 98.60%, respectively. The average recall rates of GANA on BMRB and the four simulated test cases are 99.26, 99.19, 98.85, 98.87 and 97.78%, respectively. We also test GANA on two real wet-lab datasets, hbSBD and hbLBD. The precision and recall rates of GANA on hbSBD are 95.12 and 92.86%, respectively, and those of hbLBD are 100 and 97.40%, respectively. |
format | Text |
id | pubmed-1184223 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2005 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-11842232005-08-11 GANA—a genetic algorithm for NMR backbone resonance assignment Lin, Hsin-Nan Wu, Kun-Pin Chang, Jia-Ming Sung, Ting-Yi Hsu, Wen-Lian Nucleic Acids Res Article NMR data from different experiments often contain errors; thus, automated backbone resonance assignment is a very challenging issue. In this paper, we present a method called GANA that uses a genetic algorithm to automatically perform backbone resonance assignment with a high degree of precision and recall. Precision is the number of correctly assigned residues divided by the number of assigned residues, and recall is the number of correctly assigned residues divided by the number of residues with known human curated answers. GANA takes spin systems as input data and uses two data structures, candidate lists and adjacency lists, to assign the spin systems to each amino acid of a target protein. Using GANA, almost all spin systems can be mapped correctly onto a target protein, even if the data are noisy. We use the BioMagResBank (BMRB) dataset (901 proteins) to test the performance of GANA. To evaluate the robustness of GANA, we generate four additional datasets from the BMRB dataset to simulate data errors of false positives, false negatives and linking errors. We also use a combination of these three error types to examine the fault tolerance of our method. The average precision rates of GANA on BMRB and the four simulated test cases are 99.61, 99.55, 99.34, 99.35 and 98.60%, respectively. The average recall rates of GANA on BMRB and the four simulated test cases are 99.26, 99.19, 98.85, 98.87 and 97.78%, respectively. We also test GANA on two real wet-lab datasets, hbSBD and hbLBD. The precision and recall rates of GANA on hbSBD are 95.12 and 92.86%, respectively, and those of hbLBD are 100 and 97.40%, respectively. Oxford University Press 2005 2005-08-10 /pmc/articles/PMC1184223/ /pubmed/16093550 http://dx.doi.org/10.1093/nar/gki768 Text en © The Author 2005. Published by Oxford University Press. All rights reserved |
spellingShingle | Article Lin, Hsin-Nan Wu, Kun-Pin Chang, Jia-Ming Sung, Ting-Yi Hsu, Wen-Lian GANA—a genetic algorithm for NMR backbone resonance assignment |
title | GANA—a genetic algorithm for NMR backbone resonance assignment |
title_full | GANA—a genetic algorithm for NMR backbone resonance assignment |
title_fullStr | GANA—a genetic algorithm for NMR backbone resonance assignment |
title_full_unstemmed | GANA—a genetic algorithm for NMR backbone resonance assignment |
title_short | GANA—a genetic algorithm for NMR backbone resonance assignment |
title_sort | gana—a genetic algorithm for nmr backbone resonance assignment |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1184223/ https://www.ncbi.nlm.nih.gov/pubmed/16093550 http://dx.doi.org/10.1093/nar/gki768 |
work_keys_str_mv | AT linhsinnan ganaageneticalgorithmfornmrbackboneresonanceassignment AT wukunpin ganaageneticalgorithmfornmrbackboneresonanceassignment AT changjiaming ganaageneticalgorithmfornmrbackboneresonanceassignment AT sungtingyi ganaageneticalgorithmfornmrbackboneresonanceassignment AT hsuwenlian ganaageneticalgorithmfornmrbackboneresonanceassignment |