Cargando…

WaVPeak: picking NMR peaks through wavelet-based smoothing and volume-based filtering

Motivation: Nuclear magnetic resonance (NMR) has been widely used as a powerful tool to determine the 3D structures of proteins in vivo. However, the post-spectra processing stage of NMR structure determination usually involves a tremendous amount of time and expert knowledge, which includes peak pi...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Zhi, Abbas, Ahmed, Jing, Bing-Yi, Gao, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3315717/
https://www.ncbi.nlm.nih.gov/pubmed/22328784
http://dx.doi.org/10.1093/bioinformatics/bts078
_version_ 1782228277712125952
author Liu, Zhi
Abbas, Ahmed
Jing, Bing-Yi
Gao, Xin
author_facet Liu, Zhi
Abbas, Ahmed
Jing, Bing-Yi
Gao, Xin
author_sort Liu, Zhi
collection PubMed
description Motivation: Nuclear magnetic resonance (NMR) has been widely used as a powerful tool to determine the 3D structures of proteins in vivo. However, the post-spectra processing stage of NMR structure determination usually involves a tremendous amount of time and expert knowledge, which includes peak picking, chemical shift assignment and structure calculation steps. Detecting accurate peaks from the NMR spectra is a prerequisite for all following steps, and thus remains a key problem in automatic NMR structure determination. Results: We introduce WaVPeak, a fully automatic peak detection method. WaVPeak first smoothes the given NMR spectrum by wavelets. The peaks are then identified as the local maxima. The false positive peaks are filtered out efficiently by considering the volume of the peaks. WaVPeak has two major advantages over the state-of-the-art peak-picking methods. First, through wavelet-based smoothing, WaVPeak does not eliminate any data point in the spectra. Therefore, WaVPeak is able to detect weak peaks that are embedded in the noise level. NMR spectroscopists need the most help isolating these weak peaks. Second, WaVPeak estimates the volume of the peaks to filter the false positives. This is more reliable than intensity-based filters that are widely used in existing methods. We evaluate the performance of WaVPeak on the benchmark set proposed by PICKY (Alipanahi et al., 2009), one of the most accurate methods in the literature. The dataset comprises 32 2D and 3D spectra from eight different proteins. Experimental results demonstrate that WaVPeak achieves an average of 96%, 91%, 88%, 76% and 85% recall on (15)N-HSQC, HNCO, HNCA, HNCACB and CBCA(CO)NH, respectively. When the same number of peaks are considered, WaVPeak significantly outperforms PICKY. Availability: WaVPeak is an open source program. The source code and two test spectra of WaVPeak are available at http://faculty.kaust.edu.sa/sites/xingao/Pages/Publications.aspx. The online server is under construction. Contact: statliuzhi@xmu.edu.cn; ahmed.abbas@kaust.edu.sa; majing@ust.hk; xin.gao@kaust.edu.sa
format Online
Article
Text
id pubmed-3315717
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-33157172012-03-30 WaVPeak: picking NMR peaks through wavelet-based smoothing and volume-based filtering Liu, Zhi Abbas, Ahmed Jing, Bing-Yi Gao, Xin Bioinformatics Original Papers Motivation: Nuclear magnetic resonance (NMR) has been widely used as a powerful tool to determine the 3D structures of proteins in vivo. However, the post-spectra processing stage of NMR structure determination usually involves a tremendous amount of time and expert knowledge, which includes peak picking, chemical shift assignment and structure calculation steps. Detecting accurate peaks from the NMR spectra is a prerequisite for all following steps, and thus remains a key problem in automatic NMR structure determination. Results: We introduce WaVPeak, a fully automatic peak detection method. WaVPeak first smoothes the given NMR spectrum by wavelets. The peaks are then identified as the local maxima. The false positive peaks are filtered out efficiently by considering the volume of the peaks. WaVPeak has two major advantages over the state-of-the-art peak-picking methods. First, through wavelet-based smoothing, WaVPeak does not eliminate any data point in the spectra. Therefore, WaVPeak is able to detect weak peaks that are embedded in the noise level. NMR spectroscopists need the most help isolating these weak peaks. Second, WaVPeak estimates the volume of the peaks to filter the false positives. This is more reliable than intensity-based filters that are widely used in existing methods. We evaluate the performance of WaVPeak on the benchmark set proposed by PICKY (Alipanahi et al., 2009), one of the most accurate methods in the literature. The dataset comprises 32 2D and 3D spectra from eight different proteins. Experimental results demonstrate that WaVPeak achieves an average of 96%, 91%, 88%, 76% and 85% recall on (15)N-HSQC, HNCO, HNCA, HNCACB and CBCA(CO)NH, respectively. When the same number of peaks are considered, WaVPeak significantly outperforms PICKY. Availability: WaVPeak is an open source program. The source code and two test spectra of WaVPeak are available at http://faculty.kaust.edu.sa/sites/xingao/Pages/Publications.aspx. The online server is under construction. Contact: statliuzhi@xmu.edu.cn; ahmed.abbas@kaust.edu.sa; majing@ust.hk; xin.gao@kaust.edu.sa Oxford University Press 2012-04-01 2012-02-10 /pmc/articles/PMC3315717/ /pubmed/22328784 http://dx.doi.org/10.1093/bioinformatics/bts078 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Liu, Zhi
Abbas, Ahmed
Jing, Bing-Yi
Gao, Xin
WaVPeak: picking NMR peaks through wavelet-based smoothing and volume-based filtering
title WaVPeak: picking NMR peaks through wavelet-based smoothing and volume-based filtering
title_full WaVPeak: picking NMR peaks through wavelet-based smoothing and volume-based filtering
title_fullStr WaVPeak: picking NMR peaks through wavelet-based smoothing and volume-based filtering
title_full_unstemmed WaVPeak: picking NMR peaks through wavelet-based smoothing and volume-based filtering
title_short WaVPeak: picking NMR peaks through wavelet-based smoothing and volume-based filtering
title_sort wavpeak: picking nmr peaks through wavelet-based smoothing and volume-based filtering
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3315717/
https://www.ncbi.nlm.nih.gov/pubmed/22328784
http://dx.doi.org/10.1093/bioinformatics/bts078
work_keys_str_mv AT liuzhi wavpeakpickingnmrpeaksthroughwaveletbasedsmoothingandvolumebasedfiltering
AT abbasahmed wavpeakpickingnmrpeaksthroughwaveletbasedsmoothingandvolumebasedfiltering
AT jingbingyi wavpeakpickingnmrpeaksthroughwaveletbasedsmoothingandvolumebasedfiltering
AT gaoxin wavpeakpickingnmrpeaksthroughwaveletbasedsmoothingandvolumebasedfiltering