Cargando…

A general-purpose signal processing algorithm for biological profiles using only first-order derivative information

BACKGROUND: Automatic signal-feature extraction algorithms are crucial for profile processing in bioinformatics. Both baseline drift and noise seriously affect the position and peak area of signals. An efficient algorithm named the derivative passing accumulation (DPA) method for simultaneous baseli...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yuanjie, Lin, Jianhan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882060/
https://www.ncbi.nlm.nih.gov/pubmed/31775621
http://dx.doi.org/10.1186/s12859-019-3188-4
_version_ 1783474073387401216
author Liu, Yuanjie
Lin, Jianhan
author_facet Liu, Yuanjie
Lin, Jianhan
author_sort Liu, Yuanjie
collection PubMed
description BACKGROUND: Automatic signal-feature extraction algorithms are crucial for profile processing in bioinformatics. Both baseline drift and noise seriously affect the position and peak area of signals. An efficient algorithm named the derivative passing accumulation (DPA) method for simultaneous baseline correction and signal extraction is presented in this article. It is an efficient method using only the first-order derivatives which are obtained through taking the simple differences. RESULTS: We developed a new signal feature extracting procedure. The vector representing the discrete first-order derivative was divided into negative and positive parts and then accumulated to build a signal descriptor. The signals and background fluctuations are easily separated according to this descriptor via thresholding. In addition, the signal peaks are simultaneously located by checking the corresponding intervals in the descriptor. Therefore, the eternal issues of parsing the 1-dimensional output of detectors in biological instruments are solved together. Thereby, the baseline is corrected, and the signal peaks are extracted. CONCLUSIONS: We have introduced a new method for signal peak picking, where baseline computation and peak identification are performed jointly. The testing results of both authentic and artificially synthesized data illustrate that the new method is powerful, and it could be a better choice for practical processing.
format Online
Article
Text
id pubmed-6882060
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68820602019-12-03 A general-purpose signal processing algorithm for biological profiles using only first-order derivative information Liu, Yuanjie Lin, Jianhan BMC Bioinformatics Methodology Article BACKGROUND: Automatic signal-feature extraction algorithms are crucial for profile processing in bioinformatics. Both baseline drift and noise seriously affect the position and peak area of signals. An efficient algorithm named the derivative passing accumulation (DPA) method for simultaneous baseline correction and signal extraction is presented in this article. It is an efficient method using only the first-order derivatives which are obtained through taking the simple differences. RESULTS: We developed a new signal feature extracting procedure. The vector representing the discrete first-order derivative was divided into negative and positive parts and then accumulated to build a signal descriptor. The signals and background fluctuations are easily separated according to this descriptor via thresholding. In addition, the signal peaks are simultaneously located by checking the corresponding intervals in the descriptor. Therefore, the eternal issues of parsing the 1-dimensional output of detectors in biological instruments are solved together. Thereby, the baseline is corrected, and the signal peaks are extracted. CONCLUSIONS: We have introduced a new method for signal peak picking, where baseline computation and peak identification are performed jointly. The testing results of both authentic and artificially synthesized data illustrate that the new method is powerful, and it could be a better choice for practical processing. BioMed Central 2019-11-27 /pmc/articles/PMC6882060/ /pubmed/31775621 http://dx.doi.org/10.1186/s12859-019-3188-4 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Liu, Yuanjie
Lin, Jianhan
A general-purpose signal processing algorithm for biological profiles using only first-order derivative information
title A general-purpose signal processing algorithm for biological profiles using only first-order derivative information
title_full A general-purpose signal processing algorithm for biological profiles using only first-order derivative information
title_fullStr A general-purpose signal processing algorithm for biological profiles using only first-order derivative information
title_full_unstemmed A general-purpose signal processing algorithm for biological profiles using only first-order derivative information
title_short A general-purpose signal processing algorithm for biological profiles using only first-order derivative information
title_sort general-purpose signal processing algorithm for biological profiles using only first-order derivative information
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882060/
https://www.ncbi.nlm.nih.gov/pubmed/31775621
http://dx.doi.org/10.1186/s12859-019-3188-4
work_keys_str_mv AT liuyuanjie ageneralpurposesignalprocessingalgorithmforbiologicalprofilesusingonlyfirstorderderivativeinformation
AT linjianhan ageneralpurposesignalprocessingalgorithmforbiologicalprofilesusingonlyfirstorderderivativeinformation
AT liuyuanjie generalpurposesignalprocessingalgorithmforbiologicalprofilesusingonlyfirstorderderivativeinformation
AT linjianhan generalpurposesignalprocessingalgorithmforbiologicalprofilesusingonlyfirstorderderivativeinformation