Cargando…
A general-purpose signal processing algorithm for biological profiles using only first-order derivative information
BACKGROUND: Automatic signal-feature extraction algorithms are crucial for profile processing in bioinformatics. Both baseline drift and noise seriously affect the position and peak area of signals. An efficient algorithm named the derivative passing accumulation (DPA) method for simultaneous baseli...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882060/ https://www.ncbi.nlm.nih.gov/pubmed/31775621 http://dx.doi.org/10.1186/s12859-019-3188-4 |
_version_ | 1783474073387401216 |
---|---|
author | Liu, Yuanjie Lin, Jianhan |
author_facet | Liu, Yuanjie Lin, Jianhan |
author_sort | Liu, Yuanjie |
collection | PubMed |
description | BACKGROUND: Automatic signal-feature extraction algorithms are crucial for profile processing in bioinformatics. Both baseline drift and noise seriously affect the position and peak area of signals. An efficient algorithm named the derivative passing accumulation (DPA) method for simultaneous baseline correction and signal extraction is presented in this article. It is an efficient method using only the first-order derivatives which are obtained through taking the simple differences. RESULTS: We developed a new signal feature extracting procedure. The vector representing the discrete first-order derivative was divided into negative and positive parts and then accumulated to build a signal descriptor. The signals and background fluctuations are easily separated according to this descriptor via thresholding. In addition, the signal peaks are simultaneously located by checking the corresponding intervals in the descriptor. Therefore, the eternal issues of parsing the 1-dimensional output of detectors in biological instruments are solved together. Thereby, the baseline is corrected, and the signal peaks are extracted. CONCLUSIONS: We have introduced a new method for signal peak picking, where baseline computation and peak identification are performed jointly. The testing results of both authentic and artificially synthesized data illustrate that the new method is powerful, and it could be a better choice for practical processing. |
format | Online Article Text |
id | pubmed-6882060 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-68820602019-12-03 A general-purpose signal processing algorithm for biological profiles using only first-order derivative information Liu, Yuanjie Lin, Jianhan BMC Bioinformatics Methodology Article BACKGROUND: Automatic signal-feature extraction algorithms are crucial for profile processing in bioinformatics. Both baseline drift and noise seriously affect the position and peak area of signals. An efficient algorithm named the derivative passing accumulation (DPA) method for simultaneous baseline correction and signal extraction is presented in this article. It is an efficient method using only the first-order derivatives which are obtained through taking the simple differences. RESULTS: We developed a new signal feature extracting procedure. The vector representing the discrete first-order derivative was divided into negative and positive parts and then accumulated to build a signal descriptor. The signals and background fluctuations are easily separated according to this descriptor via thresholding. In addition, the signal peaks are simultaneously located by checking the corresponding intervals in the descriptor. Therefore, the eternal issues of parsing the 1-dimensional output of detectors in biological instruments are solved together. Thereby, the baseline is corrected, and the signal peaks are extracted. CONCLUSIONS: We have introduced a new method for signal peak picking, where baseline computation and peak identification are performed jointly. The testing results of both authentic and artificially synthesized data illustrate that the new method is powerful, and it could be a better choice for practical processing. BioMed Central 2019-11-27 /pmc/articles/PMC6882060/ /pubmed/31775621 http://dx.doi.org/10.1186/s12859-019-3188-4 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Liu, Yuanjie Lin, Jianhan A general-purpose signal processing algorithm for biological profiles using only first-order derivative information |
title | A general-purpose signal processing algorithm for biological profiles using only first-order derivative information |
title_full | A general-purpose signal processing algorithm for biological profiles using only first-order derivative information |
title_fullStr | A general-purpose signal processing algorithm for biological profiles using only first-order derivative information |
title_full_unstemmed | A general-purpose signal processing algorithm for biological profiles using only first-order derivative information |
title_short | A general-purpose signal processing algorithm for biological profiles using only first-order derivative information |
title_sort | general-purpose signal processing algorithm for biological profiles using only first-order derivative information |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6882060/ https://www.ncbi.nlm.nih.gov/pubmed/31775621 http://dx.doi.org/10.1186/s12859-019-3188-4 |
work_keys_str_mv | AT liuyuanjie ageneralpurposesignalprocessingalgorithmforbiologicalprofilesusingonlyfirstorderderivativeinformation AT linjianhan ageneralpurposesignalprocessingalgorithmforbiologicalprofilesusingonlyfirstorderderivativeinformation AT liuyuanjie generalpurposesignalprocessingalgorithmforbiologicalprofilesusingonlyfirstorderderivativeinformation AT linjianhan generalpurposesignalprocessingalgorithmforbiologicalprofilesusingonlyfirstorderderivativeinformation |