Cargando…
Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis
BACKGROUND: A nanopore detector has a nanometer-scale trans-membrane channel across which a potential difference is established, resulting in an ionic current through the channel in the pA-nA range. A distinctive channel current blockade signal is created as individually "captured" DNA mol...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2099480/ https://www.ncbi.nlm.nih.gov/pubmed/18047711 http://dx.doi.org/10.1186/1471-2105-8-S7-S12 |
_version_ | 1782138314780835840 |
---|---|
author | Landry, Matthew Winters-Hilt, Stephen |
author_facet | Landry, Matthew Winters-Hilt, Stephen |
author_sort | Landry, Matthew |
collection | PubMed |
description | BACKGROUND: A nanopore detector has a nanometer-scale trans-membrane channel across which a potential difference is established, resulting in an ionic current through the channel in the pA-nA range. A distinctive channel current blockade signal is created as individually "captured" DNA molecules interact with the channel and modulate the channel's ionic current. The nanopore detector is sensitive enough that nearly identical DNA molecules can be classified with very high accuracy using machine learning techniques such as Hidden Markov Models (HMMs) and Support Vector Machines (SVMs). RESULTS: A non-standard implementation of an HMM, emission inversion, is used for improved classification. Additional features are considered for the feature vector employed by the SVM for classification as well: The addition of a single feature representing spike density is shown to notably improve classification results. Another, much larger, feature set expansion was studied (2500 additional features instead of 1), deriving from including all the HMM's transition probabilities. The expanded features can introduce redundant, noisy information (as well as diagnostic information) into the current feature set, and thus degrade classification performance. A hybrid Adaptive Boosting approach was used for feature selection to alleviate this problem. CONCLUSION: The methods shown here, for more informed feature extraction, improve both classification and provide biologists and chemists with tools for obtaining a better understanding of the kinetic properties of molecules of interest. |
format | Text |
id | pubmed-2099480 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-20994802007-12-01 Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis Landry, Matthew Winters-Hilt, Stephen BMC Bioinformatics Proceedings BACKGROUND: A nanopore detector has a nanometer-scale trans-membrane channel across which a potential difference is established, resulting in an ionic current through the channel in the pA-nA range. A distinctive channel current blockade signal is created as individually "captured" DNA molecules interact with the channel and modulate the channel's ionic current. The nanopore detector is sensitive enough that nearly identical DNA molecules can be classified with very high accuracy using machine learning techniques such as Hidden Markov Models (HMMs) and Support Vector Machines (SVMs). RESULTS: A non-standard implementation of an HMM, emission inversion, is used for improved classification. Additional features are considered for the feature vector employed by the SVM for classification as well: The addition of a single feature representing spike density is shown to notably improve classification results. Another, much larger, feature set expansion was studied (2500 additional features instead of 1), deriving from including all the HMM's transition probabilities. The expanded features can introduce redundant, noisy information (as well as diagnostic information) into the current feature set, and thus degrade classification performance. A hybrid Adaptive Boosting approach was used for feature selection to alleviate this problem. CONCLUSION: The methods shown here, for more informed feature extraction, improve both classification and provide biologists and chemists with tools for obtaining a better understanding of the kinetic properties of molecules of interest. BioMed Central 2007-11-01 /pmc/articles/PMC2099480/ /pubmed/18047711 http://dx.doi.org/10.1186/1471-2105-8-S7-S12 Text en Copyright © 2007 Landry and Winters-Hilt; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Landry, Matthew Winters-Hilt, Stephen Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis |
title | Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis |
title_full | Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis |
title_fullStr | Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis |
title_full_unstemmed | Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis |
title_short | Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis |
title_sort | analysis of nanopore detector measurements using machine-learning methods, with application to single-molecule kinetic analysis |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2099480/ https://www.ncbi.nlm.nih.gov/pubmed/18047711 http://dx.doi.org/10.1186/1471-2105-8-S7-S12 |
work_keys_str_mv | AT landrymatthew analysisofnanoporedetectormeasurementsusingmachinelearningmethodswithapplicationtosinglemoleculekineticanalysis AT wintershiltstephen analysisofnanoporedetectormeasurementsusingmachinelearningmethodswithapplicationtosinglemoleculekineticanalysis |