Cargando…

Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis

BACKGROUND: A nanopore detector has a nanometer-scale trans-membrane channel across which a potential difference is established, resulting in an ionic current through the channel in the pA-nA range. A distinctive channel current blockade signal is created as individually "captured" DNA mol...

Descripción completa

Detalles Bibliográficos
Autores principales: Landry, Matthew, Winters-Hilt, Stephen
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2099480/
https://www.ncbi.nlm.nih.gov/pubmed/18047711
http://dx.doi.org/10.1186/1471-2105-8-S7-S12
_version_ 1782138314780835840
author Landry, Matthew
Winters-Hilt, Stephen
author_facet Landry, Matthew
Winters-Hilt, Stephen
author_sort Landry, Matthew
collection PubMed
description BACKGROUND: A nanopore detector has a nanometer-scale trans-membrane channel across which a potential difference is established, resulting in an ionic current through the channel in the pA-nA range. A distinctive channel current blockade signal is created as individually "captured" DNA molecules interact with the channel and modulate the channel's ionic current. The nanopore detector is sensitive enough that nearly identical DNA molecules can be classified with very high accuracy using machine learning techniques such as Hidden Markov Models (HMMs) and Support Vector Machines (SVMs). RESULTS: A non-standard implementation of an HMM, emission inversion, is used for improved classification. Additional features are considered for the feature vector employed by the SVM for classification as well: The addition of a single feature representing spike density is shown to notably improve classification results. Another, much larger, feature set expansion was studied (2500 additional features instead of 1), deriving from including all the HMM's transition probabilities. The expanded features can introduce redundant, noisy information (as well as diagnostic information) into the current feature set, and thus degrade classification performance. A hybrid Adaptive Boosting approach was used for feature selection to alleviate this problem. CONCLUSION: The methods shown here, for more informed feature extraction, improve both classification and provide biologists and chemists with tools for obtaining a better understanding of the kinetic properties of molecules of interest.
format Text
id pubmed-2099480
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-20994802007-12-01 Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis Landry, Matthew Winters-Hilt, Stephen BMC Bioinformatics Proceedings BACKGROUND: A nanopore detector has a nanometer-scale trans-membrane channel across which a potential difference is established, resulting in an ionic current through the channel in the pA-nA range. A distinctive channel current blockade signal is created as individually "captured" DNA molecules interact with the channel and modulate the channel's ionic current. The nanopore detector is sensitive enough that nearly identical DNA molecules can be classified with very high accuracy using machine learning techniques such as Hidden Markov Models (HMMs) and Support Vector Machines (SVMs). RESULTS: A non-standard implementation of an HMM, emission inversion, is used for improved classification. Additional features are considered for the feature vector employed by the SVM for classification as well: The addition of a single feature representing spike density is shown to notably improve classification results. Another, much larger, feature set expansion was studied (2500 additional features instead of 1), deriving from including all the HMM's transition probabilities. The expanded features can introduce redundant, noisy information (as well as diagnostic information) into the current feature set, and thus degrade classification performance. A hybrid Adaptive Boosting approach was used for feature selection to alleviate this problem. CONCLUSION: The methods shown here, for more informed feature extraction, improve both classification and provide biologists and chemists with tools for obtaining a better understanding of the kinetic properties of molecules of interest. BioMed Central 2007-11-01 /pmc/articles/PMC2099480/ /pubmed/18047711 http://dx.doi.org/10.1186/1471-2105-8-S7-S12 Text en Copyright © 2007 Landry and Winters-Hilt; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Landry, Matthew
Winters-Hilt, Stephen
Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis
title Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis
title_full Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis
title_fullStr Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis
title_full_unstemmed Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis
title_short Analysis of nanopore detector measurements using Machine-Learning methods, with application to single-molecule kinetic analysis
title_sort analysis of nanopore detector measurements using machine-learning methods, with application to single-molecule kinetic analysis
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2099480/
https://www.ncbi.nlm.nih.gov/pubmed/18047711
http://dx.doi.org/10.1186/1471-2105-8-S7-S12
work_keys_str_mv AT landrymatthew analysisofnanoporedetectormeasurementsusingmachinelearningmethodswithapplicationtosinglemoleculekineticanalysis
AT wintershiltstephen analysisofnanoporedetectormeasurementsusingmachinelearningmethodswithapplicationtosinglemoleculekineticanalysis