Cargando…

Prediction of normalized signal strength on DNA sequencing microarrays by n-grams within a neural network model

We have shown previously that a feed-forward, back propagation neural network model based on composite n-grams can predict normalized signal strengths of a microarray based DNA sequencing experiment. The microarray comprises a 4xN set of 25-base single-stranded DNA molecule ('oligos'), spe...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chilaka, Charles, Carr, Steven, Shalaby, Nabil, Banzhaf, Wolfgang
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Biomedical Informatics 2019
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6614125/ https://www.ncbi.nlm.nih.gov/pubmed/31312075 http://dx.doi.org/10.6026/97320630015388

_version_	1783433128865431552
author	Chilaka, Charles Carr, Steven Shalaby, Nabil Banzhaf, Wolfgang
author_facet	Chilaka, Charles Carr, Steven Shalaby, Nabil Banzhaf, Wolfgang
author_sort	Chilaka, Charles
collection	PubMed
description	We have shown previously that a feed-forward, back propagation neural network model based on composite n-grams can predict normalized signal strengths of a microarray based DNA sequencing experiment. The microarray comprises a 4xN set of 25-base single-stranded DNA molecule ('oligos'), specific for each of the four possible bases (A, C, G, or T) for Adenine, Cytosine, Guanine and Thymine respectively at each of N positions in the experimental DNA. Strength of binding between reference oligos and experimental DNA varies according to base complementarity and the strongest signal in any quartet should `call the base` at that position. Variation in base composition of and (or) order within oligos can affect accuracy and (or) confidence of base calls. To evaluate the effect of order, we present oligos as n-gram neural input vectors of degree 3 and measure their performance. Microarray signal intensity data were divided into training, validation and testing sets. Regression values obtained were >99.80% overall with very low mean square errors that transform to high best validation performance values. Pattern recognition results showed high percentage confusion matrix values along the diagonal and receiver operating characteristic curves were clustered in the upper left corner, both indices of good predictive performance. Higher order n-grams are expected to produce even better predictions.
format	Online Article Text
id	pubmed-6614125
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	Biomedical Informatics
record_format	MEDLINE/PubMed
spelling	pubmed-66141252019-07-16 Prediction of normalized signal strength on DNA sequencing microarrays by n-grams within a neural network model Chilaka, Charles Carr, Steven Shalaby, Nabil Banzhaf, Wolfgang Bioinformation Research Article We have shown previously that a feed-forward, back propagation neural network model based on composite n-grams can predict normalized signal strengths of a microarray based DNA sequencing experiment. The microarray comprises a 4xN set of 25-base single-stranded DNA molecule ('oligos'), specific for each of the four possible bases (A, C, G, or T) for Adenine, Cytosine, Guanine and Thymine respectively at each of N positions in the experimental DNA. Strength of binding between reference oligos and experimental DNA varies according to base complementarity and the strongest signal in any quartet should `call the base` at that position. Variation in base composition of and (or) order within oligos can affect accuracy and (or) confidence of base calls. To evaluate the effect of order, we present oligos as n-gram neural input vectors of degree 3 and measure their performance. Microarray signal intensity data were divided into training, validation and testing sets. Regression values obtained were >99.80% overall with very low mean square errors that transform to high best validation performance values. Pattern recognition results showed high percentage confusion matrix values along the diagonal and receiver operating characteristic curves were clustered in the upper left corner, both indices of good predictive performance. Higher order n-grams are expected to produce even better predictions. Biomedical Informatics 2019-05-30 /pmc/articles/PMC6614125/ /pubmed/31312075 http://dx.doi.org/10.6026/97320630015388 Text en © 2019 Biomedical Informatics http://creativecommons.org/licenses/by/3.0/ This is an Open Access article which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. This is distributed under the terms of the Creative Commons Attribution License.
spellingShingle	Research Article Chilaka, Charles Carr, Steven Shalaby, Nabil Banzhaf, Wolfgang Prediction of normalized signal strength on DNA sequencing microarrays by n-grams within a neural network model
title	Prediction of normalized signal strength on DNA sequencing microarrays by n-grams within a neural network model
title_full	Prediction of normalized signal strength on DNA sequencing microarrays by n-grams within a neural network model
title_fullStr	Prediction of normalized signal strength on DNA sequencing microarrays by n-grams within a neural network model
title_full_unstemmed	Prediction of normalized signal strength on DNA sequencing microarrays by n-grams within a neural network model
title_short	Prediction of normalized signal strength on DNA sequencing microarrays by n-grams within a neural network model
title_sort	prediction of normalized signal strength on dna sequencing microarrays by n-grams within a neural network model
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6614125/ https://www.ncbi.nlm.nih.gov/pubmed/31312075 http://dx.doi.org/10.6026/97320630015388
work_keys_str_mv	AT chilakacharles predictionofnormalizedsignalstrengthondnasequencingmicroarraysbyngramswithinaneuralnetworkmodel AT carrsteven predictionofnormalizedsignalstrengthondnasequencingmicroarraysbyngramswithinaneuralnetworkmodel AT shalabynabil predictionofnormalizedsignalstrengthondnasequencingmicroarraysbyngramswithinaneuralnetworkmodel AT banzhafwolfgang predictionofnormalizedsignalstrengthondnasequencingmicroarraysbyngramswithinaneuralnetworkmodel

Prediction of normalized signal strength on DNA sequencing microarrays by n-grams within a neural network model

Ejemplares similares