Cargando…

Can We Convert Genotype Sequences Into Images for Cases/Controls Classification?

Converting genotype sequences into images offers advantages, such as genotype data visualization, classification, and comparison of genotype sequences. This study converted genotype sequences into images, applied two-dimensional convolutional neural networks for case/control classification, and comp...

Descripción completa

Detalles Bibliográficos
Autores principales: Muneeb, Muhammad, Feng, Samuel F., Henschel, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580854/
https://www.ncbi.nlm.nih.gov/pubmed/36304278
http://dx.doi.org/10.3389/fbinf.2022.914435
_version_ 1784812485053775872
author Muneeb, Muhammad
Feng, Samuel F.
Henschel, Andreas
author_facet Muneeb, Muhammad
Feng, Samuel F.
Henschel, Andreas
author_sort Muneeb, Muhammad
collection PubMed
description Converting genotype sequences into images offers advantages, such as genotype data visualization, classification, and comparison of genotype sequences. This study converted genotype sequences into images, applied two-dimensional convolutional neural networks for case/control classification, and compared the results with the one-dimensional convolutional neural network. Surprisingly, the average accuracy of multiple runs of 2DCNN was 0.86, and that of 1DCNN was 0.89, yielding a difference of 0.03, which suggests that even the 2DCNN algorithm works on genotype sequences. Moreover, the results generated by the 2DCNN exhibited less variation than those generated by the 1DCNN, thereby offering greater stability. The purpose of this study is to draw the research community’s attention to explore encoding schemes for genotype data and machine learning algorithms that can be used on genotype data by changing the representation of the genotype data for case/control classification.
format Online
Article
Text
id pubmed-9580854
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95808542022-10-26 Can We Convert Genotype Sequences Into Images for Cases/Controls Classification? Muneeb, Muhammad Feng, Samuel F. Henschel, Andreas Front Bioinform Bioinformatics Converting genotype sequences into images offers advantages, such as genotype data visualization, classification, and comparison of genotype sequences. This study converted genotype sequences into images, applied two-dimensional convolutional neural networks for case/control classification, and compared the results with the one-dimensional convolutional neural network. Surprisingly, the average accuracy of multiple runs of 2DCNN was 0.86, and that of 1DCNN was 0.89, yielding a difference of 0.03, which suggests that even the 2DCNN algorithm works on genotype sequences. Moreover, the results generated by the 2DCNN exhibited less variation than those generated by the 1DCNN, thereby offering greater stability. The purpose of this study is to draw the research community’s attention to explore encoding schemes for genotype data and machine learning algorithms that can be used on genotype data by changing the representation of the genotype data for case/control classification. Frontiers Media S.A. 2022-06-28 /pmc/articles/PMC9580854/ /pubmed/36304278 http://dx.doi.org/10.3389/fbinf.2022.914435 Text en Copyright © 2022 Muneeb, Feng and Henschel. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioinformatics
Muneeb, Muhammad
Feng, Samuel F.
Henschel, Andreas
Can We Convert Genotype Sequences Into Images for Cases/Controls Classification?
title Can We Convert Genotype Sequences Into Images for Cases/Controls Classification?
title_full Can We Convert Genotype Sequences Into Images for Cases/Controls Classification?
title_fullStr Can We Convert Genotype Sequences Into Images for Cases/Controls Classification?
title_full_unstemmed Can We Convert Genotype Sequences Into Images for Cases/Controls Classification?
title_short Can We Convert Genotype Sequences Into Images for Cases/Controls Classification?
title_sort can we convert genotype sequences into images for cases/controls classification?
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9580854/
https://www.ncbi.nlm.nih.gov/pubmed/36304278
http://dx.doi.org/10.3389/fbinf.2022.914435
work_keys_str_mv AT muneebmuhammad canweconvertgenotypesequencesintoimagesforcasescontrolsclassification
AT fengsamuelf canweconvertgenotypesequencesintoimagesforcasescontrolsclassification
AT henschelandreas canweconvertgenotypesequencesintoimagesforcasescontrolsclassification