Cargando…

Genomics dataset on unclassified published organism (patent US 7547531)

Nucleotide (DNA) sequence analysis provides important clues regarding the characteristics and taxonomic position of an organism. With the intention that, DNA sequence analysis is very crucial to learn about hierarchical classification of that particular organism. This dataset (patent US 7547531) is...

Descripción completa

Detalles Bibliográficos
Autores principales:	Khan Shawan, Mohammad Mahfuz Ali, Hasan, Md. Ashraful, Hossain, Md. Mozammel, Hasan, Md. Mahmudul, Parvin, Afroza, Akter, Salina, Uddin, Kazi Rasel, Banik, Subrata, Morshed, Mahbubul, Rahman, Md. Nazibur, Rahman, S.M. Badier
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Elsevier 2016
Materias:	Data Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5066183/ https://www.ncbi.nlm.nih.gov/pubmed/27766287 http://dx.doi.org/10.1016/j.dib.2016.09.046

_version_	1782460436363345920
author	Khan Shawan, Mohammad Mahfuz Ali Hasan, Md. Ashraful Hossain, Md. Mozammel Hasan, Md. Mahmudul Parvin, Afroza Akter, Salina Uddin, Kazi Rasel Banik, Subrata Morshed, Mahbubul Rahman, Md. Nazibur Rahman, S.M. Badier
author_facet	Khan Shawan, Mohammad Mahfuz Ali Hasan, Md. Ashraful Hossain, Md. Mozammel Hasan, Md. Mahmudul Parvin, Afroza Akter, Salina Uddin, Kazi Rasel Banik, Subrata Morshed, Mahbubul Rahman, Md. Nazibur Rahman, S.M. Badier
author_sort	Khan Shawan, Mohammad Mahfuz Ali
collection	PubMed
description	Nucleotide (DNA) sequence analysis provides important clues regarding the characteristics and taxonomic position of an organism. With the intention that, DNA sequence analysis is very crucial to learn about hierarchical classification of that particular organism. This dataset (patent US 7547531) is chosen to simplify all the complex raw data buried in undisclosed DNA sequences which help to open doors for new collaborations. In this data, a total of 48 unidentified DNA sequences from patent US 7547531 were selected and their complete sequences were retrieved from NCBI BioSample database. Quick response (QR) code of those DNA sequences was constructed by DNA BarID tool. QR code is useful for the identification and comparison of isolates with other organisms. AT/GC content of the DNA sequences was determined using ENDMEMO GC Content Calculator, which indicates their stability at different temperature. The highest GC content was observed in GP445188 (62.5%) which was followed by GP445198 (61.8%) and GP445189 (59.44%), while lowest was in GP445178 (24.39%). In addition, New England BioLabs (NEB) database was used to identify cleavage code indicating the 5, 3 and blunt end and enzyme code indicating the methylation site of the DNA sequences was also shown. These data will be helpful for the construction of the organisms’ hierarchical classification, determination of their phylogenetic and taxonomic position and revelation of their molecular characteristics.
format	Online Article Text
id	pubmed-5066183
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Elsevier
record_format	MEDLINE/PubMed
spelling	pubmed-50661832016-10-20 Genomics dataset on unclassified published organism (patent US 7547531) Khan Shawan, Mohammad Mahfuz Ali Hasan, Md. Ashraful Hossain, Md. Mozammel Hasan, Md. Mahmudul Parvin, Afroza Akter, Salina Uddin, Kazi Rasel Banik, Subrata Morshed, Mahbubul Rahman, Md. Nazibur Rahman, S.M. Badier Data Brief Data Article Nucleotide (DNA) sequence analysis provides important clues regarding the characteristics and taxonomic position of an organism. With the intention that, DNA sequence analysis is very crucial to learn about hierarchical classification of that particular organism. This dataset (patent US 7547531) is chosen to simplify all the complex raw data buried in undisclosed DNA sequences which help to open doors for new collaborations. In this data, a total of 48 unidentified DNA sequences from patent US 7547531 were selected and their complete sequences were retrieved from NCBI BioSample database. Quick response (QR) code of those DNA sequences was constructed by DNA BarID tool. QR code is useful for the identification and comparison of isolates with other organisms. AT/GC content of the DNA sequences was determined using ENDMEMO GC Content Calculator, which indicates their stability at different temperature. The highest GC content was observed in GP445188 (62.5%) which was followed by GP445198 (61.8%) and GP445189 (59.44%), while lowest was in GP445178 (24.39%). In addition, New England BioLabs (NEB) database was used to identify cleavage code indicating the 5, 3 and blunt end and enzyme code indicating the methylation site of the DNA sequences was also shown. These data will be helpful for the construction of the organisms’ hierarchical classification, determination of their phylogenetic and taxonomic position and revelation of their molecular characteristics. Elsevier 2016-10-05 /pmc/articles/PMC5066183/ /pubmed/27766287 http://dx.doi.org/10.1016/j.dib.2016.09.046 Text en © 2016 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Data Article Khan Shawan, Mohammad Mahfuz Ali Hasan, Md. Ashraful Hossain, Md. Mozammel Hasan, Md. Mahmudul Parvin, Afroza Akter, Salina Uddin, Kazi Rasel Banik, Subrata Morshed, Mahbubul Rahman, Md. Nazibur Rahman, S.M. Badier Genomics dataset on unclassified published organism (patent US 7547531)
title	Genomics dataset on unclassified published organism (patent US 7547531)
title_full	Genomics dataset on unclassified published organism (patent US 7547531)
title_fullStr	Genomics dataset on unclassified published organism (patent US 7547531)
title_full_unstemmed	Genomics dataset on unclassified published organism (patent US 7547531)
title_short	Genomics dataset on unclassified published organism (patent US 7547531)
title_sort	genomics dataset on unclassified published organism (patent us 7547531)
topic	Data Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5066183/ https://www.ncbi.nlm.nih.gov/pubmed/27766287 http://dx.doi.org/10.1016/j.dib.2016.09.046
work_keys_str_mv	AT khanshawanmohammadmahfuzali genomicsdatasetonunclassifiedpublishedorganismpatentus7547531 AT hasanmdashraful genomicsdatasetonunclassifiedpublishedorganismpatentus7547531 AT hossainmdmozammel genomicsdatasetonunclassifiedpublishedorganismpatentus7547531 AT hasanmdmahmudul genomicsdatasetonunclassifiedpublishedorganismpatentus7547531 AT parvinafroza genomicsdatasetonunclassifiedpublishedorganismpatentus7547531 AT aktersalina genomicsdatasetonunclassifiedpublishedorganismpatentus7547531 AT uddinkazirasel genomicsdatasetonunclassifiedpublishedorganismpatentus7547531 AT baniksubrata genomicsdatasetonunclassifiedpublishedorganismpatentus7547531 AT morshedmahbubul genomicsdatasetonunclassifiedpublishedorganismpatentus7547531 AT rahmanmdnazibur genomicsdatasetonunclassifiedpublishedorganismpatentus7547531 AT rahmansmbadier genomicsdatasetonunclassifiedpublishedorganismpatentus7547531

Genomics dataset on unclassified published organism (patent US 7547531)

Ejemplares similares