
Googling DNA sequences on the World Wide Web

BACKGROUND: New web-based technologies provide an excellent opportunity for sharing and accessing information and for using the web as a platform for interaction and collaboration. Although several specialized tools are available for analyzing DNA sequence information, conventional web-based tools have not...
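
The full abstract in this record describes an alignment-independent search that divides both the barcode library and the query sequence into words and hands the actual lookup to a conventional text search tool. The sketch below illustrates that general idea only. It is not the authors' implementation: the word length, the use of overlapping words, the in-memory inverted index standing in for a desktop search engine, and every function name are assumptions made for illustration.

```python
from collections import Counter, defaultdict

# Assumed word size; the record does not state the value the authors used.
WORD_LENGTH = 8


def split_into_words(sequence, word_length=WORD_LENGTH):
    """Return the overlapping fixed-length words of a DNA sequence."""
    seq = sequence.upper()
    return [seq[i:i + word_length] for i in range(len(seq) - word_length + 1)]


def build_index(library, word_length=WORD_LENGTH):
    """Map each word to the set of library entries that contain it.

    This plain dictionary is a hypothetical stand-in for the text index
    that a conventional search tool would build over the word lists.
    """
    index = defaultdict(set)
    for name, sequence in library.items():
        for word in split_into_words(sequence, word_length):
            index[word].add(name)
    return index


def search(index, query, word_length=WORD_LENGTH):
    """Rank library entries by how many distinct query words they contain."""
    hits = Counter()
    for word in set(split_into_words(query, word_length)):
        for name in index.get(word, ()):
            hits[name] += 1
    return hits.most_common()
```

To try it, pass `build_index` a dictionary that maps entry names to sequences, then call `search` with a query fragment; entries sharing the most words with the query come first. Counting shared words is one simple scoring choice among many and is used here only because it needs no alignment step.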


Bibliographic Details
Main Authors: Hajibabaei, Mehrdad; Singer, Gregory AC
Format: Text
Language: English
Published: BioMed Central 2009
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2775150/
https://www.ncbi.nlm.nih.gov/pubmed/19900300
http://dx.doi.org/10.1186/1471-2105-10-S14-S4
_version_ 1782173991982596096
author Hajibabaei, Mehrdad
Singer, Gregory AC
author_facet Hajibabaei, Mehrdad
Singer, Gregory AC
author_sort Hajibabaei, Mehrdad
collection PubMed
description BACKGROUND: New web-based technologies provide an excellent opportunity for sharing and accessing information and for using the web as a platform for interaction and collaboration. Although several specialized tools are available for analyzing DNA sequence information, conventional web-based tools have not been utilized for bioinformatics applications. We have developed a novel algorithm and implemented it for searching species-specific genomic sequences, DNA barcodes, by using popular web-based methods such as Google. RESULTS: We developed an alignment-independent, character-based algorithm that divides a sequence library (DNA barcodes) and the query sequence into words. The actual search is conducted by conventional search tools such as the freely available Google Desktop Search. We implemented our algorithm in two exemplar packages. We developed pre- and post-processing software to provide customized input and output services, respectively. Our analysis of all publicly available DNA barcode sequences shows high accuracy as well as rapid results. CONCLUSION: Our method makes use of conventional web-based technologies for specialized genetic data. It provides a robust and efficient solution for sequence search on the web. The integration of our search method for large-scale sequence libraries such as DNA barcodes provides an excellent web-based tool for accessing this information and linking it to other available categories of information on the web.
format Text
id pubmed-2775150
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27751502009-11-10 Googling DNA sequences on the World Wide Web Hajibabaei, Mehrdad Singer, Gregory AC BMC Bioinformatics Research BACKGROUND: New web-based technologies provide an excellent opportunity for sharing and accessing information and for using the web as a platform for interaction and collaboration. Although several specialized tools are available for analyzing DNA sequence information, conventional web-based tools have not been utilized for bioinformatics applications. We have developed a novel algorithm and implemented it for searching species-specific genomic sequences, DNA barcodes, by using popular web-based methods such as Google. RESULTS: We developed an alignment-independent, character-based algorithm that divides a sequence library (DNA barcodes) and the query sequence into words. The actual search is conducted by conventional search tools such as the freely available Google Desktop Search. We implemented our algorithm in two exemplar packages. We developed pre- and post-processing software to provide customized input and output services, respectively. Our analysis of all publicly available DNA barcode sequences shows high accuracy as well as rapid results. CONCLUSION: Our method makes use of conventional web-based technologies for specialized genetic data. It provides a robust and efficient solution for sequence search on the web. The integration of our search method for large-scale sequence libraries such as DNA barcodes provides an excellent web-based tool for accessing this information and linking it to other available categories of information on the web. BioMed Central 2009-11-10 /pmc/articles/PMC2775150/ /pubmed/19900300 http://dx.doi.org/10.1186/1471-2105-10-S14-S4 Text en Copyright © 2009 Hajibabaei and Singer; licensee BioMed Central Ltd.
http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Hajibabaei, Mehrdad
Singer, Gregory AC
Googling DNA sequences on the World Wide Web
title Googling DNA sequences on the World Wide Web
title_full Googling DNA sequences on the World Wide Web
title_fullStr Googling DNA sequences on the World Wide Web
title_full_unstemmed Googling DNA sequences on the World Wide Web
title_short Googling DNA sequences on the World Wide Web
title_sort googling dna sequences on the world wide web
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2775150/
https://www.ncbi.nlm.nih.gov/pubmed/19900300
http://dx.doi.org/10.1186/1471-2105-10-S14-S4
work_keys_str_mv AT hajibabaeimehrdad googlingdnasequencesontheworldwideweb
AT singergregoryac googlingdnasequencesontheworldwideweb