Cargando…

Curated BLAST for Genomes

Curated BLAST for Genomes finds candidate genes for a process or an enzymatic activity within a genome of interest. In contrast to annotation tools, which usually predict a single activity for each protein, Curated BLAST asks if any of the proteins in the genome are similar to characterized proteins...

Descripción completa

Detalles Bibliográficos
Autores principales: Price, Morgan N., Arkin, Adam P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6435814/
https://www.ncbi.nlm.nih.gov/pubmed/30944879
http://dx.doi.org/10.1128/mSystems.00072-19
_version_ 1783406714269204480
author Price, Morgan N.
Arkin, Adam P.
author_facet Price, Morgan N.
Arkin, Adam P.
author_sort Price, Morgan N.
collection PubMed
description Curated BLAST for Genomes finds candidate genes for a process or an enzymatic activity within a genome of interest. In contrast to annotation tools, which usually predict a single activity for each protein, Curated BLAST asks if any of the proteins in the genome are similar to characterized proteins that are relevant. Given a query such as an enzyme’s name or an EC number, Curated BLAST searches the curated descriptions of over 100,000 characterized proteins, and it compares the relevant characterized proteins to the predicted proteins in the genome of interest. In case of errors in the gene models, Curated BLAST also searches the six-frame translation of the genome. Curated BLAST is available at http://papers.genomics.lbl.gov/curated. IMPORTANCE Given a microbe’s genome sequence, we often want to predict what capabilities the organism has, such as which nutrients it requires or which energy sources it can use. Or, we know the organism has a capability and we want to find the genes involved. Scientists often use automated gene annotations to find relevant genes, but automated annotations are often vague or incorrect. Curated BLAST finds candidate genes for a capability without relying on automated annotations. First, Curated BLAST finds proteins (usually from other organisms) whose functions have been studied experimentally and whose curated descriptions match a query. Then, it searches the genome of interest for similar proteins and returns a list of candidates. Curated BLAST is fast and often finds relevant genes that are missed by automated annotation.
format Online
Article
Text
id pubmed-6435814
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-64358142019-04-03 Curated BLAST for Genomes Price, Morgan N. Arkin, Adam P. mSystems Methods and Protocols Curated BLAST for Genomes finds candidate genes for a process or an enzymatic activity within a genome of interest. In contrast to annotation tools, which usually predict a single activity for each protein, Curated BLAST asks if any of the proteins in the genome are similar to characterized proteins that are relevant. Given a query such as an enzyme’s name or an EC number, Curated BLAST searches the curated descriptions of over 100,000 characterized proteins, and it compares the relevant characterized proteins to the predicted proteins in the genome of interest. In case of errors in the gene models, Curated BLAST also searches the six-frame translation of the genome. Curated BLAST is available at http://papers.genomics.lbl.gov/curated. IMPORTANCE Given a microbe’s genome sequence, we often want to predict what capabilities the organism has, such as which nutrients it requires or which energy sources it can use. Or, we know the organism has a capability and we want to find the genes involved. Scientists often use automated gene annotations to find relevant genes, but automated annotations are often vague or incorrect. Curated BLAST finds candidate genes for a capability without relying on automated annotations. First, Curated BLAST finds proteins (usually from other organisms) whose functions have been studied experimentally and whose curated descriptions match a query. Then, it searches the genome of interest for similar proteins and returns a list of candidates. Curated BLAST is fast and often finds relevant genes that are missed by automated annotation. American Society for Microbiology 2019-03-26 /pmc/articles/PMC6435814/ /pubmed/30944879 http://dx.doi.org/10.1128/mSystems.00072-19 Text en Copyright © 2019 Price and Arkin. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Methods and Protocols
Price, Morgan N.
Arkin, Adam P.
Curated BLAST for Genomes
title Curated BLAST for Genomes
title_full Curated BLAST for Genomes
title_fullStr Curated BLAST for Genomes
title_full_unstemmed Curated BLAST for Genomes
title_short Curated BLAST for Genomes
title_sort curated blast for genomes
topic Methods and Protocols
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6435814/
https://www.ncbi.nlm.nih.gov/pubmed/30944879
http://dx.doi.org/10.1128/mSystems.00072-19
work_keys_str_mv AT pricemorgann curatedblastforgenomes
AT arkinadamp curatedblastforgenomes