Cargando…

TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences

Gene families compose a large proportion of eukaryotic genomes. The rapidly expanding genomic sequence database provides a good opportunity to study gene family evolution and function. However, most gene family identification programs are restricted to searching protein databases where data are ofte...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Yujun, Burnette, James M., Wessler, Susan R.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2699529/
https://www.ncbi.nlm.nih.gov/pubmed/19429695
http://dx.doi.org/10.1093/nar/gkp295
_version_ 1782168505865469952
author Han, Yujun
Burnette, James M.
Wessler, Susan R.
author_facet Han, Yujun
Burnette, James M.
Wessler, Susan R.
author_sort Han, Yujun
collection PubMed
description Gene families compose a large proportion of eukaryotic genomes. The rapidly expanding genomic sequence database provides a good opportunity to study gene family evolution and function. However, most gene family identification programs are restricted to searching protein databases where data are often lagging behind the genomic sequence data. Here, we report a user-friendly web-based pipeline, named TARGeT (Tree Analysis of Related Genes and Transposons), which uses either a DNA or amino acid ‘seed’ query to: (i) automatically identify and retrieve gene family homologs from a genomic database, (ii) characterize gene structure and (iii) perform phylogenetic analysis. Due to its high speed, TARGeT is also able to characterize very large gene families, including transposable elements (TEs). We evaluated TARGeT using well-annotated datasets, including the ascorbate peroxidase gene family of rice, maize and sorghum and several TE families in rice. In all cases, TARGeT rapidly recapitulated the known homologs and predicted new ones. We also demonstrated that TARGeT outperforms similar pipelines and has functionality that is not offered elsewhere.
format Text
id pubmed-2699529
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-26995292009-06-22 TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences Han, Yujun Burnette, James M. Wessler, Susan R. Nucleic Acids Res Methods Online Gene families compose a large proportion of eukaryotic genomes. The rapidly expanding genomic sequence database provides a good opportunity to study gene family evolution and function. However, most gene family identification programs are restricted to searching protein databases where data are often lagging behind the genomic sequence data. Here, we report a user-friendly web-based pipeline, named TARGeT (Tree Analysis of Related Genes and Transposons), which uses either a DNA or amino acid ‘seed’ query to: (i) automatically identify and retrieve gene family homologs from a genomic database, (ii) characterize gene structure and (iii) perform phylogenetic analysis. Due to its high speed, TARGeT is also able to characterize very large gene families, including transposable elements (TEs). We evaluated TARGeT using well-annotated datasets, including the ascorbate peroxidase gene family of rice, maize and sorghum and several TE families in rice. In all cases, TARGeT rapidly recapitulated the known homologs and predicted new ones. We also demonstrated that TARGeT outperforms similar pipelines and has functionality that is not offered elsewhere. Oxford University Press 2009-06 2009-05-08 /pmc/articles/PMC2699529/ /pubmed/19429695 http://dx.doi.org/10.1093/nar/gkp295 Text en © 2009 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Han, Yujun
Burnette, James M.
Wessler, Susan R.
TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences
title TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences
title_full TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences
title_fullStr TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences
title_full_unstemmed TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences
title_short TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences
title_sort target: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2699529/
https://www.ncbi.nlm.nih.gov/pubmed/19429695
http://dx.doi.org/10.1093/nar/gkp295
work_keys_str_mv AT hanyujun targetawebbasedpipelineforretrievingandcharacterizinggeneandtransposableelementfamiliesfromgenomicsequences
AT burnettejamesm targetawebbasedpipelineforretrievingandcharacterizinggeneandtransposableelementfamiliesfromgenomicsequences
AT wesslersusanr targetawebbasedpipelineforretrievingandcharacterizinggeneandtransposableelementfamiliesfromgenomicsequences