Cargando…
TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences
Gene families compose a large proportion of eukaryotic genomes. The rapidly expanding genomic sequence database provides a good opportunity to study gene family evolution and function. However, most gene family identification programs are restricted to searching protein databases where data are ofte...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2699529/ https://www.ncbi.nlm.nih.gov/pubmed/19429695 http://dx.doi.org/10.1093/nar/gkp295 |
_version_ | 1782168505865469952 |
---|---|
author | Han, Yujun Burnette, James M. Wessler, Susan R. |
author_facet | Han, Yujun Burnette, James M. Wessler, Susan R. |
author_sort | Han, Yujun |
collection | PubMed |
description | Gene families compose a large proportion of eukaryotic genomes. The rapidly expanding genomic sequence database provides a good opportunity to study gene family evolution and function. However, most gene family identification programs are restricted to searching protein databases where data are often lagging behind the genomic sequence data. Here, we report a user-friendly web-based pipeline, named TARGeT (Tree Analysis of Related Genes and Transposons), which uses either a DNA or amino acid ‘seed’ query to: (i) automatically identify and retrieve gene family homologs from a genomic database, (ii) characterize gene structure and (iii) perform phylogenetic analysis. Due to its high speed, TARGeT is also able to characterize very large gene families, including transposable elements (TEs). We evaluated TARGeT using well-annotated datasets, including the ascorbate peroxidase gene family of rice, maize and sorghum and several TE families in rice. In all cases, TARGeT rapidly recapitulated the known homologs and predicted new ones. We also demonstrated that TARGeT outperforms similar pipelines and has functionality that is not offered elsewhere. |
format | Text |
id | pubmed-2699529 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-26995292009-06-22 TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences Han, Yujun Burnette, James M. Wessler, Susan R. Nucleic Acids Res Methods Online Gene families compose a large proportion of eukaryotic genomes. The rapidly expanding genomic sequence database provides a good opportunity to study gene family evolution and function. However, most gene family identification programs are restricted to searching protein databases where data are often lagging behind the genomic sequence data. Here, we report a user-friendly web-based pipeline, named TARGeT (Tree Analysis of Related Genes and Transposons), which uses either a DNA or amino acid ‘seed’ query to: (i) automatically identify and retrieve gene family homologs from a genomic database, (ii) characterize gene structure and (iii) perform phylogenetic analysis. Due to its high speed, TARGeT is also able to characterize very large gene families, including transposable elements (TEs). We evaluated TARGeT using well-annotated datasets, including the ascorbate peroxidase gene family of rice, maize and sorghum and several TE families in rice. In all cases, TARGeT rapidly recapitulated the known homologs and predicted new ones. We also demonstrated that TARGeT outperforms similar pipelines and has functionality that is not offered elsewhere. Oxford University Press 2009-06 2009-05-08 /pmc/articles/PMC2699529/ /pubmed/19429695 http://dx.doi.org/10.1093/nar/gkp295 Text en © 2009 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Han, Yujun Burnette, James M. Wessler, Susan R. TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences |
title | TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences |
title_full | TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences |
title_fullStr | TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences |
title_full_unstemmed | TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences |
title_short | TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences |
title_sort | target: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequences |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2699529/ https://www.ncbi.nlm.nih.gov/pubmed/19429695 http://dx.doi.org/10.1093/nar/gkp295 |
work_keys_str_mv | AT hanyujun targetawebbasedpipelineforretrievingandcharacterizinggeneandtransposableelementfamiliesfromgenomicsequences AT burnettejamesm targetawebbasedpipelineforretrievingandcharacterizinggeneandtransposableelementfamiliesfromgenomicsequences AT wesslersusanr targetawebbasedpipelineforretrievingandcharacterizinggeneandtransposableelementfamiliesfromgenomicsequences |