Cargando…
SAND, a New Protein Family: From Nucleic Acid to Protein Structure and Function Prediction
As a result of genome, EST and cDNA sequencing projects, there are huge numbers of predicted and/or partially characterised protein sequences compared with a relatively small number of proteins with experimentally determined function and structure. Thus, there is a considerable attention focused on...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2001
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447211/ https://www.ncbi.nlm.nih.gov/pubmed/18628914 http://dx.doi.org/10.1002/cfg.93 |
_version_ | 1782156883710181376 |
---|---|
author | Cottage, Amanda Edwards, Yvonne J. K. Elgar, Greg |
author_facet | Cottage, Amanda Edwards, Yvonne J. K. Elgar, Greg |
author_sort | Cottage, Amanda |
collection | PubMed |
description | As a result of genome, EST and cDNA sequencing projects, there are huge numbers of predicted and/or partially characterised protein sequences compared with a relatively small number of proteins with experimentally determined function and structure. Thus, there is a considerable attention focused on the accurate prediction of gene function and structure from sequence by using bioinformatics. In the course of our analysis of genomic sequence from Fugu rubripes, we identified a novel gene, SAND, with significant sequence identity to hypothetical proteins predicted in Saccharomyces cerevisiae, Schizosaccharomyces pombe, Caenorhabditis elegans, a Drosophila melanogaster gene, and mouse and human cDNAs. Here we identify a further SAND homologue in human and Arabidopsis thaliana by use of standard computational tools. We describe the genomic organisation of SAND in these evolutionarily divergent species and identify sequence homologues from EST database searches confirming the expression of SAND in over 20 different eukaryotes. We confirm the expression of two different SAND paralogues in mammals and determine expression of one SAND in other vertebrates and eukaryotes. Furthermore, we predict structural properties of SAND, and characterise conserved sequence motifs in this protein family. |
format | Text |
id | pubmed-2447211 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2001 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-24472112008-07-14 SAND, a New Protein Family: From Nucleic Acid to Protein Structure and Function Prediction Cottage, Amanda Edwards, Yvonne J. K. Elgar, Greg Comp Funct Genomics Research Article As a result of genome, EST and cDNA sequencing projects, there are huge numbers of predicted and/or partially characterised protein sequences compared with a relatively small number of proteins with experimentally determined function and structure. Thus, there is a considerable attention focused on the accurate prediction of gene function and structure from sequence by using bioinformatics. In the course of our analysis of genomic sequence from Fugu rubripes, we identified a novel gene, SAND, with significant sequence identity to hypothetical proteins predicted in Saccharomyces cerevisiae, Schizosaccharomyces pombe, Caenorhabditis elegans, a Drosophila melanogaster gene, and mouse and human cDNAs. Here we identify a further SAND homologue in human and Arabidopsis thaliana by use of standard computational tools. We describe the genomic organisation of SAND in these evolutionarily divergent species and identify sequence homologues from EST database searches confirming the expression of SAND in over 20 different eukaryotes. We confirm the expression of two different SAND paralogues in mammals and determine expression of one SAND in other vertebrates and eukaryotes. Furthermore, we predict structural properties of SAND, and characterise conserved sequence motifs in this protein family. Hindawi Publishing Corporation 2001-08 /pmc/articles/PMC2447211/ /pubmed/18628914 http://dx.doi.org/10.1002/cfg.93 Text en Copyright © 2001 Hindawi Publishing Corporation. http://creativecommons.org/licenses/by/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Cottage, Amanda Edwards, Yvonne J. K. Elgar, Greg SAND, a New Protein Family: From Nucleic Acid to Protein Structure and Function Prediction |
title | SAND, a New Protein Family: From Nucleic Acid to Protein Structure and Function Prediction |
title_full | SAND, a New Protein Family: From Nucleic Acid to Protein Structure and Function Prediction |
title_fullStr | SAND, a New Protein Family: From Nucleic Acid to Protein Structure and Function Prediction |
title_full_unstemmed | SAND, a New Protein Family: From Nucleic Acid to Protein Structure and Function Prediction |
title_short | SAND, a New Protein Family: From Nucleic Acid to Protein Structure and Function Prediction |
title_sort | sand, a new protein family: from nucleic acid to protein structure and function prediction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447211/ https://www.ncbi.nlm.nih.gov/pubmed/18628914 http://dx.doi.org/10.1002/cfg.93 |
work_keys_str_mv | AT cottageamanda sandanewproteinfamilyfromnucleicacidtoproteinstructureandfunctionprediction AT edwardsyvonnejk sandanewproteinfamilyfromnucleicacidtoproteinstructureandfunctionprediction AT elgargreg sandanewproteinfamilyfromnucleicacidtoproteinstructureandfunctionprediction |