Cargando…
Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases
BACKGROUND: Manually finding subtle yet statistically significant links to distantly related homologues becomes practically impossible for very populated protein families due to the sheer number of similarity searches to be invoked and analyzed. The unclear evolutionary relationship between classica...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1435942/ https://www.ncbi.nlm.nih.gov/pubmed/16551354 http://dx.doi.org/10.1186/1471-2105-7-164 |
_version_ | 1782127300602494976 |
---|---|
author | Schneider, Georg Neuberger, Georg Wildpaner, Michael Tian, Sun Berezovsky, Igor Eisenhaber, Frank |
author_facet | Schneider, Georg Neuberger, Georg Wildpaner, Michael Tian, Sun Berezovsky, Igor Eisenhaber, Frank |
author_sort | Schneider, Georg |
collection | PubMed |
description | BACKGROUND: Manually finding subtle yet statistically significant links to distantly related homologues becomes practically impossible for very populated protein families due to the sheer number of similarity searches to be invoked and analyzed. The unclear evolutionary relationship between classical mammalian lipases and the recently discovered human adipose triglyceride lipase (ATGL; a patatin family member) is an exemplary case for such a problem. RESULTS: We describe an unsupervised, sensitive sequence segment collection heuristic suitable for assembling very large protein families. It is based on fan-like expanding, iterative database searches. To prevent inclusion of unrelated hits, additional criteria are introduced: minimal alignment length and overlap with starting sequence segments, finding starting sequences in reciprocal searches, automated filtering for compositional bias and repetitive patterns. This heuristic was implemented as FAMILYSEARCHER in the ANNIE sequence analysis environment and applied to search for protein links between the classical lipase family and the patatin-like group. CONCLUSION: The FAMILYSEARCHER is an efficient tool for tracing distant evolutionary relationships involving large protein families. Although classical lipases and ATGL have no obvious sequence similarity and differ with regard to fold and catalytic mechanism, homology links detected with FAMILYSEARCHER show that they are evolutionarily related. The conserved sequence parts can be narrowed down to an ancestral core module consisting of three β-strands, one α-helix and a turn containing the typical nucleophilic serine. Moreover, this ancestral module also appears in numerous enzymes with various substrate specificities, but that critically rely on nucleophilic attack mechanisms. |
format | Text |
id | pubmed-1435942 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-14359422006-04-14 Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases Schneider, Georg Neuberger, Georg Wildpaner, Michael Tian, Sun Berezovsky, Igor Eisenhaber, Frank BMC Bioinformatics Methodology Article BACKGROUND: Manually finding subtle yet statistically significant links to distantly related homologues becomes practically impossible for very populated protein families due to the sheer number of similarity searches to be invoked and analyzed. The unclear evolutionary relationship between classical mammalian lipases and the recently discovered human adipose triglyceride lipase (ATGL; a patatin family member) is an exemplary case for such a problem. RESULTS: We describe an unsupervised, sensitive sequence segment collection heuristic suitable for assembling very large protein families. It is based on fan-like expanding, iterative database searches. To prevent inclusion of unrelated hits, additional criteria are introduced: minimal alignment length and overlap with starting sequence segments, finding starting sequences in reciprocal searches, automated filtering for compositional bias and repetitive patterns. This heuristic was implemented as FAMILYSEARCHER in the ANNIE sequence analysis environment and applied to search for protein links between the classical lipase family and the patatin-like group. CONCLUSION: The FAMILYSEARCHER is an efficient tool for tracing distant evolutionary relationships involving large protein families. Although classical lipases and ATGL have no obvious sequence similarity and differ with regard to fold and catalytic mechanism, homology links detected with FAMILYSEARCHER show that they are evolutionarily related. The conserved sequence parts can be narrowed down to an ancestral core module consisting of three β-strands, one α-helix and a turn containing the typical nucleophilic serine. Moreover, this ancestral module also appears in numerous enzymes with various substrate specificities, but that critically rely on nucleophilic attack mechanisms. BioMed Central 2006-03-21 /pmc/articles/PMC1435942/ /pubmed/16551354 http://dx.doi.org/10.1186/1471-2105-7-164 Text en Copyright © 2006 Schneider et al; licensee BioMed Central Ltd. |
spellingShingle | Methodology Article Schneider, Georg Neuberger, Georg Wildpaner, Michael Tian, Sun Berezovsky, Igor Eisenhaber, Frank Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases |
title | Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases |
title_full | Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases |
title_fullStr | Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases |
title_full_unstemmed | Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases |
title_short | Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases |
title_sort | application of a sensitive collection heuristic for very large protein families: evolutionary relationship between adipose triglyceride lipase (atgl) and classic mammalian lipases |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1435942/ https://www.ncbi.nlm.nih.gov/pubmed/16551354 http://dx.doi.org/10.1186/1471-2105-7-164 |
work_keys_str_mv | AT schneidergeorg applicationofasensitivecollectionheuristicforverylargeproteinfamiliesevolutionaryrelationshipbetweenadiposetriglyceridelipaseatglandclassicmammalianlipases AT neubergergeorg applicationofasensitivecollectionheuristicforverylargeproteinfamiliesevolutionaryrelationshipbetweenadiposetriglyceridelipaseatglandclassicmammalianlipases AT wildpanermichael applicationofasensitivecollectionheuristicforverylargeproteinfamiliesevolutionaryrelationshipbetweenadiposetriglyceridelipaseatglandclassicmammalianlipases AT tiansun applicationofasensitivecollectionheuristicforverylargeproteinfamiliesevolutionaryrelationshipbetweenadiposetriglyceridelipaseatglandclassicmammalianlipases AT berezovskyigor applicationofasensitivecollectionheuristicforverylargeproteinfamiliesevolutionaryrelationshipbetweenadiposetriglyceridelipaseatglandclassicmammalianlipases AT eisenhaberfrank applicationofasensitivecollectionheuristicforverylargeproteinfamiliesevolutionaryrelationshipbetweenadiposetriglyceridelipaseatglandclassicmammalianlipases |