Cargando…

Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases

BACKGROUND: Manually finding subtle yet statistically significant links to distantly related homologues becomes practically impossible for very populated protein families due to the sheer number of similarity searches to be invoked and analyzed. The unclear evolutionary relationship between classica...

Descripción completa

Detalles Bibliográficos
Autores principales: Schneider, Georg, Neuberger, Georg, Wildpaner, Michael, Tian, Sun, Berezovsky, Igor, Eisenhaber, Frank
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1435942/
https://www.ncbi.nlm.nih.gov/pubmed/16551354
http://dx.doi.org/10.1186/1471-2105-7-164
_version_ 1782127300602494976
author Schneider, Georg
Neuberger, Georg
Wildpaner, Michael
Tian, Sun
Berezovsky, Igor
Eisenhaber, Frank
author_facet Schneider, Georg
Neuberger, Georg
Wildpaner, Michael
Tian, Sun
Berezovsky, Igor
Eisenhaber, Frank
author_sort Schneider, Georg
collection PubMed
description BACKGROUND: Manually finding subtle yet statistically significant links to distantly related homologues becomes practically impossible for very populated protein families due to the sheer number of similarity searches to be invoked and analyzed. The unclear evolutionary relationship between classical mammalian lipases and the recently discovered human adipose triglyceride lipase (ATGL; a patatin family member) is an exemplary case for such a problem. RESULTS: We describe an unsupervised, sensitive sequence segment collection heuristic suitable for assembling very large protein families. It is based on fan-like expanding, iterative database searches. To prevent inclusion of unrelated hits, additional criteria are introduced: minimal alignment length and overlap with starting sequence segments, finding starting sequences in reciprocal searches, automated filtering for compositional bias and repetitive patterns. This heuristic was implemented as FAMILYSEARCHER in the ANNIE sequence analysis environment and applied to search for protein links between the classical lipase family and the patatin-like group. CONCLUSION: The FAMILYSEARCHER is an efficient tool for tracing distant evolutionary relationships involving large protein families. Although classical lipases and ATGL have no obvious sequence similarity and differ with regard to fold and catalytic mechanism, homology links detected with FAMILYSEARCHER show that they are evolutionarily related. The conserved sequence parts can be narrowed down to an ancestral core module consisting of three β-strands, one α-helix and a turn containing the typical nucleophilic serine. Moreover, this ancestral module also appears in numerous enzymes with various substrate specificities, but that critically rely on nucleophilic attack mechanisms.
format Text
id pubmed-1435942
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-14359422006-04-14 Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases Schneider, Georg Neuberger, Georg Wildpaner, Michael Tian, Sun Berezovsky, Igor Eisenhaber, Frank BMC Bioinformatics Methodology Article BACKGROUND: Manually finding subtle yet statistically significant links to distantly related homologues becomes practically impossible for very populated protein families due to the sheer number of similarity searches to be invoked and analyzed. The unclear evolutionary relationship between classical mammalian lipases and the recently discovered human adipose triglyceride lipase (ATGL; a patatin family member) is an exemplary case for such a problem. RESULTS: We describe an unsupervised, sensitive sequence segment collection heuristic suitable for assembling very large protein families. It is based on fan-like expanding, iterative database searches. To prevent inclusion of unrelated hits, additional criteria are introduced: minimal alignment length and overlap with starting sequence segments, finding starting sequences in reciprocal searches, automated filtering for compositional bias and repetitive patterns. This heuristic was implemented as FAMILYSEARCHER in the ANNIE sequence analysis environment and applied to search for protein links between the classical lipase family and the patatin-like group. CONCLUSION: The FAMILYSEARCHER is an efficient tool for tracing distant evolutionary relationships involving large protein families. Although classical lipases and ATGL have no obvious sequence similarity and differ with regard to fold and catalytic mechanism, homology links detected with FAMILYSEARCHER show that they are evolutionarily related. The conserved sequence parts can be narrowed down to an ancestral core module consisting of three β-strands, one α-helix and a turn containing the typical nucleophilic serine. Moreover, this ancestral module also appears in numerous enzymes with various substrate specificities, but that critically rely on nucleophilic attack mechanisms. BioMed Central 2006-03-21 /pmc/articles/PMC1435942/ /pubmed/16551354 http://dx.doi.org/10.1186/1471-2105-7-164 Text en Copyright © 2006 Schneider et al; licensee BioMed Central Ltd.
spellingShingle Methodology Article
Schneider, Georg
Neuberger, Georg
Wildpaner, Michael
Tian, Sun
Berezovsky, Igor
Eisenhaber, Frank
Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases
title Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases
title_full Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases
title_fullStr Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases
title_full_unstemmed Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases
title_short Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases
title_sort application of a sensitive collection heuristic for very large protein families: evolutionary relationship between adipose triglyceride lipase (atgl) and classic mammalian lipases
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1435942/
https://www.ncbi.nlm.nih.gov/pubmed/16551354
http://dx.doi.org/10.1186/1471-2105-7-164
work_keys_str_mv AT schneidergeorg applicationofasensitivecollectionheuristicforverylargeproteinfamiliesevolutionaryrelationshipbetweenadiposetriglyceridelipaseatglandclassicmammalianlipases
AT neubergergeorg applicationofasensitivecollectionheuristicforverylargeproteinfamiliesevolutionaryrelationshipbetweenadiposetriglyceridelipaseatglandclassicmammalianlipases
AT wildpanermichael applicationofasensitivecollectionheuristicforverylargeproteinfamiliesevolutionaryrelationshipbetweenadiposetriglyceridelipaseatglandclassicmammalianlipases
AT tiansun applicationofasensitivecollectionheuristicforverylargeproteinfamiliesevolutionaryrelationshipbetweenadiposetriglyceridelipaseatglandclassicmammalianlipases
AT berezovskyigor applicationofasensitivecollectionheuristicforverylargeproteinfamiliesevolutionaryrelationshipbetweenadiposetriglyceridelipaseatglandclassicmammalianlipases
AT eisenhaberfrank applicationofasensitivecollectionheuristicforverylargeproteinfamiliesevolutionaryrelationshipbetweenadiposetriglyceridelipaseatglandclassicmammalianlipases