Cargando…

The complex set of internal repeats in SpTransformer protein sequences result in multiple but limited alternative alignments

The SpTransformer (SpTrf) gene family encodes a set of proteins that function in the sea urchin immune system. The gene sequences have a series of internal repeats in a mosaic pattern that is characteristic of this family. This mosaic pattern necessitates the insertion of large gaps, which has made...

Descripción completa

Detalles Bibliográficos
Autores principales: Barela Hudgell, Megan A., Smith, L. Courtney
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9623053/
https://www.ncbi.nlm.nih.gov/pubmed/36330505
http://dx.doi.org/10.3389/fimmu.2022.1000177
_version_ 1784821908420689920
author Barela Hudgell, Megan A.
Smith, L. Courtney
author_facet Barela Hudgell, Megan A.
Smith, L. Courtney
author_sort Barela Hudgell, Megan A.
collection PubMed
description The SpTransformer (SpTrf) gene family encodes a set of proteins that function in the sea urchin immune system. The gene sequences have a series of internal repeats in a mosaic pattern that is characteristic of this family. This mosaic pattern necessitates the insertion of large gaps, which has made alignments of the deduced protein sequences computationally difficult such that only manual alignments have been reported previously. Because manual alignments are time consuming for evaluating newly available SpTrf sequences, computational approaches were evaluated for the sequences reported previously. Furthermore, because two different manual alignments of the SpTrf sequences are feasible because of the multiple internal repeats, it is not known whether additional alternative alignments can be identified using different approaches. The bioinformatic program, PRANK, was used because it was designed to align sequences with large gaps and indels. The results from PRANK show that the alignments of the internal repeats are similar to those done manually, suggesting multiple feasible alignments for some regions. GUIDANCE based analysis of the alignments identified regions that were excellent and other regions that failed to align. This suggests that computational approaches have limits for aligning the SpTrf sequences that include multiple repeats and that require inserted gaps. Furthermore, it is unlikely that alternative alignments for the full-length SpTrf sequences will be identified.
format Online
Article
Text
id pubmed-9623053
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-96230532022-11-02 The complex set of internal repeats in SpTransformer protein sequences result in multiple but limited alternative alignments Barela Hudgell, Megan A. Smith, L. Courtney Front Immunol Immunology The SpTransformer (SpTrf) gene family encodes a set of proteins that function in the sea urchin immune system. The gene sequences have a series of internal repeats in a mosaic pattern that is characteristic of this family. This mosaic pattern necessitates the insertion of large gaps, which has made alignments of the deduced protein sequences computationally difficult such that only manual alignments have been reported previously. Because manual alignments are time consuming for evaluating newly available SpTrf sequences, computational approaches were evaluated for the sequences reported previously. Furthermore, because two different manual alignments of the SpTrf sequences are feasible because of the multiple internal repeats, it is not known whether additional alternative alignments can be identified using different approaches. The bioinformatic program, PRANK, was used because it was designed to align sequences with large gaps and indels. The results from PRANK show that the alignments of the internal repeats are similar to those done manually, suggesting multiple feasible alignments for some regions. GUIDANCE based analysis of the alignments identified regions that were excellent and other regions that failed to align. This suggests that computational approaches have limits for aligning the SpTrf sequences that include multiple repeats and that require inserted gaps. Furthermore, it is unlikely that alternative alignments for the full-length SpTrf sequences will be identified. Frontiers Media S.A. 2022-10-18 /pmc/articles/PMC9623053/ /pubmed/36330505 http://dx.doi.org/10.3389/fimmu.2022.1000177 Text en Copyright © 2022 Barela Hudgell and Smith https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Immunology
Barela Hudgell, Megan A.
Smith, L. Courtney
The complex set of internal repeats in SpTransformer protein sequences result in multiple but limited alternative alignments
title The complex set of internal repeats in SpTransformer protein sequences result in multiple but limited alternative alignments
title_full The complex set of internal repeats in SpTransformer protein sequences result in multiple but limited alternative alignments
title_fullStr The complex set of internal repeats in SpTransformer protein sequences result in multiple but limited alternative alignments
title_full_unstemmed The complex set of internal repeats in SpTransformer protein sequences result in multiple but limited alternative alignments
title_short The complex set of internal repeats in SpTransformer protein sequences result in multiple but limited alternative alignments
title_sort complex set of internal repeats in sptransformer protein sequences result in multiple but limited alternative alignments
topic Immunology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9623053/
https://www.ncbi.nlm.nih.gov/pubmed/36330505
http://dx.doi.org/10.3389/fimmu.2022.1000177
work_keys_str_mv AT barelahudgellmegana thecomplexsetofinternalrepeatsinsptransformerproteinsequencesresultinmultiplebutlimitedalternativealignments
AT smithlcourtney thecomplexsetofinternalrepeatsinsptransformerproteinsequencesresultinmultiplebutlimitedalternativealignments
AT barelahudgellmegana complexsetofinternalrepeatsinsptransformerproteinsequencesresultinmultiplebutlimitedalternativealignments
AT smithlcourtney complexsetofinternalrepeatsinsptransformerproteinsequencesresultinmultiplebutlimitedalternativealignments