Cargando…

Approaches to increase the validity of gene family identification using manual homology search tools

Identifying homologs is an important process in the analysis of genetic patterns underlying traits and evolutionary relationships among species. Analysis of gene families is often used to form and support hypotheses on genetic patterns such as gene presence, absence, or functional divergence which u...

Descripción completa

Detalles Bibliográficos
Autores principales: Nestor, Benjamin J., Bayer, Philipp E., Fernandez, Cassandria G. Tay, Edwards, David, Finnegan, Patrick M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10692271/
https://www.ncbi.nlm.nih.gov/pubmed/37817002
http://dx.doi.org/10.1007/s10709-023-00196-8
_version_ 1785152907295850496
author Nestor, Benjamin J.
Bayer, Philipp E.
Fernandez, Cassandria G. Tay
Edwards, David
Finnegan, Patrick M.
author_facet Nestor, Benjamin J.
Bayer, Philipp E.
Fernandez, Cassandria G. Tay
Edwards, David
Finnegan, Patrick M.
author_sort Nestor, Benjamin J.
collection PubMed
description Identifying homologs is an important process in the analysis of genetic patterns underlying traits and evolutionary relationships among species. Analysis of gene families is often used to form and support hypotheses on genetic patterns such as gene presence, absence, or functional divergence which underlie traits examined in functional studies. These analyses often require precise identification of all members in a targeted gene family. Manual pipelines where homology search and orthology assignment tools are used separately are the most common approach for identifying small gene families where accurate identification of all members is important. The ability to curate sequences between steps in manual pipelines allows for simple and precise identification of all possible gene family members. However, the validity of such manual pipeline analyses is often decreased by inappropriate approaches to homology searches including too relaxed or stringent statistical thresholds, inappropriate query sequences, homology classification based on sequence similarity alone, and low-quality proteome or genome sequences. In this article, we propose several approaches to mitigate these issues and allow for precise identification of gene family members and support for hypotheses linking genetic patterns to functional traits.
format Online
Article
Text
id pubmed-10692271
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-106922712023-12-03 Approaches to increase the validity of gene family identification using manual homology search tools Nestor, Benjamin J. Bayer, Philipp E. Fernandez, Cassandria G. Tay Edwards, David Finnegan, Patrick M. Genetica Review Identifying homologs is an important process in the analysis of genetic patterns underlying traits and evolutionary relationships among species. Analysis of gene families is often used to form and support hypotheses on genetic patterns such as gene presence, absence, or functional divergence which underlie traits examined in functional studies. These analyses often require precise identification of all members in a targeted gene family. Manual pipelines where homology search and orthology assignment tools are used separately are the most common approach for identifying small gene families where accurate identification of all members is important. The ability to curate sequences between steps in manual pipelines allows for simple and precise identification of all possible gene family members. However, the validity of such manual pipeline analyses is often decreased by inappropriate approaches to homology searches including too relaxed or stringent statistical thresholds, inappropriate query sequences, homology classification based on sequence similarity alone, and low-quality proteome or genome sequences. In this article, we propose several approaches to mitigate these issues and allow for precise identification of gene family members and support for hypotheses linking genetic patterns to functional traits. Springer International Publishing 2023-10-10 2023 /pmc/articles/PMC10692271/ /pubmed/37817002 http://dx.doi.org/10.1007/s10709-023-00196-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Review
Nestor, Benjamin J.
Bayer, Philipp E.
Fernandez, Cassandria G. Tay
Edwards, David
Finnegan, Patrick M.
Approaches to increase the validity of gene family identification using manual homology search tools
title Approaches to increase the validity of gene family identification using manual homology search tools
title_full Approaches to increase the validity of gene family identification using manual homology search tools
title_fullStr Approaches to increase the validity of gene family identification using manual homology search tools
title_full_unstemmed Approaches to increase the validity of gene family identification using manual homology search tools
title_short Approaches to increase the validity of gene family identification using manual homology search tools
title_sort approaches to increase the validity of gene family identification using manual homology search tools
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10692271/
https://www.ncbi.nlm.nih.gov/pubmed/37817002
http://dx.doi.org/10.1007/s10709-023-00196-8
work_keys_str_mv AT nestorbenjaminj approachestoincreasethevalidityofgenefamilyidentificationusingmanualhomologysearchtools
AT bayerphilippe approachestoincreasethevalidityofgenefamilyidentificationusingmanualhomologysearchtools
AT fernandezcassandriagtay approachestoincreasethevalidityofgenefamilyidentificationusingmanualhomologysearchtools
AT edwardsdavid approachestoincreasethevalidityofgenefamilyidentificationusingmanualhomologysearchtools
AT finneganpatrickm approachestoincreasethevalidityofgenefamilyidentificationusingmanualhomologysearchtools