Cargando…

Homoeolog Inference Methods Requiring Bidirectional Best Hits or Synteny Miss Many Pairs

Homoeologs are pairs of genes or chromosomes in the same species that originated by speciation and were brought back together in the same genome by allopolyploidization. Bioinformatic methods for accurate homoeology inference are crucial for studying the evolutionary consequences of polyploidization...

Descripción completa

Detalles Bibliográficos
Autores principales: Glover, Natasha, Sheppard, Shaoline, Dessimoz, Christophe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8214411/
https://www.ncbi.nlm.nih.gov/pubmed/33871639
http://dx.doi.org/10.1093/gbe/evab077
_version_ 1783710058901667840
author Glover, Natasha
Sheppard, Shaoline
Dessimoz, Christophe
author_facet Glover, Natasha
Sheppard, Shaoline
Dessimoz, Christophe
author_sort Glover, Natasha
collection PubMed
description Homoeologs are pairs of genes or chromosomes in the same species that originated by speciation and were brought back together in the same genome by allopolyploidization. Bioinformatic methods for accurate homoeology inference are crucial for studying the evolutionary consequences of polyploidization, and homoeology is typically inferred on the basis of bidirectional best hit (BBH) and/or positional conservation (synteny). However, these methods neglect the fact that genes can duplicate and move, both prior to and after the allopolyploidization event. These duplications and movements can result in many-to-many and/or nonsyntenic homoeologs—which thus remain undetected and unstudied. Here, using the allotetraploid upland cotton (Gossypium hirsutum) as a case study, we show that conventional approaches indeed miss a substantial proportion of homoeologs. Additionally, we found that many of the missed pairs of homoeologs are broadly and highly expressed. A gene ontology analysis revealed a high proportion of the nonsyntenic and non-BBH homoeologs to be involved in protein translation and are likely to contribute to the functional repertoire of cotton. Thus, from an evolutionary and functional genomics standpoint, choosing a homoeolog inference method which does not solely rely on 1:1 relationship cardinality or synteny is crucial for not missing these potentially important homoeolog pairs.
format Online
Article
Text
id pubmed-8214411
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-82144112021-06-21 Homoeolog Inference Methods Requiring Bidirectional Best Hits or Synteny Miss Many Pairs Glover, Natasha Sheppard, Shaoline Dessimoz, Christophe Genome Biol Evol Research Article Homoeologs are pairs of genes or chromosomes in the same species that originated by speciation and were brought back together in the same genome by allopolyploidization. Bioinformatic methods for accurate homoeology inference are crucial for studying the evolutionary consequences of polyploidization, and homoeology is typically inferred on the basis of bidirectional best hit (BBH) and/or positional conservation (synteny). However, these methods neglect the fact that genes can duplicate and move, both prior to and after the allopolyploidization event. These duplications and movements can result in many-to-many and/or nonsyntenic homoeologs—which thus remain undetected and unstudied. Here, using the allotetraploid upland cotton (Gossypium hirsutum) as a case study, we show that conventional approaches indeed miss a substantial proportion of homoeologs. Additionally, we found that many of the missed pairs of homoeologs are broadly and highly expressed. A gene ontology analysis revealed a high proportion of the nonsyntenic and non-BBH homoeologs to be involved in protein translation and are likely to contribute to the functional repertoire of cotton. Thus, from an evolutionary and functional genomics standpoint, choosing a homoeolog inference method which does not solely rely on 1:1 relationship cardinality or synteny is crucial for not missing these potentially important homoeolog pairs. Oxford University Press 2021-04-19 /pmc/articles/PMC8214411/ /pubmed/33871639 http://dx.doi.org/10.1093/gbe/evab077 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Glover, Natasha
Sheppard, Shaoline
Dessimoz, Christophe
Homoeolog Inference Methods Requiring Bidirectional Best Hits or Synteny Miss Many Pairs
title Homoeolog Inference Methods Requiring Bidirectional Best Hits or Synteny Miss Many Pairs
title_full Homoeolog Inference Methods Requiring Bidirectional Best Hits or Synteny Miss Many Pairs
title_fullStr Homoeolog Inference Methods Requiring Bidirectional Best Hits or Synteny Miss Many Pairs
title_full_unstemmed Homoeolog Inference Methods Requiring Bidirectional Best Hits or Synteny Miss Many Pairs
title_short Homoeolog Inference Methods Requiring Bidirectional Best Hits or Synteny Miss Many Pairs
title_sort homoeolog inference methods requiring bidirectional best hits or synteny miss many pairs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8214411/
https://www.ncbi.nlm.nih.gov/pubmed/33871639
http://dx.doi.org/10.1093/gbe/evab077
work_keys_str_mv AT glovernatasha homoeologinferencemethodsrequiringbidirectionalbesthitsorsyntenymissmanypairs
AT sheppardshaoline homoeologinferencemethodsrequiringbidirectionalbesthitsorsyntenymissmanypairs
AT dessimozchristophe homoeologinferencemethodsrequiringbidirectionalbesthitsorsyntenymissmanypairs