Cargando…

Whole‐genome sequencing and genome regions of special interest: Lessons from major histocompatibility complex, sex determination, and plant self‐incompatibility

Whole‐genome sequencing of non‐model organisms is now widely accessible and has allowed a range of questions in the field of molecular ecology to be investigated with greater power. However, some genomic regions that are of high biological interest remain problematic for assembly and data‐handling....

Descripción completa

Detalles Bibliográficos
Autores principales: Vekemans, Xavier, Castric, Vincent, Hipperson, Helen, Müller, Niels A., Westerdahl, Helena, Cronk, Quentin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9290700/
https://www.ncbi.nlm.nih.gov/pubmed/34137092
http://dx.doi.org/10.1111/mec.16020
_version_ 1784748969717399552
author Vekemans, Xavier
Castric, Vincent
Hipperson, Helen
Müller, Niels A.
Westerdahl, Helena
Cronk, Quentin
author_facet Vekemans, Xavier
Castric, Vincent
Hipperson, Helen
Müller, Niels A.
Westerdahl, Helena
Cronk, Quentin
author_sort Vekemans, Xavier
collection PubMed
description Whole‐genome sequencing of non‐model organisms is now widely accessible and has allowed a range of questions in the field of molecular ecology to be investigated with greater power. However, some genomic regions that are of high biological interest remain problematic for assembly and data‐handling. Three such regions are the major histocompatibility complex (MHC), sex‐determining regions (SDRs) and the plant self‐incompatibility locus (S‐locus). Using these as examples, we illustrate the challenges of both assembling and resequencing these highly polymorphic regions and how bioinformatic and technological developments are enabling new approaches to their study. Mapping short‐read sequences against multiple alternative references improves genotyping comprehensiveness at the S‐locus thereby contributing to more accurate assessments of allelic frequencies. Long‐read sequencing, producing reads of several tens to hundreds of kilobase pairs in length, facilitates the assembly of such regions as single sequences can span the multiple duplicated gene copies of the MHC region, and sequence through repetitive stretches and translocations in SDRs and S‐locus haplotypes. These advances are adding value to short‐read genome resequencing approaches by allowing, for example, more accurate haplotype phasing across longer regions. Finally, we assessed further technical improvements, such as nanopore adaptive sequencing and bioinformatic tools using pangenomes, which have the potential to further expand our knowledge of a number of genomic regions that remain challenging to study with classical resequencing approaches.
format Online
Article
Text
id pubmed-9290700
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-92907002022-07-20 Whole‐genome sequencing and genome regions of special interest: Lessons from major histocompatibility complex, sex determination, and plant self‐incompatibility Vekemans, Xavier Castric, Vincent Hipperson, Helen Müller, Niels A. Westerdahl, Helena Cronk, Quentin Mol Ecol Methodological Approaches and Advances for Wgs Whole‐genome sequencing of non‐model organisms is now widely accessible and has allowed a range of questions in the field of molecular ecology to be investigated with greater power. However, some genomic regions that are of high biological interest remain problematic for assembly and data‐handling. Three such regions are the major histocompatibility complex (MHC), sex‐determining regions (SDRs) and the plant self‐incompatibility locus (S‐locus). Using these as examples, we illustrate the challenges of both assembling and resequencing these highly polymorphic regions and how bioinformatic and technological developments are enabling new approaches to their study. Mapping short‐read sequences against multiple alternative references improves genotyping comprehensiveness at the S‐locus thereby contributing to more accurate assessments of allelic frequencies. Long‐read sequencing, producing reads of several tens to hundreds of kilobase pairs in length, facilitates the assembly of such regions as single sequences can span the multiple duplicated gene copies of the MHC region, and sequence through repetitive stretches and translocations in SDRs and S‐locus haplotypes. These advances are adding value to short‐read genome resequencing approaches by allowing, for example, more accurate haplotype phasing across longer regions. Finally, we assessed further technical improvements, such as nanopore adaptive sequencing and bioinformatic tools using pangenomes, which have the potential to further expand our knowledge of a number of genomic regions that remain challenging to study with classical resequencing approaches. John Wiley and Sons Inc. 2021-07-01 2021-12 /pmc/articles/PMC9290700/ /pubmed/34137092 http://dx.doi.org/10.1111/mec.16020 Text en © 2021 The Authors. Molecular Ecology published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Methodological Approaches and Advances for Wgs
Vekemans, Xavier
Castric, Vincent
Hipperson, Helen
Müller, Niels A.
Westerdahl, Helena
Cronk, Quentin
Whole‐genome sequencing and genome regions of special interest: Lessons from major histocompatibility complex, sex determination, and plant self‐incompatibility
title Whole‐genome sequencing and genome regions of special interest: Lessons from major histocompatibility complex, sex determination, and plant self‐incompatibility
title_full Whole‐genome sequencing and genome regions of special interest: Lessons from major histocompatibility complex, sex determination, and plant self‐incompatibility
title_fullStr Whole‐genome sequencing and genome regions of special interest: Lessons from major histocompatibility complex, sex determination, and plant self‐incompatibility
title_full_unstemmed Whole‐genome sequencing and genome regions of special interest: Lessons from major histocompatibility complex, sex determination, and plant self‐incompatibility
title_short Whole‐genome sequencing and genome regions of special interest: Lessons from major histocompatibility complex, sex determination, and plant self‐incompatibility
title_sort whole‐genome sequencing and genome regions of special interest: lessons from major histocompatibility complex, sex determination, and plant self‐incompatibility
topic Methodological Approaches and Advances for Wgs
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9290700/
https://www.ncbi.nlm.nih.gov/pubmed/34137092
http://dx.doi.org/10.1111/mec.16020
work_keys_str_mv AT vekemansxavier wholegenomesequencingandgenomeregionsofspecialinterestlessonsfrommajorhistocompatibilitycomplexsexdeterminationandplantselfincompatibility
AT castricvincent wholegenomesequencingandgenomeregionsofspecialinterestlessonsfrommajorhistocompatibilitycomplexsexdeterminationandplantselfincompatibility
AT hippersonhelen wholegenomesequencingandgenomeregionsofspecialinterestlessonsfrommajorhistocompatibilitycomplexsexdeterminationandplantselfincompatibility
AT mullernielsa wholegenomesequencingandgenomeregionsofspecialinterestlessonsfrommajorhistocompatibilitycomplexsexdeterminationandplantselfincompatibility
AT westerdahlhelena wholegenomesequencingandgenomeregionsofspecialinterestlessonsfrommajorhistocompatibilitycomplexsexdeterminationandplantselfincompatibility
AT cronkquentin wholegenomesequencingandgenomeregionsofspecialinterestlessonsfrommajorhistocompatibilitycomplexsexdeterminationandplantselfincompatibility