Cargando…

How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes

Accurate sex identification is crucial for elucidating the biology of a species. In the absence of directly observable sexual characteristics, sex identification of wild fauna can be challenging, if not impossible. Molecular sexing offers a powerful alternative to morphological sexing approaches. He...

Descripción completa

Detalles Bibliográficos
Autores principales: Cabrera, Andrea A., Rey‐Iglesia, Alba, Louis, Marie, Skovrind, Mikkel, Westbury, Michael V., Lorenzen, Eline D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9405501/
https://www.ncbi.nlm.nih.gov/pubmed/36035270
http://dx.doi.org/10.1002/ece3.9185
_version_ 1784773894373113856
author Cabrera, Andrea A.
Rey‐Iglesia, Alba
Louis, Marie
Skovrind, Mikkel
Westbury, Michael V.
Lorenzen, Eline D.
author_facet Cabrera, Andrea A.
Rey‐Iglesia, Alba
Louis, Marie
Skovrind, Mikkel
Westbury, Michael V.
Lorenzen, Eline D.
author_sort Cabrera, Andrea A.
collection PubMed
description Accurate sex identification is crucial for elucidating the biology of a species. In the absence of directly observable sexual characteristics, sex identification of wild fauna can be challenging, if not impossible. Molecular sexing offers a powerful alternative to morphological sexing approaches. Here, we present SeXY, a novel sex‐identification pipeline, for very low‐coverage shotgun sequencing data from a single individual. SeXY was designed to utilize low‐effort screening data for sex identification and does not require a conspecific sex‐chromosome assembly as reference. We assess the accuracy of our pipeline to data quantity by downsampling sequencing data from 100,000 to 1000 mapped reads and to reference genome selection by mapping to a variety of reference genomes of various qualities and phylogenetic distance. We show that our method is 100% accurate when mapping to a high‐quality (highly contiguous N50 > 30 Mb) conspecific genome, even down to 1000 mapped reads. For lower‐quality reference assemblies (N50 < 30 Mb), our method is 100% accurate with 50,000 mapped reads, regardless of reference assembly quality or phylogenetic distance. The SeXY pipeline provides several advantages over previously implemented methods; SeXY (i) requires sequencing data from only a single individual, (ii) does not require assembled conspecific sex chromosomes, or even a conspecific reference assembly, (iii) takes into account variation in coverage across the genome, and (iv) is accurate with only 1000 mapped reads in many cases.
format Online
Article
Text
id pubmed-9405501
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-94055012022-08-26 How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes Cabrera, Andrea A. Rey‐Iglesia, Alba Louis, Marie Skovrind, Mikkel Westbury, Michael V. Lorenzen, Eline D. Ecol Evol Research Articles Accurate sex identification is crucial for elucidating the biology of a species. In the absence of directly observable sexual characteristics, sex identification of wild fauna can be challenging, if not impossible. Molecular sexing offers a powerful alternative to morphological sexing approaches. Here, we present SeXY, a novel sex‐identification pipeline, for very low‐coverage shotgun sequencing data from a single individual. SeXY was designed to utilize low‐effort screening data for sex identification and does not require a conspecific sex‐chromosome assembly as reference. We assess the accuracy of our pipeline to data quantity by downsampling sequencing data from 100,000 to 1000 mapped reads and to reference genome selection by mapping to a variety of reference genomes of various qualities and phylogenetic distance. We show that our method is 100% accurate when mapping to a high‐quality (highly contiguous N50 > 30 Mb) conspecific genome, even down to 1000 mapped reads. For lower‐quality reference assemblies (N50 < 30 Mb), our method is 100% accurate with 50,000 mapped reads, regardless of reference assembly quality or phylogenetic distance. The SeXY pipeline provides several advantages over previously implemented methods; SeXY (i) requires sequencing data from only a single individual, (ii) does not require assembled conspecific sex chromosomes, or even a conspecific reference assembly, (iii) takes into account variation in coverage across the genome, and (iv) is accurate with only 1000 mapped reads in many cases. John Wiley and Sons Inc. 2022-08-25 /pmc/articles/PMC9405501/ /pubmed/36035270 http://dx.doi.org/10.1002/ece3.9185 Text en © 2022 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Cabrera, Andrea A.
Rey‐Iglesia, Alba
Louis, Marie
Skovrind, Mikkel
Westbury, Michael V.
Lorenzen, Eline D.
How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes
title How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes
title_full How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes
title_fullStr How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes
title_full_unstemmed How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes
title_short How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes
title_sort how low can you go? introducing sexy: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9405501/
https://www.ncbi.nlm.nih.gov/pubmed/36035270
http://dx.doi.org/10.1002/ece3.9185
work_keys_str_mv AT cabreraandreaa howlowcanyougointroducingsexysexidentificationfromlowquantitysequencingdatadespitelackingassembledsexchromosomes
AT reyiglesiaalba howlowcanyougointroducingsexysexidentificationfromlowquantitysequencingdatadespitelackingassembledsexchromosomes
AT louismarie howlowcanyougointroducingsexysexidentificationfromlowquantitysequencingdatadespitelackingassembledsexchromosomes
AT skovrindmikkel howlowcanyougointroducingsexysexidentificationfromlowquantitysequencingdatadespitelackingassembledsexchromosomes
AT westburymichaelv howlowcanyougointroducingsexysexidentificationfromlowquantitysequencingdatadespitelackingassembledsexchromosomes
AT lorenzenelined howlowcanyougointroducingsexysexidentificationfromlowquantitysequencingdatadespitelackingassembledsexchromosomes