Cargando…
How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes
Accurate sex identification is crucial for elucidating the biology of a species. In the absence of directly observable sexual characteristics, sex identification of wild fauna can be challenging, if not impossible. Molecular sexing offers a powerful alternative to morphological sexing approaches. He...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9405501/ https://www.ncbi.nlm.nih.gov/pubmed/36035270 http://dx.doi.org/10.1002/ece3.9185 |
_version_ | 1784773894373113856 |
---|---|
author | Cabrera, Andrea A. Rey‐Iglesia, Alba Louis, Marie Skovrind, Mikkel Westbury, Michael V. Lorenzen, Eline D. |
author_facet | Cabrera, Andrea A. Rey‐Iglesia, Alba Louis, Marie Skovrind, Mikkel Westbury, Michael V. Lorenzen, Eline D. |
author_sort | Cabrera, Andrea A. |
collection | PubMed |
description | Accurate sex identification is crucial for elucidating the biology of a species. In the absence of directly observable sexual characteristics, sex identification of wild fauna can be challenging, if not impossible. Molecular sexing offers a powerful alternative to morphological sexing approaches. Here, we present SeXY, a novel sex‐identification pipeline, for very low‐coverage shotgun sequencing data from a single individual. SeXY was designed to utilize low‐effort screening data for sex identification and does not require a conspecific sex‐chromosome assembly as reference. We assess the accuracy of our pipeline to data quantity by downsampling sequencing data from 100,000 to 1000 mapped reads and to reference genome selection by mapping to a variety of reference genomes of various qualities and phylogenetic distance. We show that our method is 100% accurate when mapping to a high‐quality (highly contiguous N50 > 30 Mb) conspecific genome, even down to 1000 mapped reads. For lower‐quality reference assemblies (N50 < 30 Mb), our method is 100% accurate with 50,000 mapped reads, regardless of reference assembly quality or phylogenetic distance. The SeXY pipeline provides several advantages over previously implemented methods; SeXY (i) requires sequencing data from only a single individual, (ii) does not require assembled conspecific sex chromosomes, or even a conspecific reference assembly, (iii) takes into account variation in coverage across the genome, and (iv) is accurate with only 1000 mapped reads in many cases. |
format | Online Article Text |
id | pubmed-9405501 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-94055012022-08-26 How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes Cabrera, Andrea A. Rey‐Iglesia, Alba Louis, Marie Skovrind, Mikkel Westbury, Michael V. Lorenzen, Eline D. Ecol Evol Research Articles Accurate sex identification is crucial for elucidating the biology of a species. In the absence of directly observable sexual characteristics, sex identification of wild fauna can be challenging, if not impossible. Molecular sexing offers a powerful alternative to morphological sexing approaches. Here, we present SeXY, a novel sex‐identification pipeline, for very low‐coverage shotgun sequencing data from a single individual. SeXY was designed to utilize low‐effort screening data for sex identification and does not require a conspecific sex‐chromosome assembly as reference. We assess the accuracy of our pipeline to data quantity by downsampling sequencing data from 100,000 to 1000 mapped reads and to reference genome selection by mapping to a variety of reference genomes of various qualities and phylogenetic distance. We show that our method is 100% accurate when mapping to a high‐quality (highly contiguous N50 > 30 Mb) conspecific genome, even down to 1000 mapped reads. For lower‐quality reference assemblies (N50 < 30 Mb), our method is 100% accurate with 50,000 mapped reads, regardless of reference assembly quality or phylogenetic distance. The SeXY pipeline provides several advantages over previously implemented methods; SeXY (i) requires sequencing data from only a single individual, (ii) does not require assembled conspecific sex chromosomes, or even a conspecific reference assembly, (iii) takes into account variation in coverage across the genome, and (iv) is accurate with only 1000 mapped reads in many cases. John Wiley and Sons Inc. 2022-08-25 /pmc/articles/PMC9405501/ /pubmed/36035270 http://dx.doi.org/10.1002/ece3.9185 Text en © 2022 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Cabrera, Andrea A. Rey‐Iglesia, Alba Louis, Marie Skovrind, Mikkel Westbury, Michael V. Lorenzen, Eline D. How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes |
title | How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes |
title_full | How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes |
title_fullStr | How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes |
title_full_unstemmed | How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes |
title_short | How low can you go? Introducing SeXY: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes |
title_sort | how low can you go? introducing sexy: sex identification from low‐quantity sequencing data despite lacking assembled sex chromosomes |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9405501/ https://www.ncbi.nlm.nih.gov/pubmed/36035270 http://dx.doi.org/10.1002/ece3.9185 |
work_keys_str_mv | AT cabreraandreaa howlowcanyougointroducingsexysexidentificationfromlowquantitysequencingdatadespitelackingassembledsexchromosomes AT reyiglesiaalba howlowcanyougointroducingsexysexidentificationfromlowquantitysequencingdatadespitelackingassembledsexchromosomes AT louismarie howlowcanyougointroducingsexysexidentificationfromlowquantitysequencingdatadespitelackingassembledsexchromosomes AT skovrindmikkel howlowcanyougointroducingsexysexidentificationfromlowquantitysequencingdatadespitelackingassembledsexchromosomes AT westburymichaelv howlowcanyougointroducingsexysexidentificationfromlowquantitysequencingdatadespitelackingassembledsexchromosomes AT lorenzenelined howlowcanyougointroducingsexysexidentificationfromlowquantitysequencingdatadespitelackingassembledsexchromosomes |