Cargando…

Efficient identification of Y chromosome sequences in the human and Drosophila genomes

Notwithstanding their biological importance, Y chromosomes remain poorly known in most species. A major obstacle to their study is the identification of Y chromosome sequences; due to its high content of repetitive DNA, in most genome projects, the Y chromosome sequence is fragmented into a large nu...

Descripción completa

Detalles Bibliográficos
Autores principales: Carvalho, Antonio Bernardo, Clark, Andrew G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3814889/
https://www.ncbi.nlm.nih.gov/pubmed/23921660
http://dx.doi.org/10.1101/gr.156034.113
_version_ 1782289318446891008
author Carvalho, Antonio Bernardo
Clark, Andrew G.
author_facet Carvalho, Antonio Bernardo
Clark, Andrew G.
author_sort Carvalho, Antonio Bernardo
collection PubMed
description Notwithstanding their biological importance, Y chromosomes remain poorly known in most species. A major obstacle to their study is the identification of Y chromosome sequences; due to its high content of repetitive DNA, in most genome projects, the Y chromosome sequence is fragmented into a large number of small, unmapped scaffolds. Identification of Y-linked genes among these fragments has yielded important insights about the origin and evolution of Y chromosomes, but the process is labor intensive, restricting studies to a small number of species. Apart from these fragmentary assemblies, in a few mammalian species, the euchromatic sequence of the Y is essentially complete, owing to painstaking BAC mapping and sequencing. Here we use female short-read sequencing and k-mer comparison to identify Y-linked sequences in two very different genomes, Drosophila virilis and human. Using this method, essentially all D. virilis scaffolds were unambiguously classified as Y-linked or not Y-linked. We found 800 new scaffolds (totaling 8.5 Mbp), and four new genes in the Y chromosome of D. virilis, including JYalpha, a gene involved in hybrid male sterility. Our results also strongly support the preponderance of gene gains over gene losses in the evolution of the Drosophila Y. In the intensively studied human genome, used here as a positive control, we recovered all previously known genes or gene families, plus a small amount (283 kb) of new, unfinished sequence. Hence, this method works in large and complex genomes and can be applied to any species with sex chromosomes.
format Online
Article
Text
id pubmed-3814889
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-38148892014-05-01 Efficient identification of Y chromosome sequences in the human and Drosophila genomes Carvalho, Antonio Bernardo Clark, Andrew G. Genome Res Method Notwithstanding their biological importance, Y chromosomes remain poorly known in most species. A major obstacle to their study is the identification of Y chromosome sequences; due to its high content of repetitive DNA, in most genome projects, the Y chromosome sequence is fragmented into a large number of small, unmapped scaffolds. Identification of Y-linked genes among these fragments has yielded important insights about the origin and evolution of Y chromosomes, but the process is labor intensive, restricting studies to a small number of species. Apart from these fragmentary assemblies, in a few mammalian species, the euchromatic sequence of the Y is essentially complete, owing to painstaking BAC mapping and sequencing. Here we use female short-read sequencing and k-mer comparison to identify Y-linked sequences in two very different genomes, Drosophila virilis and human. Using this method, essentially all D. virilis scaffolds were unambiguously classified as Y-linked or not Y-linked. We found 800 new scaffolds (totaling 8.5 Mbp), and four new genes in the Y chromosome of D. virilis, including JYalpha, a gene involved in hybrid male sterility. Our results also strongly support the preponderance of gene gains over gene losses in the evolution of the Drosophila Y. In the intensively studied human genome, used here as a positive control, we recovered all previously known genes or gene families, plus a small amount (283 kb) of new, unfinished sequence. Hence, this method works in large and complex genomes and can be applied to any species with sex chromosomes. Cold Spring Harbor Laboratory Press 2013-11 /pmc/articles/PMC3814889/ /pubmed/23921660 http://dx.doi.org/10.1101/gr.156034.113 Text en © 2013 Carvalho and Clark; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/3.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported), as described at http://creativecommons.org/licenses/by-nc/3.0/.
spellingShingle Method
Carvalho, Antonio Bernardo
Clark, Andrew G.
Efficient identification of Y chromosome sequences in the human and Drosophila genomes
title Efficient identification of Y chromosome sequences in the human and Drosophila genomes
title_full Efficient identification of Y chromosome sequences in the human and Drosophila genomes
title_fullStr Efficient identification of Y chromosome sequences in the human and Drosophila genomes
title_full_unstemmed Efficient identification of Y chromosome sequences in the human and Drosophila genomes
title_short Efficient identification of Y chromosome sequences in the human and Drosophila genomes
title_sort efficient identification of y chromosome sequences in the human and drosophila genomes
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3814889/
https://www.ncbi.nlm.nih.gov/pubmed/23921660
http://dx.doi.org/10.1101/gr.156034.113
work_keys_str_mv AT carvalhoantoniobernardo efficientidentificationofychromosomesequencesinthehumananddrosophilagenomes
AT clarkandrewg efficientidentificationofychromosomesequencesinthehumananddrosophilagenomes