Cargando…

The gyrfalcon (Falco rusticolus) genome

High-quality genome assemblies are characterized by high-sequence contiguity, completeness, and a low error rate, thus providing the basis for a wide array of studies focusing on natural species ecology, conservation, evolution, and population genomics. To provide this valuable resource for conserva...

Descripción completa

Detalles Bibliográficos
Autores principales: Zuccolo, Andrea, Mfarrej, Sara, Celii, Mirko, Mussurova, Saule, Rivera, Luis F, Llaca, Victor, Mohammed, Nahed, Pain, Arnab, Alrefaei, Abdulmajeed Fahad, Alrefaei, Abdulwahed Fahad, Wing, Rod A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9997569/
https://www.ncbi.nlm.nih.gov/pubmed/36611193
http://dx.doi.org/10.1093/g3journal/jkad001
_version_ 1784903281685823488
author Zuccolo, Andrea
Mfarrej, Sara
Celii, Mirko
Mussurova, Saule
Rivera, Luis F
Llaca, Victor
Mohammed, Nahed
Pain, Arnab
Alrefaei, Abdulmajeed Fahad
Alrefaei, Abdulwahed Fahad
Wing, Rod A
author_facet Zuccolo, Andrea
Mfarrej, Sara
Celii, Mirko
Mussurova, Saule
Rivera, Luis F
Llaca, Victor
Mohammed, Nahed
Pain, Arnab
Alrefaei, Abdulmajeed Fahad
Alrefaei, Abdulwahed Fahad
Wing, Rod A
author_sort Zuccolo, Andrea
collection PubMed
description High-quality genome assemblies are characterized by high-sequence contiguity, completeness, and a low error rate, thus providing the basis for a wide array of studies focusing on natural species ecology, conservation, evolution, and population genomics. To provide this valuable resource for conservation projects and comparative genomics studies on gyrfalcon (Falco rusticolus), we sequenced and assembled the genome of this species using third-generation sequencing strategies and optical maps. Here, we describe a highly contiguous and complete genome assembly comprising 20 scaffolds and 13 contigs with a total size of 1.193 Gbp, including 8,064 complete Benchmarking Universal Single-Copy Orthologs (BUSCOs) of the total 8,338 BUSCO groups present in the library aves_odb10. Of these BUSCO genes, 96.7% were complete, 96.1% were present as a single copy, and 0.6% were duplicated. Furthermore, 0.8% of BUSCO genes were fragmented and 2.5% (210) were missing. A de novo search for transposable elements (TEs) identified 5,716 TEs that masked 7.61% of the F. rusticolus genome assembly when combined with publicly available TE collections. Long interspersed nuclear elements, in particular, the element Chicken-repeat 1 (CR1), were the most abundant TEs in the F. rusticolus genome. A de novo first-pass gene annotation was performed using 293,349 PacBio Iso-Seq transcripts and 496,195 transcripts derived from the assembly of 42,429,525 Illumina PE RNA-seq reads. In all, 19,602 putative genes, of which 59.31% were functionally characterized and associated with Gene Ontology terms, were annotated. A comparison of the gyrfalcon genome assembly with the publicly available assemblies of the domestic chicken (Gallus gallus), zebra finch (Taeniopygia guttata), and hummingbird (Calypte anna) revealed several genome rearrangements. In particular, nine putative chromosome fusions were identified in the gyrfalcon genome assembly compared with those in the G. gallus genome assembly. This genome assembly, its annotation for TEs and genes, and the comparative analyses presented, complement and strength the base of high-quality genome assemblies and associated resources available for comparative studies focusing on the evolution, ecology, and conservation of Aves.
format Online
Article
Text
id pubmed-9997569
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-99975692023-03-10 The gyrfalcon (Falco rusticolus) genome Zuccolo, Andrea Mfarrej, Sara Celii, Mirko Mussurova, Saule Rivera, Luis F Llaca, Victor Mohammed, Nahed Pain, Arnab Alrefaei, Abdulmajeed Fahad Alrefaei, Abdulwahed Fahad Wing, Rod A G3 (Bethesda) Genome Report High-quality genome assemblies are characterized by high-sequence contiguity, completeness, and a low error rate, thus providing the basis for a wide array of studies focusing on natural species ecology, conservation, evolution, and population genomics. To provide this valuable resource for conservation projects and comparative genomics studies on gyrfalcon (Falco rusticolus), we sequenced and assembled the genome of this species using third-generation sequencing strategies and optical maps. Here, we describe a highly contiguous and complete genome assembly comprising 20 scaffolds and 13 contigs with a total size of 1.193 Gbp, including 8,064 complete Benchmarking Universal Single-Copy Orthologs (BUSCOs) of the total 8,338 BUSCO groups present in the library aves_odb10. Of these BUSCO genes, 96.7% were complete, 96.1% were present as a single copy, and 0.6% were duplicated. Furthermore, 0.8% of BUSCO genes were fragmented and 2.5% (210) were missing. A de novo search for transposable elements (TEs) identified 5,716 TEs that masked 7.61% of the F. rusticolus genome assembly when combined with publicly available TE collections. Long interspersed nuclear elements, in particular, the element Chicken-repeat 1 (CR1), were the most abundant TEs in the F. rusticolus genome. A de novo first-pass gene annotation was performed using 293,349 PacBio Iso-Seq transcripts and 496,195 transcripts derived from the assembly of 42,429,525 Illumina PE RNA-seq reads. In all, 19,602 putative genes, of which 59.31% were functionally characterized and associated with Gene Ontology terms, were annotated. A comparison of the gyrfalcon genome assembly with the publicly available assemblies of the domestic chicken (Gallus gallus), zebra finch (Taeniopygia guttata), and hummingbird (Calypte anna) revealed several genome rearrangements. In particular, nine putative chromosome fusions were identified in the gyrfalcon genome assembly compared with those in the G. gallus genome assembly. This genome assembly, its annotation for TEs and genes, and the comparative analyses presented, complement and strength the base of high-quality genome assemblies and associated resources available for comparative studies focusing on the evolution, ecology, and conservation of Aves. Oxford University Press 2023-01-05 /pmc/articles/PMC9997569/ /pubmed/36611193 http://dx.doi.org/10.1093/g3journal/jkad001 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genome Report
Zuccolo, Andrea
Mfarrej, Sara
Celii, Mirko
Mussurova, Saule
Rivera, Luis F
Llaca, Victor
Mohammed, Nahed
Pain, Arnab
Alrefaei, Abdulmajeed Fahad
Alrefaei, Abdulwahed Fahad
Wing, Rod A
The gyrfalcon (Falco rusticolus) genome
title The gyrfalcon (Falco rusticolus) genome
title_full The gyrfalcon (Falco rusticolus) genome
title_fullStr The gyrfalcon (Falco rusticolus) genome
title_full_unstemmed The gyrfalcon (Falco rusticolus) genome
title_short The gyrfalcon (Falco rusticolus) genome
title_sort gyrfalcon (falco rusticolus) genome
topic Genome Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9997569/
https://www.ncbi.nlm.nih.gov/pubmed/36611193
http://dx.doi.org/10.1093/g3journal/jkad001
work_keys_str_mv AT zuccoloandrea thegyrfalconfalcorusticolusgenome
AT mfarrejsara thegyrfalconfalcorusticolusgenome
AT celiimirko thegyrfalconfalcorusticolusgenome
AT mussurovasaule thegyrfalconfalcorusticolusgenome
AT riveraluisf thegyrfalconfalcorusticolusgenome
AT llacavictor thegyrfalconfalcorusticolusgenome
AT mohammednahed thegyrfalconfalcorusticolusgenome
AT painarnab thegyrfalconfalcorusticolusgenome
AT alrefaeiabdulmajeedfahad thegyrfalconfalcorusticolusgenome
AT alrefaeiabdulwahedfahad thegyrfalconfalcorusticolusgenome
AT wingroda thegyrfalconfalcorusticolusgenome
AT zuccoloandrea gyrfalconfalcorusticolusgenome
AT mfarrejsara gyrfalconfalcorusticolusgenome
AT celiimirko gyrfalconfalcorusticolusgenome
AT mussurovasaule gyrfalconfalcorusticolusgenome
AT riveraluisf gyrfalconfalcorusticolusgenome
AT llacavictor gyrfalconfalcorusticolusgenome
AT mohammednahed gyrfalconfalcorusticolusgenome
AT painarnab gyrfalconfalcorusticolusgenome
AT alrefaeiabdulmajeedfahad gyrfalconfalcorusticolusgenome
AT alrefaeiabdulwahedfahad gyrfalconfalcorusticolusgenome
AT wingroda gyrfalconfalcorusticolusgenome