A Transcriptomic Pipeline Adapted for Genomic Sequence Discovery of Germline-Restricted Sequence in Zebra Finch, Taeniopygia guttata
Songbirds have an unusual genomic element which is only found in their germline cells, known as the germline-restricted chromosome (GRC). Because germ cells contain both GRC and non-GRC (or A-chromosome) sequences, confidently identifying the GRC-derived elements from genome assemblies has proven di...
Main Authors: | Asalone, Kathryn C; Takkar, Ajuni K; Saldanha, Colin J; Bracht, John R |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press 2021 |
Subjects: | within-Individual Genome Variation and Germline/Soma Distinction |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8245190/ https://www.ncbi.nlm.nih.gov/pubmed/33905492 http://dx.doi.org/10.1093/gbe/evab088 |
_version_ | 1783716066909749248 |
---|---|
author | Asalone, Kathryn C Takkar, Ajuni K Saldanha, Colin J Bracht, John R |
author_facet | Asalone, Kathryn C Takkar, Ajuni K Saldanha, Colin J Bracht, John R |
author_sort | Asalone, Kathryn C |
collection | PubMed |
description | Songbirds have an unusual genomic element which is only found in their germline cells, known as the germline-restricted chromosome (GRC). Because germ cells contain both GRC and non-GRC (or A-chromosome) sequences, confidently identifying the GRC-derived elements from genome assemblies has proven difficult. Here, we introduce a new application of a transcriptomic method for GRC sequence identification. By adapting the Stringtie/Ballgown pipeline to use somatic and germline DNA reads, we find that the ratio of fragments per kilobase per million mapped reads can be used to confidently assign contigs to the GRC. Using this comparative coverage analysis, we successfully identify 733 contigs as high confidence GRC sequences (720 newly identified in this study) and 51 contigs which were validated using quantitative polymerase chain reaction. We also identified two new GRC genes, one hypothetical protein and one gene encoding an RNase H-like domain, and placed 16 previously identified but unplaced genes onto their host contigs. With the current focus on sequencing GRCs from different songbirds, our work adds to the genomic toolkit to identify GRC elements, and we provide a detailed protocol and GitHub repository at https://github.com/brachtlab/Comparative_Coverage_Analysis (last accessed May 12, 2021). |
format | Online Article Text |
id | pubmed-8245190 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-82451902021-07-01 A Transcriptomic Pipeline Adapted for Genomic Sequence Discovery of Germline-Restricted Sequence in Zebra Finch, Taeniopygia guttata Asalone, Kathryn C Takkar, Ajuni K Saldanha, Colin J Bracht, John R Genome Biol Evol within-Individual Genome Variation and Germline/Soma Distinction Songbirds have an unusual genomic element which is only found in their germline cells, known as the germline-restricted chromosome (GRC). Because germ cells contain both GRC and non-GRC (or A-chromosome) sequences, confidently identifying the GRC-derived elements from genome assemblies has proven difficult. Here, we introduce a new application of a transcriptomic method for GRC sequence identification. By adapting the Stringtie/Ballgown pipeline to use somatic and germline DNA reads, we find that the ratio of fragments per kilobase per million mapped reads can be used to confidently assign contigs to the GRC. Using this comparative coverage analysis, we successfully identify 733 contigs as high confidence GRC sequences (720 newly identified in this study) and 51 contigs which were validated using quantitative polymerase chain reaction. We also identified two new GRC genes, one hypothetical protein and one gene encoding an RNase H-like domain, and placed 16 previously identified but unplaced genes onto their host contigs. With the current focus on sequencing GRCs from different songbirds, our work adds to the genomic toolkit to identify GRC elements, and we provide a detailed protocol and GitHub repository at https://github.com/brachtlab/Comparative_Coverage_Analysis (last accessed May 12, 2021). Oxford University Press 2021-04-26 /pmc/articles/PMC8245190/ /pubmed/33905492 http://dx.doi.org/10.1093/gbe/evab088 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. 
https://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | within-Individual Genome Variation and Germline/Soma Distinction Asalone, Kathryn C Takkar, Ajuni K Saldanha, Colin J Bracht, John R A Transcriptomic Pipeline Adapted for Genomic Sequence Discovery of Germline-Restricted Sequence in Zebra Finch, Taeniopygia guttata |
title | A Transcriptomic Pipeline Adapted for Genomic Sequence Discovery of Germline-Restricted Sequence in Zebra Finch, Taeniopygia guttata |
title_full | A Transcriptomic Pipeline Adapted for Genomic Sequence Discovery of Germline-Restricted Sequence in Zebra Finch, Taeniopygia guttata |
title_fullStr | A Transcriptomic Pipeline Adapted for Genomic Sequence Discovery of Germline-Restricted Sequence in Zebra Finch, Taeniopygia guttata |
title_full_unstemmed | A Transcriptomic Pipeline Adapted for Genomic Sequence Discovery of Germline-Restricted Sequence in Zebra Finch, Taeniopygia guttata |
title_short | A Transcriptomic Pipeline Adapted for Genomic Sequence Discovery of Germline-Restricted Sequence in Zebra Finch, Taeniopygia guttata |
title_sort | transcriptomic pipeline adapted for genomic sequence discovery of germline-restricted sequence in zebra finch, taeniopygia guttata |
topic | within-Individual Genome Variation and Germline/Soma Distinction |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8245190/ https://www.ncbi.nlm.nih.gov/pubmed/33905492 http://dx.doi.org/10.1093/gbe/evab088 |
work_keys_str_mv | AT asalonekathrync atranscriptomicpipelineadaptedforgenomicsequencediscoveryofgermlinerestrictedsequenceinzebrafinchtaeniopygiaguttata AT takkarajunik atranscriptomicpipelineadaptedforgenomicsequencediscoveryofgermlinerestrictedsequenceinzebrafinchtaeniopygiaguttata AT saldanhacolinj atranscriptomicpipelineadaptedforgenomicsequencediscoveryofgermlinerestrictedsequenceinzebrafinchtaeniopygiaguttata AT brachtjohnr atranscriptomicpipelineadaptedforgenomicsequencediscoveryofgermlinerestrictedsequenceinzebrafinchtaeniopygiaguttata AT asalonekathrync transcriptomicpipelineadaptedforgenomicsequencediscoveryofgermlinerestrictedsequenceinzebrafinchtaeniopygiaguttata AT takkarajunik transcriptomicpipelineadaptedforgenomicsequencediscoveryofgermlinerestrictedsequenceinzebrafinchtaeniopygiaguttata AT saldanhacolinj transcriptomicpipelineadaptedforgenomicsequencediscoveryofgermlinerestrictedsequenceinzebrafinchtaeniopygiaguttata AT brachtjohnr transcriptomicpipelineadaptedforgenomicsequencediscoveryofgermlinerestrictedsequenceinzebrafinchtaeniopygiaguttata |