Cargando…

Identification and characterisation of endogenous Avian Leukosis Virus subgroup E (ALVE) insertions in chicken whole genome sequencing data

BACKGROUND: Endogenous retroviruses (ERVs) are the remnants of retroviral infections which can elicit prolonged genomic and immunological stress on their host organism. In chickens, endogenous Avian Leukosis Virus subgroup E (ALVE) expression has been associated with reductions in muscle growth rate...

Descripción completa

Detalles Bibliográficos
Autores principales: Mason, Andrew S., Lund, Ashlee R., Hocking, Paul M., Fulton, Janet E., Burt, David W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7325683/
https://www.ncbi.nlm.nih.gov/pubmed/32617122
http://dx.doi.org/10.1186/s13100-020-00216-w
_version_ 1783552192564690944
author Mason, Andrew S.
Lund, Ashlee R.
Hocking, Paul M.
Fulton, Janet E.
Burt, David W.
author_facet Mason, Andrew S.
Lund, Ashlee R.
Hocking, Paul M.
Fulton, Janet E.
Burt, David W.
author_sort Mason, Andrew S.
collection PubMed
description BACKGROUND: Endogenous retroviruses (ERVs) are the remnants of retroviral infections which can elicit prolonged genomic and immunological stress on their host organism. In chickens, endogenous Avian Leukosis Virus subgroup E (ALVE) expression has been associated with reductions in muscle growth rate and egg production, as well as providing the potential for novel recombinant viruses. However, ALVEs can remain in commercial stock due to their incomplete identification and association with desirable traits, such as ALVE21 and slow feathering. The availability of whole genome sequencing (WGS) data facilitates high-throughput identification and characterisation of these retroviral remnants. RESULTS: We have developed obsERVer, a new bioinformatic ERV identification pipeline which can identify ALVEs in WGS data without further sequencing. With this pipeline, 20 ALVEs were identified across eight elite layer lines from Hy-Line International, including four novel integrations and characterisation of a fast feathered phenotypic revertant that still contained ALVE21. These bioinformatically detected sites were subsequently validated using new high-throughput KASP assays, which showed that obsERVer was highly precise and exhibited a 0% false discovery rate. A further fifty-seven diverse chicken WGS datasets were analysed for their ALVE content, identifying a total of 322 integration sites, over 80% of which were novel. Like exogenous ALV, ALVEs show site preference for proximity to protein-coding genes, but also exhibit signs of selection against deleterious integrations within genes. CONCLUSIONS: obsERVer is a highly precise and broadly applicable pipeline for identifying retroviral integrations in WGS data. ALVE identification in commercial layers has aided development of high-throughput diagnostic assays which will aid ALVE management, with the aim to eventually eradicate ALVEs from high performance lines. Analysis of non-commercial chicken datasets with obsERVer has revealed broad ALVE diversity and facilitates the study of the biological effects of these ERVs in wild and domesticated populations.
format Online
Article
Text
id pubmed-7325683
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73256832020-07-01 Identification and characterisation of endogenous Avian Leukosis Virus subgroup E (ALVE) insertions in chicken whole genome sequencing data Mason, Andrew S. Lund, Ashlee R. Hocking, Paul M. Fulton, Janet E. Burt, David W. Mob DNA Research BACKGROUND: Endogenous retroviruses (ERVs) are the remnants of retroviral infections which can elicit prolonged genomic and immunological stress on their host organism. In chickens, endogenous Avian Leukosis Virus subgroup E (ALVE) expression has been associated with reductions in muscle growth rate and egg production, as well as providing the potential for novel recombinant viruses. However, ALVEs can remain in commercial stock due to their incomplete identification and association with desirable traits, such as ALVE21 and slow feathering. The availability of whole genome sequencing (WGS) data facilitates high-throughput identification and characterisation of these retroviral remnants. RESULTS: We have developed obsERVer, a new bioinformatic ERV identification pipeline which can identify ALVEs in WGS data without further sequencing. With this pipeline, 20 ALVEs were identified across eight elite layer lines from Hy-Line International, including four novel integrations and characterisation of a fast feathered phenotypic revertant that still contained ALVE21. These bioinformatically detected sites were subsequently validated using new high-throughput KASP assays, which showed that obsERVer was highly precise and exhibited a 0% false discovery rate. A further fifty-seven diverse chicken WGS datasets were analysed for their ALVE content, identifying a total of 322 integration sites, over 80% of which were novel. Like exogenous ALV, ALVEs show site preference for proximity to protein-coding genes, but also exhibit signs of selection against deleterious integrations within genes. CONCLUSIONS: obsERVer is a highly precise and broadly applicable pipeline for identifying retroviral integrations in WGS data. ALVE identification in commercial layers has aided development of high-throughput diagnostic assays which will aid ALVE management, with the aim to eventually eradicate ALVEs from high performance lines. Analysis of non-commercial chicken datasets with obsERVer has revealed broad ALVE diversity and facilitates the study of the biological effects of these ERVs in wild and domesticated populations. BioMed Central 2020-06-30 /pmc/articles/PMC7325683/ /pubmed/32617122 http://dx.doi.org/10.1186/s13100-020-00216-w Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Mason, Andrew S.
Lund, Ashlee R.
Hocking, Paul M.
Fulton, Janet E.
Burt, David W.
Identification and characterisation of endogenous Avian Leukosis Virus subgroup E (ALVE) insertions in chicken whole genome sequencing data
title Identification and characterisation of endogenous Avian Leukosis Virus subgroup E (ALVE) insertions in chicken whole genome sequencing data
title_full Identification and characterisation of endogenous Avian Leukosis Virus subgroup E (ALVE) insertions in chicken whole genome sequencing data
title_fullStr Identification and characterisation of endogenous Avian Leukosis Virus subgroup E (ALVE) insertions in chicken whole genome sequencing data
title_full_unstemmed Identification and characterisation of endogenous Avian Leukosis Virus subgroup E (ALVE) insertions in chicken whole genome sequencing data
title_short Identification and characterisation of endogenous Avian Leukosis Virus subgroup E (ALVE) insertions in chicken whole genome sequencing data
title_sort identification and characterisation of endogenous avian leukosis virus subgroup e (alve) insertions in chicken whole genome sequencing data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7325683/
https://www.ncbi.nlm.nih.gov/pubmed/32617122
http://dx.doi.org/10.1186/s13100-020-00216-w
work_keys_str_mv AT masonandrews identificationandcharacterisationofendogenousavianleukosisvirussubgroupealveinsertionsinchickenwholegenomesequencingdata
AT lundashleer identificationandcharacterisationofendogenousavianleukosisvirussubgroupealveinsertionsinchickenwholegenomesequencingdata
AT hockingpaulm identificationandcharacterisationofendogenousavianleukosisvirussubgroupealveinsertionsinchickenwholegenomesequencingdata
AT fultonjanete identificationandcharacterisationofendogenousavianleukosisvirussubgroupealveinsertionsinchickenwholegenomesequencingdata
AT burtdavidw identificationandcharacterisationofendogenousavianleukosisvirussubgroupealveinsertionsinchickenwholegenomesequencingdata