Cargando…

Chiropteran types I and II interferon genes inferred from genome sequencing traces by a statistical gene-family assembler

BACKGROUND: The rate of emergence of human pathogens is steadily increasing; most of these novel agents originate in wildlife. Bats, remarkably, are the natural reservoirs of many of the most pathogenic viruses in humans. There are two bat genome projects currently underway, a circumstance that prom...

Descripción completa

Detalles Bibliográficos
Autores principales: Kepler, Thomas B, Sample, Christopher, Hudak, Kathryn, Roach, Jeffrey, Haines, Albert, Walsh, Allyson, Ramsburg, Elizabeth A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3091641/
https://www.ncbi.nlm.nih.gov/pubmed/20663124
http://dx.doi.org/10.1186/1471-2164-11-444
_version_ 1782203293328474112
author Kepler, Thomas B
Sample, Christopher
Hudak, Kathryn
Roach, Jeffrey
Haines, Albert
Walsh, Allyson
Ramsburg, Elizabeth A
author_facet Kepler, Thomas B
Sample, Christopher
Hudak, Kathryn
Roach, Jeffrey
Haines, Albert
Walsh, Allyson
Ramsburg, Elizabeth A
author_sort Kepler, Thomas B
collection PubMed
description BACKGROUND: The rate of emergence of human pathogens is steadily increasing; most of these novel agents originate in wildlife. Bats, remarkably, are the natural reservoirs of many of the most pathogenic viruses in humans. There are two bat genome projects currently underway, a circumstance that promises to speed the discovery host factors important in the coevolution of bats with their viruses. These genomes, however, are not yet assembled and one of them will provide only low coverage, making the inference of most genes of immunological interest error-prone. Many more wildlife genome projects are underway and intend to provide only shallow coverage. RESULTS: We have developed a statistical method for the assembly of gene families from partial genomes. The method takes full advantage of the quality scores generated by base-calling software, incorporating them into a complete probabilistic error model, to overcome the limitation inherent in the inference of gene family members from partial sequence information. We validated the method by inferring the human IFNA genes from the genome trace archives, and used it to infer 61 type-I interferon genes, and single type-II interferon genes in the bats Pteropus vampyrus and Myotis lucifugus. We confirmed our inferences by direct cloning and sequencing of IFNA, IFNB, IFND, and IFNK in P. vampyrus, and by demonstrating transcription of some of the inferred genes by known interferon-inducing stimuli. CONCLUSION: The statistical trace assembler described here provides a reliable method for extracting information from the many available and forthcoming partial or shallow genome sequencing projects, thereby facilitating the study of a wider variety of organisms with ecological and biomedical significance to humans than would otherwise be possible.
format Text
id pubmed-3091641
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30916412011-05-11 Chiropteran types I and II interferon genes inferred from genome sequencing traces by a statistical gene-family assembler Kepler, Thomas B Sample, Christopher Hudak, Kathryn Roach, Jeffrey Haines, Albert Walsh, Allyson Ramsburg, Elizabeth A BMC Genomics Research Article BACKGROUND: The rate of emergence of human pathogens is steadily increasing; most of these novel agents originate in wildlife. Bats, remarkably, are the natural reservoirs of many of the most pathogenic viruses in humans. There are two bat genome projects currently underway, a circumstance that promises to speed the discovery host factors important in the coevolution of bats with their viruses. These genomes, however, are not yet assembled and one of them will provide only low coverage, making the inference of most genes of immunological interest error-prone. Many more wildlife genome projects are underway and intend to provide only shallow coverage. RESULTS: We have developed a statistical method for the assembly of gene families from partial genomes. The method takes full advantage of the quality scores generated by base-calling software, incorporating them into a complete probabilistic error model, to overcome the limitation inherent in the inference of gene family members from partial sequence information. We validated the method by inferring the human IFNA genes from the genome trace archives, and used it to infer 61 type-I interferon genes, and single type-II interferon genes in the bats Pteropus vampyrus and Myotis lucifugus. We confirmed our inferences by direct cloning and sequencing of IFNA, IFNB, IFND, and IFNK in P. vampyrus, and by demonstrating transcription of some of the inferred genes by known interferon-inducing stimuli. CONCLUSION: The statistical trace assembler described here provides a reliable method for extracting information from the many available and forthcoming partial or shallow genome sequencing projects, thereby facilitating the study of a wider variety of organisms with ecological and biomedical significance to humans than would otherwise be possible. BioMed Central 2010-07-21 /pmc/articles/PMC3091641/ /pubmed/20663124 http://dx.doi.org/10.1186/1471-2164-11-444 Text en Copyright ©2010 Kepler et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Kepler, Thomas B
Sample, Christopher
Hudak, Kathryn
Roach, Jeffrey
Haines, Albert
Walsh, Allyson
Ramsburg, Elizabeth A
Chiropteran types I and II interferon genes inferred from genome sequencing traces by a statistical gene-family assembler
title Chiropteran types I and II interferon genes inferred from genome sequencing traces by a statistical gene-family assembler
title_full Chiropteran types I and II interferon genes inferred from genome sequencing traces by a statistical gene-family assembler
title_fullStr Chiropteran types I and II interferon genes inferred from genome sequencing traces by a statistical gene-family assembler
title_full_unstemmed Chiropteran types I and II interferon genes inferred from genome sequencing traces by a statistical gene-family assembler
title_short Chiropteran types I and II interferon genes inferred from genome sequencing traces by a statistical gene-family assembler
title_sort chiropteran types i and ii interferon genes inferred from genome sequencing traces by a statistical gene-family assembler
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3091641/
https://www.ncbi.nlm.nih.gov/pubmed/20663124
http://dx.doi.org/10.1186/1471-2164-11-444
work_keys_str_mv AT keplerthomasb chiropterantypesiandiiinterferongenesinferredfromgenomesequencingtracesbyastatisticalgenefamilyassembler
AT samplechristopher chiropterantypesiandiiinterferongenesinferredfromgenomesequencingtracesbyastatisticalgenefamilyassembler
AT hudakkathryn chiropterantypesiandiiinterferongenesinferredfromgenomesequencingtracesbyastatisticalgenefamilyassembler
AT roachjeffrey chiropterantypesiandiiinterferongenesinferredfromgenomesequencingtracesbyastatisticalgenefamilyassembler
AT hainesalbert chiropterantypesiandiiinterferongenesinferredfromgenomesequencingtracesbyastatisticalgenefamilyassembler
AT walshallyson chiropterantypesiandiiinterferongenesinferredfromgenomesequencingtracesbyastatisticalgenefamilyassembler
AT ramsburgelizabetha chiropterantypesiandiiinterferongenesinferredfromgenomesequencingtracesbyastatisticalgenefamilyassembler