Cargando…

Gen(Htr): a tool for comparative assessment of genetic heterogeneity in microbial genomes generated by massive short-read sequencing

BACKGROUND: Microevolution is the study of short-term changes of alleles within a population and their effects on the phenotype of organisms. The result of the below-species-level evolution is heterogeneity, where populations consist of subpopulations with a large number of structural variations. He...

Descripción completa

Detalles Bibliográficos
Autor principal: Yu, GongXin
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2967562/
https://www.ncbi.nlm.nih.gov/pubmed/20939910
http://dx.doi.org/10.1186/1471-2105-11-508
_version_ 1782189688223694848
author Yu, GongXin
author_facet Yu, GongXin
author_sort Yu, GongXin
collection PubMed
description BACKGROUND: Microevolution is the study of short-term changes of alleles within a population and their effects on the phenotype of organisms. The result of the below-species-level evolution is heterogeneity, where populations consist of subpopulations with a large number of structural variations. Heterogeneity analysis is thus essential to our understanding of how selective and neutral forces shape bacterial populations over a short period of time. The Solexa Genome Analyzer, a next-generation sequencing platform, allows millions of short sequencing reads to be obtained with great accuracy, allowing for the ability to study the dynamics of the bacterial population at the whole genome level. The tool referred to as Gen(Htr )was developed for genome-wide heterogeneity analysis. RESULTS: For particular bacterial strains, Gen(Htr )relies on a set of Solexa short reads on given bacteria pathogens and their isogenic reference genome to identify heterogeneity sites, the chromosomal positions with multiple variants of genes in the bacterial population, and variations that occur in large gene families. Gen(Htr )accomplishes this by building and comparatively analyzing genome-wide heterogeneity genotypes for both the newly sequenced genomes (using massive short-read sequencing) and their isogenic reference (using simulated data). As proof of the concept, this approach was applied to SRX007711, the Solexa sequencing data for a newly sequenced Staphylococcus aureus subsp. USA300 cell line, and demonstrated that it could predict such multiple variants. They include multiple variants of genes critical in pathogenesis, e.g. genes encoding a LysR family transcriptional regulator, 23 S ribosomal RNA, and DNA mismatch repair protein MutS. The heterogeneity results in non-synonymous and nonsense mutations, leading to truncated proteins for both LysR and MutS. CONCLUSION: Gen(Htr )was developed for genome-wide heterogeneity analysis. Although it is much more time-consuming when compared to Maq, a popular tool for SNP analysis, Gen(Htr )is able to predict potential multiple variants that pre-exist in the bacterial population as well as SNPs that occur in the highly duplicated gene families. It is expected that, with the proper experimental design, this analysis can improve our understanding of the molecular mechanism underlying the dynamics and the evolution of drug-resistant bacterial pathogens.
format Text
id pubmed-2967562
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29675622010-11-02 Gen(Htr): a tool for comparative assessment of genetic heterogeneity in microbial genomes generated by massive short-read sequencing Yu, GongXin BMC Bioinformatics Methodology Article BACKGROUND: Microevolution is the study of short-term changes of alleles within a population and their effects on the phenotype of organisms. The result of the below-species-level evolution is heterogeneity, where populations consist of subpopulations with a large number of structural variations. Heterogeneity analysis is thus essential to our understanding of how selective and neutral forces shape bacterial populations over a short period of time. The Solexa Genome Analyzer, a next-generation sequencing platform, allows millions of short sequencing reads to be obtained with great accuracy, allowing for the ability to study the dynamics of the bacterial population at the whole genome level. The tool referred to as Gen(Htr )was developed for genome-wide heterogeneity analysis. RESULTS: For particular bacterial strains, Gen(Htr )relies on a set of Solexa short reads on given bacteria pathogens and their isogenic reference genome to identify heterogeneity sites, the chromosomal positions with multiple variants of genes in the bacterial population, and variations that occur in large gene families. Gen(Htr )accomplishes this by building and comparatively analyzing genome-wide heterogeneity genotypes for both the newly sequenced genomes (using massive short-read sequencing) and their isogenic reference (using simulated data). As proof of the concept, this approach was applied to SRX007711, the Solexa sequencing data for a newly sequenced Staphylococcus aureus subsp. USA300 cell line, and demonstrated that it could predict such multiple variants. They include multiple variants of genes critical in pathogenesis, e.g. genes encoding a LysR family transcriptional regulator, 23 S ribosomal RNA, and DNA mismatch repair protein MutS. The heterogeneity results in non-synonymous and nonsense mutations, leading to truncated proteins for both LysR and MutS. CONCLUSION: Gen(Htr )was developed for genome-wide heterogeneity analysis. Although it is much more time-consuming when compared to Maq, a popular tool for SNP analysis, Gen(Htr )is able to predict potential multiple variants that pre-exist in the bacterial population as well as SNPs that occur in the highly duplicated gene families. It is expected that, with the proper experimental design, this analysis can improve our understanding of the molecular mechanism underlying the dynamics and the evolution of drug-resistant bacterial pathogens. BioMed Central 2010-10-12 /pmc/articles/PMC2967562/ /pubmed/20939910 http://dx.doi.org/10.1186/1471-2105-11-508 Text en Copyright ©2010 Yu; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Yu, GongXin
Gen(Htr): a tool for comparative assessment of genetic heterogeneity in microbial genomes generated by massive short-read sequencing
title Gen(Htr): a tool for comparative assessment of genetic heterogeneity in microbial genomes generated by massive short-read sequencing
title_full Gen(Htr): a tool for comparative assessment of genetic heterogeneity in microbial genomes generated by massive short-read sequencing
title_fullStr Gen(Htr): a tool for comparative assessment of genetic heterogeneity in microbial genomes generated by massive short-read sequencing
title_full_unstemmed Gen(Htr): a tool for comparative assessment of genetic heterogeneity in microbial genomes generated by massive short-read sequencing
title_short Gen(Htr): a tool for comparative assessment of genetic heterogeneity in microbial genomes generated by massive short-read sequencing
title_sort gen(htr): a tool for comparative assessment of genetic heterogeneity in microbial genomes generated by massive short-read sequencing
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2967562/
https://www.ncbi.nlm.nih.gov/pubmed/20939910
http://dx.doi.org/10.1186/1471-2105-11-508
work_keys_str_mv AT yugongxin genhtratoolforcomparativeassessmentofgeneticheterogeneityinmicrobialgenomesgeneratedbymassiveshortreadsequencing