Cargando…

Use of Artificial Genomes in Assessing Methods for Atypical Gene Detection

Parametric methods for identifying laterally transferred genes exploit the directional mutational biases unique to each genome. Yet the development of new, more robust methods—as well as the evaluation and proper implementation of existing methods—relies on an arbitrary assessment of performance usi...

Descripción completa

Detalles Bibliográficos
Autores principales: Azad, Rajeev K, Lawrence, Jeffrey G
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1282332/
https://www.ncbi.nlm.nih.gov/pubmed/16292353
http://dx.doi.org/10.1371/journal.pcbi.0010056
_version_ 1782126133281554432
author Azad, Rajeev K
Lawrence, Jeffrey G
author_facet Azad, Rajeev K
Lawrence, Jeffrey G
author_sort Azad, Rajeev K
collection PubMed
description Parametric methods for identifying laterally transferred genes exploit the directional mutational biases unique to each genome. Yet the development of new, more robust methods—as well as the evaluation and proper implementation of existing methods—relies on an arbitrary assessment of performance using real genomes, where the evolutionary histories of genes are not known. We have used the framework of a generalized hidden Markov model to create artificial genomes modeled after genuine genomes. To model a genome, “core” genes—those displaying patterns of mutational biases shared among large numbers of genes—are identified by a novel gene clustering approach based on the Akaike information criterion. Gene models derived from multiple “core” gene clusters are used to generate an artificial genome that models the properties of a genuine genome. Chimeric artificial genomes—representing those having experienced lateral gene transfer—were created by combining genes from multiple artificial genomes, and the performance of the parametric methods for identifying “atypical” genes was assessed directly. We found that a hidden Markov model that included multiple gene models, each trained on sets of genes representing the range of genotypic variability within a genome, could produce artificial genomes that mimicked the properties of genuine genomes. Moreover, different methods for detecting foreign genes performed differently—i.e., they had different sets of strengths and weaknesses—when identifying atypical genes within chimeric artificial genomes.
format Text
id pubmed-1282332
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-12823322005-12-01 Use of Artificial Genomes in Assessing Methods for Atypical Gene Detection Azad, Rajeev K Lawrence, Jeffrey G PLoS Comput Biol Research Article Parametric methods for identifying laterally transferred genes exploit the directional mutational biases unique to each genome. Yet the development of new, more robust methods—as well as the evaluation and proper implementation of existing methods—relies on an arbitrary assessment of performance using real genomes, where the evolutionary histories of genes are not known. We have used the framework of a generalized hidden Markov model to create artificial genomes modeled after genuine genomes. To model a genome, “core” genes—those displaying patterns of mutational biases shared among large numbers of genes—are identified by a novel gene clustering approach based on the Akaike information criterion. Gene models derived from multiple “core” gene clusters are used to generate an artificial genome that models the properties of a genuine genome. Chimeric artificial genomes—representing those having experienced lateral gene transfer—were created by combining genes from multiple artificial genomes, and the performance of the parametric methods for identifying “atypical” genes was assessed directly. We found that a hidden Markov model that included multiple gene models, each trained on sets of genes representing the range of genotypic variability within a genome, could produce artificial genomes that mimicked the properties of genuine genomes. Moreover, different methods for detecting foreign genes performed differently—i.e., they had different sets of strengths and weaknesses—when identifying atypical genes within chimeric artificial genomes. Public Library of Science 2005-11 2005-11-11 /pmc/articles/PMC1282332/ /pubmed/16292353 http://dx.doi.org/10.1371/journal.pcbi.0010056 Text en Copyright: © 2005 Azad and Lawrence. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Azad, Rajeev K
Lawrence, Jeffrey G
Use of Artificial Genomes in Assessing Methods for Atypical Gene Detection
title Use of Artificial Genomes in Assessing Methods for Atypical Gene Detection
title_full Use of Artificial Genomes in Assessing Methods for Atypical Gene Detection
title_fullStr Use of Artificial Genomes in Assessing Methods for Atypical Gene Detection
title_full_unstemmed Use of Artificial Genomes in Assessing Methods for Atypical Gene Detection
title_short Use of Artificial Genomes in Assessing Methods for Atypical Gene Detection
title_sort use of artificial genomes in assessing methods for atypical gene detection
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1282332/
https://www.ncbi.nlm.nih.gov/pubmed/16292353
http://dx.doi.org/10.1371/journal.pcbi.0010056
work_keys_str_mv AT azadrajeevk useofartificialgenomesinassessingmethodsforatypicalgenedetection
AT lawrencejeffreyg useofartificialgenomesinassessingmethodsforatypicalgenedetection