Cargando…

Algorithm for large-scale clustering across multiple genomes

Identifying genomic regions that descended from a common ancestor helps us study the gene function and genome evolution. In distantly related genomes, clusters of homologous gene pairs are evidently used in function prediction, operon detection, etc. Currently, there are many kinds of computational...

Descripción completa

Detalles Bibliográficos
Autores principales: Yi, Gangman, Jung, Jaehee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Biomedical Informatics 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3218420/
https://www.ncbi.nlm.nih.gov/pubmed/22125394
_version_ 1782216695620829184
author Yi, Gangman
Jung, Jaehee
author_facet Yi, Gangman
Jung, Jaehee
author_sort Yi, Gangman
collection PubMed
description Identifying genomic regions that descended from a common ancestor helps us study the gene function and genome evolution. In distantly related genomes, clusters of homologous gene pairs are evidently used in function prediction, operon detection, etc. Currently, there are many kinds of computational methods that have been proposed defining gene clusters to identify gene families and operons. However, most of those algorithms are only available on a data set of small size. We developed an efficient gene clustering algorithm that can be applied on hundreds of genomes at the same time. This approach allows for large-scale study of evolutionary relationships of gene clusters and study of operon formation and destruction. An analysis of proposed algorithms shows that more biological insight can be obtained by analyzing gene clusters across hundreds of genomes, which can help us understand operon occurrences, gene orientations and gene rearrangements.
format Online
Article
Text
id pubmed-3218420
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Biomedical Informatics
record_format MEDLINE/PubMed
spelling pubmed-32184202011-11-28 Algorithm for large-scale clustering across multiple genomes Yi, Gangman Jung, Jaehee Bioinformation Hypothesis Identifying genomic regions that descended from a common ancestor helps us study the gene function and genome evolution. In distantly related genomes, clusters of homologous gene pairs are evidently used in function prediction, operon detection, etc. Currently, there are many kinds of computational methods that have been proposed defining gene clusters to identify gene families and operons. However, most of those algorithms are only available on a data set of small size. We developed an efficient gene clustering algorithm that can be applied on hundreds of genomes at the same time. This approach allows for large-scale study of evolutionary relationships of gene clusters and study of operon formation and destruction. An analysis of proposed algorithms shows that more biological insight can be obtained by analyzing gene clusters across hundreds of genomes, which can help us understand operon occurrences, gene orientations and gene rearrangements. Biomedical Informatics 2011-10-31 /pmc/articles/PMC3218420/ /pubmed/22125394 Text en © 2011 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited.
spellingShingle Hypothesis
Yi, Gangman
Jung, Jaehee
Algorithm for large-scale clustering across multiple genomes
title Algorithm for large-scale clustering across multiple genomes
title_full Algorithm for large-scale clustering across multiple genomes
title_fullStr Algorithm for large-scale clustering across multiple genomes
title_full_unstemmed Algorithm for large-scale clustering across multiple genomes
title_short Algorithm for large-scale clustering across multiple genomes
title_sort algorithm for large-scale clustering across multiple genomes
topic Hypothesis
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3218420/
https://www.ncbi.nlm.nih.gov/pubmed/22125394
work_keys_str_mv AT yigangman algorithmforlargescaleclusteringacrossmultiplegenomes
AT jungjaehee algorithmforlargescaleclusteringacrossmultiplegenomes