Cargando…
Algorithm for large-scale clustering across multiple genomes
Identifying genomic regions that descended from a common ancestor helps us study the gene function and genome evolution. In distantly related genomes, clusters of homologous gene pairs are evidently used in function prediction, operon detection, etc. Currently, there are many kinds of computational...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Biomedical Informatics
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3218420/ https://www.ncbi.nlm.nih.gov/pubmed/22125394 |
_version_ | 1782216695620829184 |
---|---|
author | Yi, Gangman Jung, Jaehee |
author_facet | Yi, Gangman Jung, Jaehee |
author_sort | Yi, Gangman |
collection | PubMed |
description | Identifying genomic regions that descended from a common ancestor helps us study the gene function and genome evolution. In distantly related genomes, clusters of homologous gene pairs are evidently used in function prediction, operon detection, etc. Currently, there are many kinds of computational methods that have been proposed defining gene clusters to identify gene families and operons. However, most of those algorithms are only available on a data set of small size. We developed an efficient gene clustering algorithm that can be applied on hundreds of genomes at the same time. This approach allows for large-scale study of evolutionary relationships of gene clusters and study of operon formation and destruction. An analysis of proposed algorithms shows that more biological insight can be obtained by analyzing gene clusters across hundreds of genomes, which can help us understand operon occurrences, gene orientations and gene rearrangements. |
format | Online Article Text |
id | pubmed-3218420 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Biomedical Informatics |
record_format | MEDLINE/PubMed |
spelling | pubmed-32184202011-11-28 Algorithm for large-scale clustering across multiple genomes Yi, Gangman Jung, Jaehee Bioinformation Hypothesis Identifying genomic regions that descended from a common ancestor helps us study the gene function and genome evolution. In distantly related genomes, clusters of homologous gene pairs are evidently used in function prediction, operon detection, etc. Currently, there are many kinds of computational methods that have been proposed defining gene clusters to identify gene families and operons. However, most of those algorithms are only available on a data set of small size. We developed an efficient gene clustering algorithm that can be applied on hundreds of genomes at the same time. This approach allows for large-scale study of evolutionary relationships of gene clusters and study of operon formation and destruction. An analysis of proposed algorithms shows that more biological insight can be obtained by analyzing gene clusters across hundreds of genomes, which can help us understand operon occurrences, gene orientations and gene rearrangements. Biomedical Informatics 2011-10-31 /pmc/articles/PMC3218420/ /pubmed/22125394 Text en © 2011 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited. |
spellingShingle | Hypothesis Yi, Gangman Jung, Jaehee Algorithm for large-scale clustering across multiple genomes |
title | Algorithm for large-scale clustering across multiple genomes |
title_full | Algorithm for large-scale clustering across multiple genomes |
title_fullStr | Algorithm for large-scale clustering across multiple genomes |
title_full_unstemmed | Algorithm for large-scale clustering across multiple genomes |
title_short | Algorithm for large-scale clustering across multiple genomes |
title_sort | algorithm for large-scale clustering across multiple genomes |
topic | Hypothesis |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3218420/ https://www.ncbi.nlm.nih.gov/pubmed/22125394 |
work_keys_str_mv | AT yigangman algorithmforlargescaleclusteringacrossmultiplegenomes AT jungjaehee algorithmforlargescaleclusteringacrossmultiplegenomes |