Cargando…

Algorithm for large-scale clustering across multiple genomes

Identifying genomic regions that descended from a common ancestor helps us study the gene function and genome evolution. In distantly related genomes, clusters of homologous gene pairs are evidently used in function prediction, operon detection, etc. Currently, there are many kinds of computational...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yi, Gangman, Jung, Jaehee
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Biomedical Informatics 2011
Materias:	Hypothesis
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3218420/ https://www.ncbi.nlm.nih.gov/pubmed/22125394

_version_	1782216695620829184
author	Yi, Gangman Jung, Jaehee
author_facet	Yi, Gangman Jung, Jaehee
author_sort	Yi, Gangman
collection	PubMed
description	Identifying genomic regions that descended from a common ancestor helps us study the gene function and genome evolution. In distantly related genomes, clusters of homologous gene pairs are evidently used in function prediction, operon detection, etc. Currently, there are many kinds of computational methods that have been proposed defining gene clusters to identify gene families and operons. However, most of those algorithms are only available on a data set of small size. We developed an efficient gene clustering algorithm that can be applied on hundreds of genomes at the same time. This approach allows for large-scale study of evolutionary relationships of gene clusters and study of operon formation and destruction. An analysis of proposed algorithms shows that more biological insight can be obtained by analyzing gene clusters across hundreds of genomes, which can help us understand operon occurrences, gene orientations and gene rearrangements.
format	Online Article Text
id	pubmed-3218420
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	Biomedical Informatics
record_format	MEDLINE/PubMed
spelling	pubmed-32184202011-11-28 Algorithm for large-scale clustering across multiple genomes Yi, Gangman Jung, Jaehee Bioinformation Hypothesis Identifying genomic regions that descended from a common ancestor helps us study the gene function and genome evolution. In distantly related genomes, clusters of homologous gene pairs are evidently used in function prediction, operon detection, etc. Currently, there are many kinds of computational methods that have been proposed defining gene clusters to identify gene families and operons. However, most of those algorithms are only available on a data set of small size. We developed an efficient gene clustering algorithm that can be applied on hundreds of genomes at the same time. This approach allows for large-scale study of evolutionary relationships of gene clusters and study of operon formation and destruction. An analysis of proposed algorithms shows that more biological insight can be obtained by analyzing gene clusters across hundreds of genomes, which can help us understand operon occurrences, gene orientations and gene rearrangements. Biomedical Informatics 2011-10-31 /pmc/articles/PMC3218420/ /pubmed/22125394 Text en © 2011 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited.
spellingShingle	Hypothesis Yi, Gangman Jung, Jaehee Algorithm for large-scale clustering across multiple genomes
title	Algorithm for large-scale clustering across multiple genomes
title_full	Algorithm for large-scale clustering across multiple genomes
title_fullStr	Algorithm for large-scale clustering across multiple genomes
title_full_unstemmed	Algorithm for large-scale clustering across multiple genomes
title_short	Algorithm for large-scale clustering across multiple genomes
title_sort	algorithm for large-scale clustering across multiple genomes
topic	Hypothesis
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3218420/ https://www.ncbi.nlm.nih.gov/pubmed/22125394
work_keys_str_mv	AT yigangman algorithmforlargescaleclusteringacrossmultiplegenomes AT jungjaehee algorithmforlargescaleclusteringacrossmultiplegenomes

Algorithm for large-scale clustering across multiple genomes

Ejemplares similares