Cargando…

Homology-Independent Metrics for Comparative Genomics

A mainstream procedure to analyze the wealth of genomic data available nowadays is the detection of homologous regions shared across genomes, followed by the extraction of biological information from the patterns of conservation and variation observed in such regions. Although of pivotal importance,...

Descripción completa

Detalles Bibliográficos
Autores principales: Coutinho, Tarcisio José Domingos, Franco, Glória Regina, Lobo, Francisco Pereira
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4446528/
https://www.ncbi.nlm.nih.gov/pubmed/26029354
http://dx.doi.org/10.1016/j.csbj.2015.04.005
_version_ 1782373445135237120
author Coutinho, Tarcisio José Domingos
Franco, Glória Regina
Lobo, Francisco Pereira
author_facet Coutinho, Tarcisio José Domingos
Franco, Glória Regina
Lobo, Francisco Pereira
author_sort Coutinho, Tarcisio José Domingos
collection PubMed
description A mainstream procedure to analyze the wealth of genomic data available nowadays is the detection of homologous regions shared across genomes, followed by the extraction of biological information from the patterns of conservation and variation observed in such regions. Although of pivotal importance, comparative genomic procedures that rely on homology inference are obviously not applicable if no homologous regions are detectable. This fact excludes a considerable portion of “genomic dark matter” with no significant similarity — and, consequently, no inferred homology to any other known sequence — from several downstream comparative genomic methods. In this review we compile several sequence metrics that do not rely on homology inference and can be used to compare nucleotide sequences and extract biologically meaningful information from them. These metrics comprise several compositional parameters calculated from sequence data alone, such as GC content, dinucleotide odds ratio, and several codon bias metrics. They also share other interesting properties, such as pervasiveness (patterns persist on smaller scales) and phylogenetic signal. We also cite examples where these homology-independent metrics have been successfully applied to support several bioinformatics challenges, such as taxonomic classification of biological sequences without homology inference. They where also used to detect higher-order patterns of interactions in biological systems, ranging from detecting coevolutionary trends between the genomes of viruses and their hosts to characterization of gene pools of entire microbial communities. We argue that, if correctly understood and applied, homology-independent metrics can add important layers of biological information in comparative genomic studies without prior homology inference.
format Online
Article
Text
id pubmed-4446528
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-44465282015-05-29 Homology-Independent Metrics for Comparative Genomics Coutinho, Tarcisio José Domingos Franco, Glória Regina Lobo, Francisco Pereira Comput Struct Biotechnol J Mini Review A mainstream procedure to analyze the wealth of genomic data available nowadays is the detection of homologous regions shared across genomes, followed by the extraction of biological information from the patterns of conservation and variation observed in such regions. Although of pivotal importance, comparative genomic procedures that rely on homology inference are obviously not applicable if no homologous regions are detectable. This fact excludes a considerable portion of “genomic dark matter” with no significant similarity — and, consequently, no inferred homology to any other known sequence — from several downstream comparative genomic methods. In this review we compile several sequence metrics that do not rely on homology inference and can be used to compare nucleotide sequences and extract biologically meaningful information from them. These metrics comprise several compositional parameters calculated from sequence data alone, such as GC content, dinucleotide odds ratio, and several codon bias metrics. They also share other interesting properties, such as pervasiveness (patterns persist on smaller scales) and phylogenetic signal. We also cite examples where these homology-independent metrics have been successfully applied to support several bioinformatics challenges, such as taxonomic classification of biological sequences without homology inference. They where also used to detect higher-order patterns of interactions in biological systems, ranging from detecting coevolutionary trends between the genomes of viruses and their hosts to characterization of gene pools of entire microbial communities. We argue that, if correctly understood and applied, homology-independent metrics can add important layers of biological information in comparative genomic studies without prior homology inference. Research Network of Computational and Structural Biotechnology 2015-05-04 /pmc/articles/PMC4446528/ /pubmed/26029354 http://dx.doi.org/10.1016/j.csbj.2015.04.005 Text en © 2015 Coutinho et al. Published by Elsevier B.V. on behalf of the Research Network of Computational and Structural Biotechnology. http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Mini Review
Coutinho, Tarcisio José Domingos
Franco, Glória Regina
Lobo, Francisco Pereira
Homology-Independent Metrics for Comparative Genomics
title Homology-Independent Metrics for Comparative Genomics
title_full Homology-Independent Metrics for Comparative Genomics
title_fullStr Homology-Independent Metrics for Comparative Genomics
title_full_unstemmed Homology-Independent Metrics for Comparative Genomics
title_short Homology-Independent Metrics for Comparative Genomics
title_sort homology-independent metrics for comparative genomics
topic Mini Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4446528/
https://www.ncbi.nlm.nih.gov/pubmed/26029354
http://dx.doi.org/10.1016/j.csbj.2015.04.005
work_keys_str_mv AT coutinhotarcisiojosedomingos homologyindependentmetricsforcomparativegenomics
AT francogloriaregina homologyindependentmetricsforcomparativegenomics
AT lobofranciscopereira homologyindependentmetricsforcomparativegenomics