Cargando…
Homology-Independent Metrics for Comparative Genomics
A mainstream procedure to analyze the wealth of genomic data available nowadays is the detection of homologous regions shared across genomes, followed by the extraction of biological information from the patterns of conservation and variation observed in such regions. Although of pivotal importance,...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4446528/ https://www.ncbi.nlm.nih.gov/pubmed/26029354 http://dx.doi.org/10.1016/j.csbj.2015.04.005 |
_version_ | 1782373445135237120 |
---|---|
author | Coutinho, Tarcisio José Domingos Franco, Glória Regina Lobo, Francisco Pereira |
author_facet | Coutinho, Tarcisio José Domingos Franco, Glória Regina Lobo, Francisco Pereira |
author_sort | Coutinho, Tarcisio José Domingos |
collection | PubMed |
description | A mainstream procedure to analyze the wealth of genomic data available nowadays is the detection of homologous regions shared across genomes, followed by the extraction of biological information from the patterns of conservation and variation observed in such regions. Although of pivotal importance, comparative genomic procedures that rely on homology inference are obviously not applicable if no homologous regions are detectable. This fact excludes a considerable portion of “genomic dark matter” with no significant similarity — and, consequently, no inferred homology to any other known sequence — from several downstream comparative genomic methods. In this review we compile several sequence metrics that do not rely on homology inference and can be used to compare nucleotide sequences and extract biologically meaningful information from them. These metrics comprise several compositional parameters calculated from sequence data alone, such as GC content, dinucleotide odds ratio, and several codon bias metrics. They also share other interesting properties, such as pervasiveness (patterns persist on smaller scales) and phylogenetic signal. We also cite examples where these homology-independent metrics have been successfully applied to support several bioinformatics challenges, such as taxonomic classification of biological sequences without homology inference. They where also used to detect higher-order patterns of interactions in biological systems, ranging from detecting coevolutionary trends between the genomes of viruses and their hosts to characterization of gene pools of entire microbial communities. We argue that, if correctly understood and applied, homology-independent metrics can add important layers of biological information in comparative genomic studies without prior homology inference. |
format | Online Article Text |
id | pubmed-4446528 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-44465282015-05-29 Homology-Independent Metrics for Comparative Genomics Coutinho, Tarcisio José Domingos Franco, Glória Regina Lobo, Francisco Pereira Comput Struct Biotechnol J Mini Review A mainstream procedure to analyze the wealth of genomic data available nowadays is the detection of homologous regions shared across genomes, followed by the extraction of biological information from the patterns of conservation and variation observed in such regions. Although of pivotal importance, comparative genomic procedures that rely on homology inference are obviously not applicable if no homologous regions are detectable. This fact excludes a considerable portion of “genomic dark matter” with no significant similarity — and, consequently, no inferred homology to any other known sequence — from several downstream comparative genomic methods. In this review we compile several sequence metrics that do not rely on homology inference and can be used to compare nucleotide sequences and extract biologically meaningful information from them. These metrics comprise several compositional parameters calculated from sequence data alone, such as GC content, dinucleotide odds ratio, and several codon bias metrics. They also share other interesting properties, such as pervasiveness (patterns persist on smaller scales) and phylogenetic signal. We also cite examples where these homology-independent metrics have been successfully applied to support several bioinformatics challenges, such as taxonomic classification of biological sequences without homology inference. They where also used to detect higher-order patterns of interactions in biological systems, ranging from detecting coevolutionary trends between the genomes of viruses and their hosts to characterization of gene pools of entire microbial communities. We argue that, if correctly understood and applied, homology-independent metrics can add important layers of biological information in comparative genomic studies without prior homology inference. Research Network of Computational and Structural Biotechnology 2015-05-04 /pmc/articles/PMC4446528/ /pubmed/26029354 http://dx.doi.org/10.1016/j.csbj.2015.04.005 Text en © 2015 Coutinho et al. Published by Elsevier B.V. on behalf of the Research Network of Computational and Structural Biotechnology. http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Mini Review Coutinho, Tarcisio José Domingos Franco, Glória Regina Lobo, Francisco Pereira Homology-Independent Metrics for Comparative Genomics |
title | Homology-Independent Metrics for Comparative Genomics |
title_full | Homology-Independent Metrics for Comparative Genomics |
title_fullStr | Homology-Independent Metrics for Comparative Genomics |
title_full_unstemmed | Homology-Independent Metrics for Comparative Genomics |
title_short | Homology-Independent Metrics for Comparative Genomics |
title_sort | homology-independent metrics for comparative genomics |
topic | Mini Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4446528/ https://www.ncbi.nlm.nih.gov/pubmed/26029354 http://dx.doi.org/10.1016/j.csbj.2015.04.005 |
work_keys_str_mv | AT coutinhotarcisiojosedomingos homologyindependentmetricsforcomparativegenomics AT francogloriaregina homologyindependentmetricsforcomparativegenomics AT lobofranciscopereira homologyindependentmetricsforcomparativegenomics |