Cargando…
Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants
BACKGROUND: Chaining is a major problem in constructing gene families. RESULTS: We define a new kind of cluster on graphs with strong and weak edges: soft cliques with backbones (SCWiB). This differs from other definitions in how it controls the "chaining effect", by ensuring clusters sati...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4240082/ https://www.ncbi.nlm.nih.gov/pubmed/25572777 http://dx.doi.org/10.1186/1471-2164-15-S6-S8 |
_version_ | 1782345683083198464 |
---|---|
author | Zheng, Chunfang Kononenko, Alexey Leebens-Mack, Jim Lyons, Eric Sankoff, David |
author_facet | Zheng, Chunfang Kononenko, Alexey Leebens-Mack, Jim Lyons, Eric Sankoff, David |
author_sort | Zheng, Chunfang |
collection | PubMed |
description | BACKGROUND: Chaining is a major problem in constructing gene families. RESULTS: We define a new kind of cluster on graphs with strong and weak edges: soft cliques with backbones (SCWiB). This differs from other definitions in how it controls the "chaining effect", by ensuring clusters satisfy a tolerant edge density criterion that takes into account cluster size. We implement algorithms for decomposing a graph of similarities into SCWiBs. We compare examples of output from SCWiB and the Markov Cluster Algorithm (MCL), and also compare some curated Arabidopsis thaliana gene families with the results of automatic clustering. We apply our method to 44 published angiosperm genomes with annotation, and discover that Amborella trichopoda is distinct from all the others in having substantially and systematically smaller proportions of moderate- and large-size gene families. CONCLUSIONS: We offer several possible evolutionary explanations for this result. |
format | Online Article Text |
id | pubmed-4240082 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42400822014-11-25 Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants Zheng, Chunfang Kononenko, Alexey Leebens-Mack, Jim Lyons, Eric Sankoff, David BMC Genomics Research BACKGROUND: Chaining is a major problem in constructing gene families. RESULTS: We define a new kind of cluster on graphs with strong and weak edges: soft cliques with backbones (SCWiB). This differs from other definitions in how it controls the "chaining effect", by ensuring clusters satisfy a tolerant edge density criterion that takes into account cluster size. We implement algorithms for decomposing a graph of similarities into SCWiBs. We compare examples of output from SCWiB and the Markov Cluster Algorithm (MCL), and also compare some curated Arabidopsis thaliana gene families with the results of automatic clustering. We apply our method to 44 published angiosperm genomes with annotation, and discover that Amborella trichopoda is distinct from all the others in having substantially and systematically smaller proportions of moderate- and large-size gene families. CONCLUSIONS: We offer several possible evolutionary explanations for this result. BioMed Central 2014-10-17 /pmc/articles/PMC4240082/ /pubmed/25572777 http://dx.doi.org/10.1186/1471-2164-15-S6-S8 Text en Copyright © 2014 Zheng et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Zheng, Chunfang Kononenko, Alexey Leebens-Mack, Jim Lyons, Eric Sankoff, David Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants |
title | Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants |
title_full | Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants |
title_fullStr | Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants |
title_full_unstemmed | Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants |
title_short | Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants |
title_sort | gene families as soft cliques with backbones: amborella contrasted with other flowering plants |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4240082/ https://www.ncbi.nlm.nih.gov/pubmed/25572777 http://dx.doi.org/10.1186/1471-2164-15-S6-S8 |
work_keys_str_mv | AT zhengchunfang genefamiliesassoftcliqueswithbackbonesamborellacontrastedwithotherfloweringplants AT kononenkoalexey genefamiliesassoftcliqueswithbackbonesamborellacontrastedwithotherfloweringplants AT leebensmackjim genefamiliesassoftcliqueswithbackbonesamborellacontrastedwithotherfloweringplants AT lyonseric genefamiliesassoftcliqueswithbackbonesamborellacontrastedwithotherfloweringplants AT sankoffdavid genefamiliesassoftcliqueswithbackbonesamborellacontrastedwithotherfloweringplants |