Cargando…

Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants

BACKGROUND: Chaining is a major problem in constructing gene families. RESULTS: We define a new kind of cluster on graphs with strong and weak edges: soft cliques with backbones (SCWiB). This differs from other definitions in how it controls the "chaining effect", by ensuring clusters sati...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Chunfang, Kononenko, Alexey, Leebens-Mack, Jim, Lyons, Eric, Sankoff, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4240082/
https://www.ncbi.nlm.nih.gov/pubmed/25572777
http://dx.doi.org/10.1186/1471-2164-15-S6-S8
_version_ 1782345683083198464
author Zheng, Chunfang
Kononenko, Alexey
Leebens-Mack, Jim
Lyons, Eric
Sankoff, David
author_facet Zheng, Chunfang
Kononenko, Alexey
Leebens-Mack, Jim
Lyons, Eric
Sankoff, David
author_sort Zheng, Chunfang
collection PubMed
description BACKGROUND: Chaining is a major problem in constructing gene families. RESULTS: We define a new kind of cluster on graphs with strong and weak edges: soft cliques with backbones (SCWiB). This differs from other definitions in how it controls the "chaining effect", by ensuring clusters satisfy a tolerant edge density criterion that takes into account cluster size. We implement algorithms for decomposing a graph of similarities into SCWiBs. We compare examples of output from SCWiB and the Markov Cluster Algorithm (MCL), and also compare some curated Arabidopsis thaliana gene families with the results of automatic clustering. We apply our method to 44 published angiosperm genomes with annotation, and discover that Amborella trichopoda is distinct from all the others in having substantially and systematically smaller proportions of moderate- and large-size gene families. CONCLUSIONS: We offer several possible evolutionary explanations for this result.
format Online
Article
Text
id pubmed-4240082
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42400822014-11-25 Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants Zheng, Chunfang Kononenko, Alexey Leebens-Mack, Jim Lyons, Eric Sankoff, David BMC Genomics Research BACKGROUND: Chaining is a major problem in constructing gene families. RESULTS: We define a new kind of cluster on graphs with strong and weak edges: soft cliques with backbones (SCWiB). This differs from other definitions in how it controls the "chaining effect", by ensuring clusters satisfy a tolerant edge density criterion that takes into account cluster size. We implement algorithms for decomposing a graph of similarities into SCWiBs. We compare examples of output from SCWiB and the Markov Cluster Algorithm (MCL), and also compare some curated Arabidopsis thaliana gene families with the results of automatic clustering. We apply our method to 44 published angiosperm genomes with annotation, and discover that Amborella trichopoda is distinct from all the others in having substantially and systematically smaller proportions of moderate- and large-size gene families. CONCLUSIONS: We offer several possible evolutionary explanations for this result. BioMed Central 2014-10-17 /pmc/articles/PMC4240082/ /pubmed/25572777 http://dx.doi.org/10.1186/1471-2164-15-S6-S8 Text en Copyright © 2014 Zheng et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zheng, Chunfang
Kononenko, Alexey
Leebens-Mack, Jim
Lyons, Eric
Sankoff, David
Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants
title Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants
title_full Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants
title_fullStr Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants
title_full_unstemmed Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants
title_short Gene families as soft cliques with backbones: Amborella contrasted with other flowering plants
title_sort gene families as soft cliques with backbones: amborella contrasted with other flowering plants
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4240082/
https://www.ncbi.nlm.nih.gov/pubmed/25572777
http://dx.doi.org/10.1186/1471-2164-15-S6-S8
work_keys_str_mv AT zhengchunfang genefamiliesassoftcliqueswithbackbonesamborellacontrastedwithotherfloweringplants
AT kononenkoalexey genefamiliesassoftcliqueswithbackbonesamborellacontrastedwithotherfloweringplants
AT leebensmackjim genefamiliesassoftcliqueswithbackbonesamborellacontrastedwithotherfloweringplants
AT lyonseric genefamiliesassoftcliqueswithbackbonesamborellacontrastedwithotherfloweringplants
AT sankoffdavid genefamiliesassoftcliqueswithbackbonesamborellacontrastedwithotherfloweringplants