Cargando…
GLaMST: grow lineages along minimum spanning tree for b cell receptor sequencing data
BACKGROUND: B cell affinity maturation enables B cells to generate high-affinity antibodies. This process involves somatic hypermutation of B cell immunoglobulin receptor (BCR) genes and selection by their ability to bind antigens. Lineage trees are used to describe this microevolution of B cell imm...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7488003/ https://www.ncbi.nlm.nih.gov/pubmed/32900378 http://dx.doi.org/10.1186/s12864-020-06936-w |
_version_ | 1783581604342398976 |
---|---|
author | Yang, Xingyu Tipton, Christopher M. Woodruff, Matthew C. Zhou, Enlu Lee, F. Eun-Hyung Sanz, Inãki Qiu, Peng |
author_facet | Yang, Xingyu Tipton, Christopher M. Woodruff, Matthew C. Zhou, Enlu Lee, F. Eun-Hyung Sanz, Inãki Qiu, Peng |
author_sort | Yang, Xingyu |
collection | PubMed |
description | BACKGROUND: B cell affinity maturation enables B cells to generate high-affinity antibodies. This process involves somatic hypermutation of B cell immunoglobulin receptor (BCR) genes and selection by their ability to bind antigens. Lineage trees are used to describe this microevolution of B cell immunoglobulin genes. In a lineage tree, each node is one BCR sequence that mutated from the germinal center and each directed edge represents a single base mutation, insertion or deletion. In BCR sequencing data, the observed data only contains a subset of BCR sequences in this microevolution process. Therefore, reconstructing the lineage tree from experimental data requires algorithms to build the tree based on partially observed tree nodes. RESULTS: We developed a new algorithm named Grow Lineages along Minimum Spanning Tree (GLaMST), which efficiently reconstruct the lineage tree given observed BCR sequences that correspond to a subset of the tree nodes. Through comparison using simulated and real data, GLaMST outperforms existing algorithms in simulations with high rates of mutation, insertion and deletion, and generates lineage trees with smaller size and closer to ground truth according to tree features that highly correlated with selection pressure. CONCLUSIONS: GLaMST outperforms state-of-art in reconstruction of the BCR lineage tree in both efficiency and accuracy. Integrating it into existing BCR sequencing analysis frameworks can significant improve lineage tree reconstruction aspect of the analysis. |
format | Online Article Text |
id | pubmed-7488003 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-74880032020-09-16 GLaMST: grow lineages along minimum spanning tree for b cell receptor sequencing data Yang, Xingyu Tipton, Christopher M. Woodruff, Matthew C. Zhou, Enlu Lee, F. Eun-Hyung Sanz, Inãki Qiu, Peng BMC Genomics Research BACKGROUND: B cell affinity maturation enables B cells to generate high-affinity antibodies. This process involves somatic hypermutation of B cell immunoglobulin receptor (BCR) genes and selection by their ability to bind antigens. Lineage trees are used to describe this microevolution of B cell immunoglobulin genes. In a lineage tree, each node is one BCR sequence that mutated from the germinal center and each directed edge represents a single base mutation, insertion or deletion. In BCR sequencing data, the observed data only contains a subset of BCR sequences in this microevolution process. Therefore, reconstructing the lineage tree from experimental data requires algorithms to build the tree based on partially observed tree nodes. RESULTS: We developed a new algorithm named Grow Lineages along Minimum Spanning Tree (GLaMST), which efficiently reconstruct the lineage tree given observed BCR sequences that correspond to a subset of the tree nodes. Through comparison using simulated and real data, GLaMST outperforms existing algorithms in simulations with high rates of mutation, insertion and deletion, and generates lineage trees with smaller size and closer to ground truth according to tree features that highly correlated with selection pressure. CONCLUSIONS: GLaMST outperforms state-of-art in reconstruction of the BCR lineage tree in both efficiency and accuracy. Integrating it into existing BCR sequencing analysis frameworks can significant improve lineage tree reconstruction aspect of the analysis. BioMed Central 2020-09-09 /pmc/articles/PMC7488003/ /pubmed/32900378 http://dx.doi.org/10.1186/s12864-020-06936-w Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Yang, Xingyu Tipton, Christopher M. Woodruff, Matthew C. Zhou, Enlu Lee, F. Eun-Hyung Sanz, Inãki Qiu, Peng GLaMST: grow lineages along minimum spanning tree for b cell receptor sequencing data |
title | GLaMST: grow lineages along minimum spanning tree for b cell receptor sequencing data |
title_full | GLaMST: grow lineages along minimum spanning tree for b cell receptor sequencing data |
title_fullStr | GLaMST: grow lineages along minimum spanning tree for b cell receptor sequencing data |
title_full_unstemmed | GLaMST: grow lineages along minimum spanning tree for b cell receptor sequencing data |
title_short | GLaMST: grow lineages along minimum spanning tree for b cell receptor sequencing data |
title_sort | glamst: grow lineages along minimum spanning tree for b cell receptor sequencing data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7488003/ https://www.ncbi.nlm.nih.gov/pubmed/32900378 http://dx.doi.org/10.1186/s12864-020-06936-w |
work_keys_str_mv | AT yangxingyu glamstgrowlineagesalongminimumspanningtreeforbcellreceptorsequencingdata AT tiptonchristopherm glamstgrowlineagesalongminimumspanningtreeforbcellreceptorsequencingdata AT woodruffmatthewc glamstgrowlineagesalongminimumspanningtreeforbcellreceptorsequencingdata AT zhouenlu glamstgrowlineagesalongminimumspanningtreeforbcellreceptorsequencingdata AT leefeunhyung glamstgrowlineagesalongminimumspanningtreeforbcellreceptorsequencingdata AT sanzinaki glamstgrowlineagesalongminimumspanningtreeforbcellreceptorsequencingdata AT qiupeng glamstgrowlineagesalongminimumspanningtreeforbcellreceptorsequencingdata |