Cargando…
Comprehensive red blood cell and platelet antigen prediction from whole genome sequencing: proof of principle
BACKGROUND: There are 346 serologically defined red blood cell (RBC) antigens and 33 serologically defined platelet (PLT) antigens, most of which have known genetic changes in 45 RBC or six PLT genes that correlate with antigen expression. Polymorphic sites associated with antigen expression in the...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5019240/ https://www.ncbi.nlm.nih.gov/pubmed/26634332 http://dx.doi.org/10.1111/trf.13416 |
_version_ | 1782453021307830272 |
---|---|
author | Lane, William J. Westhoff, Connie M. Uy, Jon Michael Aguad, Maria Smeland‐Wagman, Robin Kaufman, Richard M. Rehm, Heidi L. Green, Robert C. Silberstein, Leslie E. |
author_facet | Lane, William J. Westhoff, Connie M. Uy, Jon Michael Aguad, Maria Smeland‐Wagman, Robin Kaufman, Richard M. Rehm, Heidi L. Green, Robert C. Silberstein, Leslie E. |
author_sort | Lane, William J. |
collection | PubMed |
description | BACKGROUND: There are 346 serologically defined red blood cell (RBC) antigens and 33 serologically defined platelet (PLT) antigens, most of which have known genetic changes in 45 RBC or six PLT genes that correlate with antigen expression. Polymorphic sites associated with antigen expression in the primary literature and reference databases are annotated according to nucleotide positions in cDNA. This makes antigen prediction from next‐generation sequencing data challenging, since it uses genomic coordinates. STUDY DESIGN AND METHODS: The conventional cDNA reference sequences for all known RBC and PLT genes that correlate with antigen expression were aligned to the human reference genome. The alignments allowed conversion of conventional cDNA nucleotide positions to the corresponding genomic coordinates. RBC and PLT antigen prediction was then performed using the human reference genome and whole genome sequencing (WGS) data with serologic confirmation. RESULTS: Some major differences and alignment issues were found when attempting to convert the conventional cDNA to human reference genome sequences for the following genes: ABO, A4GALT, RHD, RHCE, FUT3, ACKR1 (previously DARC), ACHE, FUT2, CR1, GCNT2, and RHAG. However, it was possible to create usable alignments, which facilitated the prediction of all RBC and PLT antigens with a known molecular basis from WGS data. Traditional serologic typing for 18 RBC antigens were in agreement with the WGS‐based antigen predictions, providing proof of principle for this approach. CONCLUSION: Detailed mapping of conventional cDNA annotated RBC and PLT alleles can enable accurate prediction of RBC and PLT antigens from whole genomic sequencing data. |
format | Online Article Text |
id | pubmed-5019240 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-50192402016-09-23 Comprehensive red blood cell and platelet antigen prediction from whole genome sequencing: proof of principle Lane, William J. Westhoff, Connie M. Uy, Jon Michael Aguad, Maria Smeland‐Wagman, Robin Kaufman, Richard M. Rehm, Heidi L. Green, Robert C. Silberstein, Leslie E. Transfusion Blood Group Genomics BACKGROUND: There are 346 serologically defined red blood cell (RBC) antigens and 33 serologically defined platelet (PLT) antigens, most of which have known genetic changes in 45 RBC or six PLT genes that correlate with antigen expression. Polymorphic sites associated with antigen expression in the primary literature and reference databases are annotated according to nucleotide positions in cDNA. This makes antigen prediction from next‐generation sequencing data challenging, since it uses genomic coordinates. STUDY DESIGN AND METHODS: The conventional cDNA reference sequences for all known RBC and PLT genes that correlate with antigen expression were aligned to the human reference genome. The alignments allowed conversion of conventional cDNA nucleotide positions to the corresponding genomic coordinates. RBC and PLT antigen prediction was then performed using the human reference genome and whole genome sequencing (WGS) data with serologic confirmation. RESULTS: Some major differences and alignment issues were found when attempting to convert the conventional cDNA to human reference genome sequences for the following genes: ABO, A4GALT, RHD, RHCE, FUT3, ACKR1 (previously DARC), ACHE, FUT2, CR1, GCNT2, and RHAG. However, it was possible to create usable alignments, which facilitated the prediction of all RBC and PLT antigens with a known molecular basis from WGS data. Traditional serologic typing for 18 RBC antigens were in agreement with the WGS‐based antigen predictions, providing proof of principle for this approach. CONCLUSION: Detailed mapping of conventional cDNA annotated RBC and PLT alleles can enable accurate prediction of RBC and PLT antigens from whole genomic sequencing data. John Wiley and Sons Inc. 2015-12-03 2016-03 /pmc/articles/PMC5019240/ /pubmed/26634332 http://dx.doi.org/10.1111/trf.13416 Text en © 2015 The Authors Transfusion published by Wiley Periodicals, Inc. on behalf of AABB This is an open access article under the terms of the Creative Commons Attribution‐NonCommercial‐NoDerivs (http://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | Blood Group Genomics Lane, William J. Westhoff, Connie M. Uy, Jon Michael Aguad, Maria Smeland‐Wagman, Robin Kaufman, Richard M. Rehm, Heidi L. Green, Robert C. Silberstein, Leslie E. Comprehensive red blood cell and platelet antigen prediction from whole genome sequencing: proof of principle |
title | Comprehensive red blood cell and platelet antigen prediction from whole genome sequencing: proof of principle |
title_full | Comprehensive red blood cell and platelet antigen prediction from whole genome sequencing: proof of principle |
title_fullStr | Comprehensive red blood cell and platelet antigen prediction from whole genome sequencing: proof of principle |
title_full_unstemmed | Comprehensive red blood cell and platelet antigen prediction from whole genome sequencing: proof of principle |
title_short | Comprehensive red blood cell and platelet antigen prediction from whole genome sequencing: proof of principle |
title_sort | comprehensive red blood cell and platelet antigen prediction from whole genome sequencing: proof of principle |
topic | Blood Group Genomics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5019240/ https://www.ncbi.nlm.nih.gov/pubmed/26634332 http://dx.doi.org/10.1111/trf.13416 |
work_keys_str_mv | AT lanewilliamj comprehensiveredbloodcellandplateletantigenpredictionfromwholegenomesequencingproofofprinciple AT westhoffconniem comprehensiveredbloodcellandplateletantigenpredictionfromwholegenomesequencingproofofprinciple AT uyjonmichael comprehensiveredbloodcellandplateletantigenpredictionfromwholegenomesequencingproofofprinciple AT aguadmaria comprehensiveredbloodcellandplateletantigenpredictionfromwholegenomesequencingproofofprinciple AT smelandwagmanrobin comprehensiveredbloodcellandplateletantigenpredictionfromwholegenomesequencingproofofprinciple AT kaufmanrichardm comprehensiveredbloodcellandplateletantigenpredictionfromwholegenomesequencingproofofprinciple AT rehmheidil comprehensiveredbloodcellandplateletantigenpredictionfromwholegenomesequencingproofofprinciple AT greenrobertc comprehensiveredbloodcellandplateletantigenpredictionfromwholegenomesequencingproofofprinciple AT silbersteinlesliee comprehensiveredbloodcellandplateletantigenpredictionfromwholegenomesequencingproofofprinciple AT comprehensiveredbloodcellandplateletantigenpredictionfromwholegenomesequencingproofofprinciple |