Cargando…
454 antibody sequencing - error characterization and correction
BACKGROUND: 454 sequencing is currently the method of choice for sequencing of antibody repertoires and libraries containing large numbers (10(6 )to 10(12)) of different molecules with similar frameworks and variable regions which poses significant challenges for identifying sequencing errors. Ident...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3228814/ https://www.ncbi.nlm.nih.gov/pubmed/21992227 http://dx.doi.org/10.1186/1756-0500-4-404 |
_version_ | 1782217877743468544 |
---|---|
author | Prabakaran, Ponraj Streaker, Emily Chen, Weizao Dimitrov, Dimiter S |
author_facet | Prabakaran, Ponraj Streaker, Emily Chen, Weizao Dimitrov, Dimiter S |
author_sort | Prabakaran, Ponraj |
collection | PubMed |
description | BACKGROUND: 454 sequencing is currently the method of choice for sequencing of antibody repertoires and libraries containing large numbers (10(6 )to 10(12)) of different molecules with similar frameworks and variable regions which poses significant challenges for identifying sequencing errors. Identification and correction of sequencing errors in such mixtures is especially important for the exploration of complex maturation pathways and identification of putative germline predecessors of highly somatically mutated antibodies. To quantify and correct errors incorporated in 454 antibody sequencing, we sequenced six antibodies at different known concentrations twice over and compared them with the corresponding known sequences as determined by standard Sanger sequencing. RESULTS: We found that 454 antibody sequencing could lead to approximately 20% incorrect reads due to insertions that were mostly found at shorter homopolymer regions of 2-3 nucleotide length, and less so by insertions, deletions and other variants at random sites. Correction of errors might reduce this population of erroneous reads down to 5-10%. However, there are a certain number of errors accounting for 4-8% of the total reads that could not be corrected unless several repeated sequencing is performed, although this may not be possible for large diverse libraries and repertoires including complete sets of antibodies (antibodyomes). CONCLUSIONS: The experimental test procedure carried out for assessing 454 antibody sequencing errors reveals high (up to 20%) incorrect reads; the errors can be reduced down to 5-10% but not less which suggests the use of caution to avoid false discovery of antibody variants and diversity. |
format | Online Article Text |
id | pubmed-3228814 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32288142011-12-02 454 antibody sequencing - error characterization and correction Prabakaran, Ponraj Streaker, Emily Chen, Weizao Dimitrov, Dimiter S BMC Res Notes Research Article BACKGROUND: 454 sequencing is currently the method of choice for sequencing of antibody repertoires and libraries containing large numbers (10(6 )to 10(12)) of different molecules with similar frameworks and variable regions which poses significant challenges for identifying sequencing errors. Identification and correction of sequencing errors in such mixtures is especially important for the exploration of complex maturation pathways and identification of putative germline predecessors of highly somatically mutated antibodies. To quantify and correct errors incorporated in 454 antibody sequencing, we sequenced six antibodies at different known concentrations twice over and compared them with the corresponding known sequences as determined by standard Sanger sequencing. RESULTS: We found that 454 antibody sequencing could lead to approximately 20% incorrect reads due to insertions that were mostly found at shorter homopolymer regions of 2-3 nucleotide length, and less so by insertions, deletions and other variants at random sites. Correction of errors might reduce this population of erroneous reads down to 5-10%. However, there are a certain number of errors accounting for 4-8% of the total reads that could not be corrected unless several repeated sequencing is performed, although this may not be possible for large diverse libraries and repertoires including complete sets of antibodies (antibodyomes). CONCLUSIONS: The experimental test procedure carried out for assessing 454 antibody sequencing errors reveals high (up to 20%) incorrect reads; the errors can be reduced down to 5-10% but not less which suggests the use of caution to avoid false discovery of antibody variants and diversity. BioMed Central 2011-10-12 /pmc/articles/PMC3228814/ /pubmed/21992227 http://dx.doi.org/10.1186/1756-0500-4-404 Text en Copyright ©2011 Dimitrov et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Prabakaran, Ponraj Streaker, Emily Chen, Weizao Dimitrov, Dimiter S 454 antibody sequencing - error characterization and correction |
title | 454 antibody sequencing - error characterization and correction |
title_full | 454 antibody sequencing - error characterization and correction |
title_fullStr | 454 antibody sequencing - error characterization and correction |
title_full_unstemmed | 454 antibody sequencing - error characterization and correction |
title_short | 454 antibody sequencing - error characterization and correction |
title_sort | 454 antibody sequencing - error characterization and correction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3228814/ https://www.ncbi.nlm.nih.gov/pubmed/21992227 http://dx.doi.org/10.1186/1756-0500-4-404 |
work_keys_str_mv | AT prabakaranponraj 454antibodysequencingerrorcharacterizationandcorrection AT streakeremily 454antibodysequencingerrorcharacterizationandcorrection AT chenweizao 454antibodysequencingerrorcharacterizationandcorrection AT dimitrovdimiters 454antibodysequencingerrorcharacterizationandcorrection |