Cargando…

A novel approach to sequence validating protein expression clones with automated decision making

BACKGROUND: Whereas the molecular assembly of protein expression clones is readily automated and routinely accomplished in high throughput, sequence verification of these clones is still largely performed manually, an arduous and time consuming process. The ultimate goal of validation is to determin...

Descripción completa

Detalles Bibliográficos
Autores principales: Taycher, Elena, Rolfs, Andreas, Hu, Yanhui, Zuo, Dongmei, Mohr, Stephanie E, Williamson, Janice, LaBaer, Joshua
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1914086/
https://www.ncbi.nlm.nih.gov/pubmed/17567908
http://dx.doi.org/10.1186/1471-2105-8-198
_version_ 1782134103317938176
author Taycher, Elena
Rolfs, Andreas
Hu, Yanhui
Zuo, Dongmei
Mohr, Stephanie E
Williamson, Janice
LaBaer, Joshua
author_facet Taycher, Elena
Rolfs, Andreas
Hu, Yanhui
Zuo, Dongmei
Mohr, Stephanie E
Williamson, Janice
LaBaer, Joshua
author_sort Taycher, Elena
collection PubMed
description BACKGROUND: Whereas the molecular assembly of protein expression clones is readily automated and routinely accomplished in high throughput, sequence verification of these clones is still largely performed manually, an arduous and time consuming process. The ultimate goal of validation is to determine if a given plasmid clone matches its reference sequence sufficiently to be "acceptable" for use in protein expression experiments. Given the accelerating increase in availability of tens of thousands of unverified clones, there is a strong demand for rapid, efficient and accurate software that automates clone validation. RESULTS: We have developed an Automated Clone Evaluation (ACE) system – the first comprehensive, multi-platform, web-based plasmid sequence verification software package. ACE automates the clone verification process by defining each clone sequence as a list of multidimensional discrepancy objects, each describing a difference between the clone and its expected sequence including the resulting polypeptide consequences. To evaluate clones automatically, this list can be compared against user acceptance criteria that specify the allowable number of discrepancies of each type. This strategy allows users to re-evaluate the same set of clones against different acceptance criteria as needed for use in other experiments. ACE manages the entire sequence validation process including contig management, identifying and annotating discrepancies, determining if discrepancies correspond to polymorphisms and clone finishing. Designed to manage thousands of clones simultaneously, ACE maintains a relational database to store information about clones at various completion stages, project processing parameters and acceptance criteria. In a direct comparison, the automated analysis by ACE took less time and was more accurate than a manual analysis of a 93 gene clone set. CONCLUSION: ACE was designed to facilitate high throughput clone sequence verification projects. The software has been used successfully to evaluate more than 55,000 clones at the Harvard Institute of Proteomics. The software dramatically reduced the amount of time and labor required to evaluate clone sequences and decreased the number of missed sequence discrepancies, which commonly occur during manual evaluation. In addition, ACE helped to reduce the number of sequencing reads needed to achieve adequate coverage for making decisions on clones.
format Text
id pubmed-1914086
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-19140862007-07-13 A novel approach to sequence validating protein expression clones with automated decision making Taycher, Elena Rolfs, Andreas Hu, Yanhui Zuo, Dongmei Mohr, Stephanie E Williamson, Janice LaBaer, Joshua BMC Bioinformatics Software BACKGROUND: Whereas the molecular assembly of protein expression clones is readily automated and routinely accomplished in high throughput, sequence verification of these clones is still largely performed manually, an arduous and time consuming process. The ultimate goal of validation is to determine if a given plasmid clone matches its reference sequence sufficiently to be "acceptable" for use in protein expression experiments. Given the accelerating increase in availability of tens of thousands of unverified clones, there is a strong demand for rapid, efficient and accurate software that automates clone validation. RESULTS: We have developed an Automated Clone Evaluation (ACE) system – the first comprehensive, multi-platform, web-based plasmid sequence verification software package. ACE automates the clone verification process by defining each clone sequence as a list of multidimensional discrepancy objects, each describing a difference between the clone and its expected sequence including the resulting polypeptide consequences. To evaluate clones automatically, this list can be compared against user acceptance criteria that specify the allowable number of discrepancies of each type. This strategy allows users to re-evaluate the same set of clones against different acceptance criteria as needed for use in other experiments. ACE manages the entire sequence validation process including contig management, identifying and annotating discrepancies, determining if discrepancies correspond to polymorphisms and clone finishing. Designed to manage thousands of clones simultaneously, ACE maintains a relational database to store information about clones at various completion stages, project processing parameters and acceptance criteria. In a direct comparison, the automated analysis by ACE took less time and was more accurate than a manual analysis of a 93 gene clone set. CONCLUSION: ACE was designed to facilitate high throughput clone sequence verification projects. The software has been used successfully to evaluate more than 55,000 clones at the Harvard Institute of Proteomics. The software dramatically reduced the amount of time and labor required to evaluate clone sequences and decreased the number of missed sequence discrepancies, which commonly occur during manual evaluation. In addition, ACE helped to reduce the number of sequencing reads needed to achieve adequate coverage for making decisions on clones. BioMed Central 2007-06-13 /pmc/articles/PMC1914086/ /pubmed/17567908 http://dx.doi.org/10.1186/1471-2105-8-198 Text en Copyright © 2007 Taycher et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Taycher, Elena
Rolfs, Andreas
Hu, Yanhui
Zuo, Dongmei
Mohr, Stephanie E
Williamson, Janice
LaBaer, Joshua
A novel approach to sequence validating protein expression clones with automated decision making
title A novel approach to sequence validating protein expression clones with automated decision making
title_full A novel approach to sequence validating protein expression clones with automated decision making
title_fullStr A novel approach to sequence validating protein expression clones with automated decision making
title_full_unstemmed A novel approach to sequence validating protein expression clones with automated decision making
title_short A novel approach to sequence validating protein expression clones with automated decision making
title_sort novel approach to sequence validating protein expression clones with automated decision making
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1914086/
https://www.ncbi.nlm.nih.gov/pubmed/17567908
http://dx.doi.org/10.1186/1471-2105-8-198
work_keys_str_mv AT taycherelena anovelapproachtosequencevalidatingproteinexpressioncloneswithautomateddecisionmaking
AT rolfsandreas anovelapproachtosequencevalidatingproteinexpressioncloneswithautomateddecisionmaking
AT huyanhui anovelapproachtosequencevalidatingproteinexpressioncloneswithautomateddecisionmaking
AT zuodongmei anovelapproachtosequencevalidatingproteinexpressioncloneswithautomateddecisionmaking
AT mohrstephaniee anovelapproachtosequencevalidatingproteinexpressioncloneswithautomateddecisionmaking
AT williamsonjanice anovelapproachtosequencevalidatingproteinexpressioncloneswithautomateddecisionmaking
AT labaerjoshua anovelapproachtosequencevalidatingproteinexpressioncloneswithautomateddecisionmaking
AT taycherelena novelapproachtosequencevalidatingproteinexpressioncloneswithautomateddecisionmaking
AT rolfsandreas novelapproachtosequencevalidatingproteinexpressioncloneswithautomateddecisionmaking
AT huyanhui novelapproachtosequencevalidatingproteinexpressioncloneswithautomateddecisionmaking
AT zuodongmei novelapproachtosequencevalidatingproteinexpressioncloneswithautomateddecisionmaking
AT mohrstephaniee novelapproachtosequencevalidatingproteinexpressioncloneswithautomateddecisionmaking
AT williamsonjanice novelapproachtosequencevalidatingproteinexpressioncloneswithautomateddecisionmaking
AT labaerjoshua novelapproachtosequencevalidatingproteinexpressioncloneswithautomateddecisionmaking