Cargando…

A multiway analysis for identifying high integrity bovine BACs

BACKGROUND: In large genomics projects involving many different types of analyses of bacterial artificial chromosomes (BACs), such as fingerprinting, end sequencing (BES) and full BAC sequencing there are many opportunities for the identities of BACs to become confused. However, by comparing the res...

Descripción completa

Detalles Bibliográficos
Autores principales: Ratnakumar, Abhirami, Barris, Wesley, McWilliam, Sean, Brauning, Rudiger, McEwan, John C, Snelling, Warren M, Dalrymple, Brian P
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2660975/
https://www.ncbi.nlm.nih.gov/pubmed/19166603
http://dx.doi.org/10.1186/1471-2164-10-46
_version_ 1782165770660216832
author Ratnakumar, Abhirami
Barris, Wesley
McWilliam, Sean
Brauning, Rudiger
McEwan, John C
Snelling, Warren M
Dalrymple, Brian P
author_facet Ratnakumar, Abhirami
Barris, Wesley
McWilliam, Sean
Brauning, Rudiger
McEwan, John C
Snelling, Warren M
Dalrymple, Brian P
author_sort Ratnakumar, Abhirami
collection PubMed
description BACKGROUND: In large genomics projects involving many different types of analyses of bacterial artificial chromosomes (BACs), such as fingerprinting, end sequencing (BES) and full BAC sequencing there are many opportunities for the identities of BACs to become confused. However, by comparing the results from the different analyses, inconsistencies can be identified and a set of high integrity BACs preferred for future research can be defined. RESULTS: The location of each bovine BAC in the BAC fingerprint-based genome map and in the genome assembly were compared based on the reported BESs, and for a smaller number of BACs the full sequence. BACs with consistent positions in all three datasets, or if the full sequence was not available, for both the fingerprint map and BES-based alignments, were deemed to be correctly positioned. BACs with consistent BES-based and fingerprint-based locations, but with conflicting locations based on the fully sequenced BAC, appeared to have been misidentified during sequencing, and included a number of apparently swapped BACs. Inconsistencies between BES-based and fingerprint map positions identified thirty one plates from the CHORI-240 library that appear to have suffered substantial systematic problems during the end-sequencing of the BACs. No systematic problems were identified in the fingerprinting of the BACs. Analysis of BACs overlapping in the assembly identified a small overrepresentation of clones with substantial overlap in the library and a substantial enrichment of highly overlapping BACs on the same plate in the CHORI-240 library. More than half of these BACs appear to have been present as duplicates on the original BAC-library plates and thus should be avoided in subsequent projects. CONCLUSION: Our analysis shows that ~95% of the bovine CHORI-240 library clones with both a BAC fingerprint and two BESs mapping to the genome in the expected orientations (~27% of all BACs) have consistent locations in the BAC fingerprint map and the genome assembly. We have developed a broadly applicable methodology for checking the integrity of BAC-based datasets even where only incomplete and partially assembled genomic sequence is available.
format Text
id pubmed-2660975
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26609752009-03-26 A multiway analysis for identifying high integrity bovine BACs Ratnakumar, Abhirami Barris, Wesley McWilliam, Sean Brauning, Rudiger McEwan, John C Snelling, Warren M Dalrymple, Brian P BMC Genomics Research Article BACKGROUND: In large genomics projects involving many different types of analyses of bacterial artificial chromosomes (BACs), such as fingerprinting, end sequencing (BES) and full BAC sequencing there are many opportunities for the identities of BACs to become confused. However, by comparing the results from the different analyses, inconsistencies can be identified and a set of high integrity BACs preferred for future research can be defined. RESULTS: The location of each bovine BAC in the BAC fingerprint-based genome map and in the genome assembly were compared based on the reported BESs, and for a smaller number of BACs the full sequence. BACs with consistent positions in all three datasets, or if the full sequence was not available, for both the fingerprint map and BES-based alignments, were deemed to be correctly positioned. BACs with consistent BES-based and fingerprint-based locations, but with conflicting locations based on the fully sequenced BAC, appeared to have been misidentified during sequencing, and included a number of apparently swapped BACs. Inconsistencies between BES-based and fingerprint map positions identified thirty one plates from the CHORI-240 library that appear to have suffered substantial systematic problems during the end-sequencing of the BACs. No systematic problems were identified in the fingerprinting of the BACs. Analysis of BACs overlapping in the assembly identified a small overrepresentation of clones with substantial overlap in the library and a substantial enrichment of highly overlapping BACs on the same plate in the CHORI-240 library. More than half of these BACs appear to have been present as duplicates on the original BAC-library plates and thus should be avoided in subsequent projects. CONCLUSION: Our analysis shows that ~95% of the bovine CHORI-240 library clones with both a BAC fingerprint and two BESs mapping to the genome in the expected orientations (~27% of all BACs) have consistent locations in the BAC fingerprint map and the genome assembly. We have developed a broadly applicable methodology for checking the integrity of BAC-based datasets even where only incomplete and partially assembled genomic sequence is available. BioMed Central 2009-01-23 /pmc/articles/PMC2660975/ /pubmed/19166603 http://dx.doi.org/10.1186/1471-2164-10-46 Text en Copyright © 2009 Ratnakumar et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ratnakumar, Abhirami
Barris, Wesley
McWilliam, Sean
Brauning, Rudiger
McEwan, John C
Snelling, Warren M
Dalrymple, Brian P
A multiway analysis for identifying high integrity bovine BACs
title A multiway analysis for identifying high integrity bovine BACs
title_full A multiway analysis for identifying high integrity bovine BACs
title_fullStr A multiway analysis for identifying high integrity bovine BACs
title_full_unstemmed A multiway analysis for identifying high integrity bovine BACs
title_short A multiway analysis for identifying high integrity bovine BACs
title_sort multiway analysis for identifying high integrity bovine bacs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2660975/
https://www.ncbi.nlm.nih.gov/pubmed/19166603
http://dx.doi.org/10.1186/1471-2164-10-46
work_keys_str_mv AT ratnakumarabhirami amultiwayanalysisforidentifyinghighintegritybovinebacs
AT barriswesley amultiwayanalysisforidentifyinghighintegritybovinebacs
AT mcwilliamsean amultiwayanalysisforidentifyinghighintegritybovinebacs
AT brauningrudiger amultiwayanalysisforidentifyinghighintegritybovinebacs
AT mcewanjohnc amultiwayanalysisforidentifyinghighintegritybovinebacs
AT snellingwarrenm amultiwayanalysisforidentifyinghighintegritybovinebacs
AT dalrymplebrianp amultiwayanalysisforidentifyinghighintegritybovinebacs
AT ratnakumarabhirami multiwayanalysisforidentifyinghighintegritybovinebacs
AT barriswesley multiwayanalysisforidentifyinghighintegritybovinebacs
AT mcwilliamsean multiwayanalysisforidentifyinghighintegritybovinebacs
AT brauningrudiger multiwayanalysisforidentifyinghighintegritybovinebacs
AT mcewanjohnc multiwayanalysisforidentifyinghighintegritybovinebacs
AT snellingwarrenm multiwayanalysisforidentifyinghighintegritybovinebacs
AT dalrymplebrianp multiwayanalysisforidentifyinghighintegritybovinebacs