Cargando…

A simple procedure for directly obtaining haplotype sequences of diploid genomes

BACKGROUND: Almost all genome sequencing projects neglect the fact that diploid organisms contain two genome copies and consequently what is published is a composite of the two. This means that the relationship between alternate alleles at two or more linked loci is lost. We have developed a simplif...

Descripción completa

Detalles Bibliográficos
Autores principales: Noyes, Harry A., Daly, Derek, Goodhead, Ian, Kay, Suzanne, Kemp, Steven J., Kenny, John, Saccheri, Ilik, Schnabel, Robert D., Taylor, Jeremy F., Hall, Neil
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4551521/
https://www.ncbi.nlm.nih.gov/pubmed/26311067
http://dx.doi.org/10.1186/s12864-015-1818-4
_version_ 1782387579023261696
author Noyes, Harry A.
Daly, Derek
Goodhead, Ian
Kay, Suzanne
Kemp, Steven J.
Kenny, John
Saccheri, Ilik
Schnabel, Robert D.
Taylor, Jeremy F.
Hall, Neil
author_facet Noyes, Harry A.
Daly, Derek
Goodhead, Ian
Kay, Suzanne
Kemp, Steven J.
Kenny, John
Saccheri, Ilik
Schnabel, Robert D.
Taylor, Jeremy F.
Hall, Neil
author_sort Noyes, Harry A.
collection PubMed
description BACKGROUND: Almost all genome sequencing projects neglect the fact that diploid organisms contain two genome copies and consequently what is published is a composite of the two. This means that the relationship between alternate alleles at two or more linked loci is lost. We have developed a simplified method of directly obtaining the haploid sequences of each genome copy from an individual organism. RESULTS: The diploid sequences of three groups of cattle samples were obtained using a simple sample preparation procedure requiring only a microscope and a haemocytometer. Samples were: 1) lymphocytes from a single Angus steer; 2) sperm cells from an Angus bull; 3) lymphocytes from East African Zebu (EAZ) cattle collected and processed in a field laboratory in Eastern Kenya. Haploid sequence from a fosmid library prepared from lymphocytes of an EAZ cow was used for comparison. Cells were serially diluted to a concentration of one cell per microlitre by counting with a haemocytometer at each dilution. One microlitre samples, each potentially containing a single cell, were lysed and divided into six aliquots (except for the sperm samples which were not divided into aliquots). Each aliquot was amplified with phi29 polymerase and sequenced. Contigs were obtained by mapping to the bovine UMD3.1 reference genome assembly and scaffolds were assembled by joining adjacent contigs that were within a threshold distance of each other. Scaffolds that appeared to contain artefacts of CNV or repeats were filtered out leaving scaffolds with an N50 length of 27–133 kb and a 88–98 % genome coverage. SNP haplotypes were assembled with the Single Individual Haplotyper program to generate an N50 size of 97–201 kb but only ~27–68 % genome coverage. This method can be used in any laboratory with no special equipment at only slightly higher costs than conventional diploid genome sequencing. A substantial body of software for analysis and workflow management was written and is available as supplementary data. CONCLUSIONS: We have developed a set of laboratory protocols and software tools that will enable any laboratory to obtain haplotype sequences at only modestly greater cost than traditional mixed diploid sequences.
format Online
Article
Text
id pubmed-4551521
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45515212015-08-29 A simple procedure for directly obtaining haplotype sequences of diploid genomes Noyes, Harry A. Daly, Derek Goodhead, Ian Kay, Suzanne Kemp, Steven J. Kenny, John Saccheri, Ilik Schnabel, Robert D. Taylor, Jeremy F. Hall, Neil BMC Genomics Methodology Article BACKGROUND: Almost all genome sequencing projects neglect the fact that diploid organisms contain two genome copies and consequently what is published is a composite of the two. This means that the relationship between alternate alleles at two or more linked loci is lost. We have developed a simplified method of directly obtaining the haploid sequences of each genome copy from an individual organism. RESULTS: The diploid sequences of three groups of cattle samples were obtained using a simple sample preparation procedure requiring only a microscope and a haemocytometer. Samples were: 1) lymphocytes from a single Angus steer; 2) sperm cells from an Angus bull; 3) lymphocytes from East African Zebu (EAZ) cattle collected and processed in a field laboratory in Eastern Kenya. Haploid sequence from a fosmid library prepared from lymphocytes of an EAZ cow was used for comparison. Cells were serially diluted to a concentration of one cell per microlitre by counting with a haemocytometer at each dilution. One microlitre samples, each potentially containing a single cell, were lysed and divided into six aliquots (except for the sperm samples which were not divided into aliquots). Each aliquot was amplified with phi29 polymerase and sequenced. Contigs were obtained by mapping to the bovine UMD3.1 reference genome assembly and scaffolds were assembled by joining adjacent contigs that were within a threshold distance of each other. Scaffolds that appeared to contain artefacts of CNV or repeats were filtered out leaving scaffolds with an N50 length of 27–133 kb and a 88–98 % genome coverage. SNP haplotypes were assembled with the Single Individual Haplotyper program to generate an N50 size of 97–201 kb but only ~27–68 % genome coverage. This method can be used in any laboratory with no special equipment at only slightly higher costs than conventional diploid genome sequencing. A substantial body of software for analysis and workflow management was written and is available as supplementary data. CONCLUSIONS: We have developed a set of laboratory protocols and software tools that will enable any laboratory to obtain haplotype sequences at only modestly greater cost than traditional mixed diploid sequences. BioMed Central 2015-08-28 /pmc/articles/PMC4551521/ /pubmed/26311067 http://dx.doi.org/10.1186/s12864-015-1818-4 Text en © Noyes et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Noyes, Harry A.
Daly, Derek
Goodhead, Ian
Kay, Suzanne
Kemp, Steven J.
Kenny, John
Saccheri, Ilik
Schnabel, Robert D.
Taylor, Jeremy F.
Hall, Neil
A simple procedure for directly obtaining haplotype sequences of diploid genomes
title A simple procedure for directly obtaining haplotype sequences of diploid genomes
title_full A simple procedure for directly obtaining haplotype sequences of diploid genomes
title_fullStr A simple procedure for directly obtaining haplotype sequences of diploid genomes
title_full_unstemmed A simple procedure for directly obtaining haplotype sequences of diploid genomes
title_short A simple procedure for directly obtaining haplotype sequences of diploid genomes
title_sort simple procedure for directly obtaining haplotype sequences of diploid genomes
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4551521/
https://www.ncbi.nlm.nih.gov/pubmed/26311067
http://dx.doi.org/10.1186/s12864-015-1818-4
work_keys_str_mv AT noyesharrya asimpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT dalyderek asimpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT goodheadian asimpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT kaysuzanne asimpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT kempstevenj asimpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT kennyjohn asimpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT saccheriilik asimpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT schnabelrobertd asimpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT taylorjeremyf asimpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT hallneil asimpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT noyesharrya simpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT dalyderek simpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT goodheadian simpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT kaysuzanne simpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT kempstevenj simpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT kennyjohn simpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT saccheriilik simpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT schnabelrobertd simpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT taylorjeremyf simpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes
AT hallneil simpleprocedurefordirectlyobtaininghaplotypesequencesofdiploidgenomes