Cargando…

Sequencing and analysis of an Irish human genome

BACKGROUND: Recent studies generating complete human sequences from Asian, African and European subgroups have revealed population-specific variation and disease susceptibility loci. Here, choosing a DNA sample from a population of interest due to its relative geographical isolation and genetic impa...

Descripción completa

Detalles Bibliográficos
Autores principales: Tong, Pin, Prendergast, James GD, Lohan, Amanda J, Farrington, Susan M, Cronin, Simon, Friel, Nial, Bradley, Dan G, Hardiman, Orla, Evans, Alex, Wilson, James F, Loftus, Brendan
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2965383/
https://www.ncbi.nlm.nih.gov/pubmed/20822512
http://dx.doi.org/10.1186/gb-2010-11-9-r91
_version_ 1782189510134595584
author Tong, Pin
Prendergast, James GD
Lohan, Amanda J
Farrington, Susan M
Cronin, Simon
Friel, Nial
Bradley, Dan G
Hardiman, Orla
Evans, Alex
Wilson, James F
Loftus, Brendan
author_facet Tong, Pin
Prendergast, James GD
Lohan, Amanda J
Farrington, Susan M
Cronin, Simon
Friel, Nial
Bradley, Dan G
Hardiman, Orla
Evans, Alex
Wilson, James F
Loftus, Brendan
author_sort Tong, Pin
collection PubMed
description BACKGROUND: Recent studies generating complete human sequences from Asian, African and European subgroups have revealed population-specific variation and disease susceptibility loci. Here, choosing a DNA sample from a population of interest due to its relative geographical isolation and genetic impact on further populations, we extend the above studies through the generation of 11-fold coverage of the first Irish human genome sequence. RESULTS: Using sequence data from a branch of the European ancestral tree as yet unsequenced, we identify variants that may be specific to this population. Through comparisons with HapMap and previous genetic association studies, we identified novel disease-associated variants, including a novel nonsense variant putatively associated with inflammatory bowel disease. We describe a novel method for improving SNP calling accuracy at low genome coverage using haplotype information. This analysis has implications for future re-sequencing studies and validates the imputation of Irish haplotypes using data from the current Human Genome Diversity Cell Line Panel (HGDP-CEPH). Finally, we identify gene duplication events as constituting significant targets of recent positive selection in the human lineage. CONCLUSIONS: Our findings show that there remains utility in generating whole genome sequences to illustrate both general principles and reveal specific instances of human biology. With increasing access to low cost sequencing we would predict that even armed with the resources of a small research group a number of similar initiatives geared towards answering specific biological questions will emerge.
format Text
id pubmed-2965383
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29653832010-10-28 Sequencing and analysis of an Irish human genome Tong, Pin Prendergast, James GD Lohan, Amanda J Farrington, Susan M Cronin, Simon Friel, Nial Bradley, Dan G Hardiman, Orla Evans, Alex Wilson, James F Loftus, Brendan Genome Biol Research BACKGROUND: Recent studies generating complete human sequences from Asian, African and European subgroups have revealed population-specific variation and disease susceptibility loci. Here, choosing a DNA sample from a population of interest due to its relative geographical isolation and genetic impact on further populations, we extend the above studies through the generation of 11-fold coverage of the first Irish human genome sequence. RESULTS: Using sequence data from a branch of the European ancestral tree as yet unsequenced, we identify variants that may be specific to this population. Through comparisons with HapMap and previous genetic association studies, we identified novel disease-associated variants, including a novel nonsense variant putatively associated with inflammatory bowel disease. We describe a novel method for improving SNP calling accuracy at low genome coverage using haplotype information. This analysis has implications for future re-sequencing studies and validates the imputation of Irish haplotypes using data from the current Human Genome Diversity Cell Line Panel (HGDP-CEPH). Finally, we identify gene duplication events as constituting significant targets of recent positive selection in the human lineage. CONCLUSIONS: Our findings show that there remains utility in generating whole genome sequences to illustrate both general principles and reveal specific instances of human biology. With increasing access to low cost sequencing we would predict that even armed with the resources of a small research group a number of similar initiatives geared towards answering specific biological questions will emerge. BioMed Central 2010 2010-09-07 /pmc/articles/PMC2965383/ /pubmed/20822512 http://dx.doi.org/10.1186/gb-2010-11-9-r91 Text en Copyright ©2010 Tong et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Tong, Pin
Prendergast, James GD
Lohan, Amanda J
Farrington, Susan M
Cronin, Simon
Friel, Nial
Bradley, Dan G
Hardiman, Orla
Evans, Alex
Wilson, James F
Loftus, Brendan
Sequencing and analysis of an Irish human genome
title Sequencing and analysis of an Irish human genome
title_full Sequencing and analysis of an Irish human genome
title_fullStr Sequencing and analysis of an Irish human genome
title_full_unstemmed Sequencing and analysis of an Irish human genome
title_short Sequencing and analysis of an Irish human genome
title_sort sequencing and analysis of an irish human genome
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2965383/
https://www.ncbi.nlm.nih.gov/pubmed/20822512
http://dx.doi.org/10.1186/gb-2010-11-9-r91
work_keys_str_mv AT tongpin sequencingandanalysisofanirishhumangenome
AT prendergastjamesgd sequencingandanalysisofanirishhumangenome
AT lohanamandaj sequencingandanalysisofanirishhumangenome
AT farringtonsusanm sequencingandanalysisofanirishhumangenome
AT croninsimon sequencingandanalysisofanirishhumangenome
AT frielnial sequencingandanalysisofanirishhumangenome
AT bradleydang sequencingandanalysisofanirishhumangenome
AT hardimanorla sequencingandanalysisofanirishhumangenome
AT evansalex sequencingandanalysisofanirishhumangenome
AT wilsonjamesf sequencingandanalysisofanirishhumangenome
AT loftusbrendan sequencingandanalysisofanirishhumangenome