Cargando…

Whole Genome Sequence of a Turkish Individual

Although whole human genome sequencing can be done with readily available technical and financial resources, the need for detailed analyses of genomes of certain populations still exists. Here we present, for the first time, sequencing and analysis of a Turkish human genome. We have performed 35x co...

Descripción completa

Detalles Bibliográficos
Autores principales: Dogan, Haluk, Can, Handan, Otu, Hasan H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3887021/
https://www.ncbi.nlm.nih.gov/pubmed/24416366
http://dx.doi.org/10.1371/journal.pone.0085233
_version_ 1782478957517471744
author Dogan, Haluk
Can, Handan
Otu, Hasan H.
author_facet Dogan, Haluk
Can, Handan
Otu, Hasan H.
author_sort Dogan, Haluk
collection PubMed
description Although whole human genome sequencing can be done with readily available technical and financial resources, the need for detailed analyses of genomes of certain populations still exists. Here we present, for the first time, sequencing and analysis of a Turkish human genome. We have performed 35x coverage using paired-end sequencing, where over 95% of sequencing reads are mapped to the reference genome covering more than 99% of the bases. The assembly of unmapped reads rendered 11,654 contigs, 2,168 of which did not reveal any homology to known sequences, resulting in ∼1 Mbp of unmapped sequence. Single nucleotide polymorphism (SNP) discovery resulted in 3,537,794 SNP calls with 29,184 SNPs identified in coding regions, where 106 were nonsense and 259 were categorized as having a high-impact effect. The homo/hetero zygosity (1,415,123∶2,122,671 or 1∶1.5) and transition/transversion ratios (2,383,204∶1,154,590 or 2.06∶1) were within expected limits. Of the identified SNPs, 480,396 were potentially novel with 2,925 in coding regions, including 48 nonsense and 95 high-impact SNPs. Functional analysis of novel high-impact SNPs revealed various interaction networks, notably involving hereditary and neurological disorders or diseases. Assembly results indicated 713,640 indels (1∶1.09 insertion/deletion ratio), ranging from −52 bp to 34 bp in length and causing about 180 codon insertion/deletions and 246 frame shifts. Using paired-end- and read-depth-based methods, we discovered 9,109 structural variants and compared our variant findings with other populations. Our results suggest that whole genome sequencing is a valuable tool for understanding variations in the human genome across different populations. Detailed analyses of genomes of diverse origins greatly benefits research in genetics and medicine and should be conducted on a larger scale.
format Online
Article
Text
id pubmed-3887021
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38870212014-01-10 Whole Genome Sequence of a Turkish Individual Dogan, Haluk Can, Handan Otu, Hasan H. PLoS One Research Article Although whole human genome sequencing can be done with readily available technical and financial resources, the need for detailed analyses of genomes of certain populations still exists. Here we present, for the first time, sequencing and analysis of a Turkish human genome. We have performed 35x coverage using paired-end sequencing, where over 95% of sequencing reads are mapped to the reference genome covering more than 99% of the bases. The assembly of unmapped reads rendered 11,654 contigs, 2,168 of which did not reveal any homology to known sequences, resulting in ∼1 Mbp of unmapped sequence. Single nucleotide polymorphism (SNP) discovery resulted in 3,537,794 SNP calls with 29,184 SNPs identified in coding regions, where 106 were nonsense and 259 were categorized as having a high-impact effect. The homo/hetero zygosity (1,415,123∶2,122,671 or 1∶1.5) and transition/transversion ratios (2,383,204∶1,154,590 or 2.06∶1) were within expected limits. Of the identified SNPs, 480,396 were potentially novel with 2,925 in coding regions, including 48 nonsense and 95 high-impact SNPs. Functional analysis of novel high-impact SNPs revealed various interaction networks, notably involving hereditary and neurological disorders or diseases. Assembly results indicated 713,640 indels (1∶1.09 insertion/deletion ratio), ranging from −52 bp to 34 bp in length and causing about 180 codon insertion/deletions and 246 frame shifts. Using paired-end- and read-depth-based methods, we discovered 9,109 structural variants and compared our variant findings with other populations. Our results suggest that whole genome sequencing is a valuable tool for understanding variations in the human genome across different populations. Detailed analyses of genomes of diverse origins greatly benefits research in genetics and medicine and should be conducted on a larger scale. Public Library of Science 2014-01-09 /pmc/articles/PMC3887021/ /pubmed/24416366 http://dx.doi.org/10.1371/journal.pone.0085233 Text en © 2014 Dogan et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Dogan, Haluk
Can, Handan
Otu, Hasan H.
Whole Genome Sequence of a Turkish Individual
title Whole Genome Sequence of a Turkish Individual
title_full Whole Genome Sequence of a Turkish Individual
title_fullStr Whole Genome Sequence of a Turkish Individual
title_full_unstemmed Whole Genome Sequence of a Turkish Individual
title_short Whole Genome Sequence of a Turkish Individual
title_sort whole genome sequence of a turkish individual
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3887021/
https://www.ncbi.nlm.nih.gov/pubmed/24416366
http://dx.doi.org/10.1371/journal.pone.0085233
work_keys_str_mv AT doganhaluk wholegenomesequenceofaturkishindividual
AT canhandan wholegenomesequenceofaturkishindividual
AT otuhasanh wholegenomesequenceofaturkishindividual