Cargando…
Whole Genome Sequence of a Turkish Individual
Although whole human genome sequencing can be done with readily available technical and financial resources, the need for detailed analyses of genomes of certain populations still exists. Here we present, for the first time, sequencing and analysis of a Turkish human genome. We have performed 35x co...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3887021/ https://www.ncbi.nlm.nih.gov/pubmed/24416366 http://dx.doi.org/10.1371/journal.pone.0085233 |
_version_ | 1782478957517471744 |
---|---|
author | Dogan, Haluk Can, Handan Otu, Hasan H. |
author_facet | Dogan, Haluk Can, Handan Otu, Hasan H. |
author_sort | Dogan, Haluk |
collection | PubMed |
description | Although whole human genome sequencing can be done with readily available technical and financial resources, the need for detailed analyses of genomes of certain populations still exists. Here we present, for the first time, sequencing and analysis of a Turkish human genome. We have performed 35x coverage using paired-end sequencing, where over 95% of sequencing reads are mapped to the reference genome covering more than 99% of the bases. The assembly of unmapped reads rendered 11,654 contigs, 2,168 of which did not reveal any homology to known sequences, resulting in ∼1 Mbp of unmapped sequence. Single nucleotide polymorphism (SNP) discovery resulted in 3,537,794 SNP calls with 29,184 SNPs identified in coding regions, where 106 were nonsense and 259 were categorized as having a high-impact effect. The homo/hetero zygosity (1,415,123∶2,122,671 or 1∶1.5) and transition/transversion ratios (2,383,204∶1,154,590 or 2.06∶1) were within expected limits. Of the identified SNPs, 480,396 were potentially novel with 2,925 in coding regions, including 48 nonsense and 95 high-impact SNPs. Functional analysis of novel high-impact SNPs revealed various interaction networks, notably involving hereditary and neurological disorders or diseases. Assembly results indicated 713,640 indels (1∶1.09 insertion/deletion ratio), ranging from −52 bp to 34 bp in length and causing about 180 codon insertion/deletions and 246 frame shifts. Using paired-end- and read-depth-based methods, we discovered 9,109 structural variants and compared our variant findings with other populations. Our results suggest that whole genome sequencing is a valuable tool for understanding variations in the human genome across different populations. Detailed analyses of genomes of diverse origins greatly benefits research in genetics and medicine and should be conducted on a larger scale. |
format | Online Article Text |
id | pubmed-3887021 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-38870212014-01-10 Whole Genome Sequence of a Turkish Individual Dogan, Haluk Can, Handan Otu, Hasan H. PLoS One Research Article Although whole human genome sequencing can be done with readily available technical and financial resources, the need for detailed analyses of genomes of certain populations still exists. Here we present, for the first time, sequencing and analysis of a Turkish human genome. We have performed 35x coverage using paired-end sequencing, where over 95% of sequencing reads are mapped to the reference genome covering more than 99% of the bases. The assembly of unmapped reads rendered 11,654 contigs, 2,168 of which did not reveal any homology to known sequences, resulting in ∼1 Mbp of unmapped sequence. Single nucleotide polymorphism (SNP) discovery resulted in 3,537,794 SNP calls with 29,184 SNPs identified in coding regions, where 106 were nonsense and 259 were categorized as having a high-impact effect. The homo/hetero zygosity (1,415,123∶2,122,671 or 1∶1.5) and transition/transversion ratios (2,383,204∶1,154,590 or 2.06∶1) were within expected limits. Of the identified SNPs, 480,396 were potentially novel with 2,925 in coding regions, including 48 nonsense and 95 high-impact SNPs. Functional analysis of novel high-impact SNPs revealed various interaction networks, notably involving hereditary and neurological disorders or diseases. Assembly results indicated 713,640 indels (1∶1.09 insertion/deletion ratio), ranging from −52 bp to 34 bp in length and causing about 180 codon insertion/deletions and 246 frame shifts. Using paired-end- and read-depth-based methods, we discovered 9,109 structural variants and compared our variant findings with other populations. Our results suggest that whole genome sequencing is a valuable tool for understanding variations in the human genome across different populations. Detailed analyses of genomes of diverse origins greatly benefits research in genetics and medicine and should be conducted on a larger scale. Public Library of Science 2014-01-09 /pmc/articles/PMC3887021/ /pubmed/24416366 http://dx.doi.org/10.1371/journal.pone.0085233 Text en © 2014 Dogan et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Dogan, Haluk Can, Handan Otu, Hasan H. Whole Genome Sequence of a Turkish Individual |
title | Whole Genome Sequence of a Turkish Individual |
title_full | Whole Genome Sequence of a Turkish Individual |
title_fullStr | Whole Genome Sequence of a Turkish Individual |
title_full_unstemmed | Whole Genome Sequence of a Turkish Individual |
title_short | Whole Genome Sequence of a Turkish Individual |
title_sort | whole genome sequence of a turkish individual |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3887021/ https://www.ncbi.nlm.nih.gov/pubmed/24416366 http://dx.doi.org/10.1371/journal.pone.0085233 |
work_keys_str_mv | AT doganhaluk wholegenomesequenceofaturkishindividual AT canhandan wholegenomesequenceofaturkishindividual AT otuhasanh wholegenomesequenceofaturkishindividual |