Cargando…

Whole genome sequencing data for two individuals of Pakistani descent

Here we report next-generation based whole genome sequencing of two individuals (H1 and H2) from a family of Pakistani descent. The genomic DNA was used to prepare paired-end libraries for whole-genome sequencing. Deep sequencing yielded 706.49 and 778.12 million mapped reads corresponding to 70.64...

Descripción completa

Detalles Bibliográficos
Autores principales: Khan, Shahid Y., Kabir, Firoz, M’Hamdi, Oussama, Jiao, Xiaodong, Naeem, Muhammad Asif, Khan, Shaheen N., Riazuddin, Sheikh, Hejtmancik, J. Fielding, Riazuddin, S. Amer
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6137601/
https://www.ncbi.nlm.nih.gov/pubmed/30204152
http://dx.doi.org/10.1038/sdata.2018.174
_version_ 1783355202943844352
author Khan, Shahid Y.
Kabir, Firoz
M’Hamdi, Oussama
Jiao, Xiaodong
Naeem, Muhammad Asif
Khan, Shaheen N.
Riazuddin, Sheikh
Hejtmancik, J. Fielding
Riazuddin, S. Amer
author_facet Khan, Shahid Y.
Kabir, Firoz
M’Hamdi, Oussama
Jiao, Xiaodong
Naeem, Muhammad Asif
Khan, Shaheen N.
Riazuddin, Sheikh
Hejtmancik, J. Fielding
Riazuddin, S. Amer
author_sort Khan, Shahid Y.
collection PubMed
description Here we report next-generation based whole genome sequencing of two individuals (H1 and H2) from a family of Pakistani descent. The genomic DNA was used to prepare paired-end libraries for whole-genome sequencing. Deep sequencing yielded 706.49 and 778.12 million mapped reads corresponding to 70.64 and 77.81 Gb sequence data and 23× and 25× average coverage for H1 and H2, respectively. Notably, a total of 448,544 and 470,683 novel variants, not present in the single nucleotide polymorphism database (dbSNP), were identified in H1 and H2, respectively. Comparative analysis identified 2,415,852 variants common in both genomes including 240,181 variants absent in the dbSNP. Principal component analysis linked the ancestry of both genomes with South Asian populations. In conclusion, we report whole genome sequences of two individuals from a family of Pakistani descent.
format Online
Article
Text
id pubmed-6137601
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-61376012018-09-17 Whole genome sequencing data for two individuals of Pakistani descent Khan, Shahid Y. Kabir, Firoz M’Hamdi, Oussama Jiao, Xiaodong Naeem, Muhammad Asif Khan, Shaheen N. Riazuddin, Sheikh Hejtmancik, J. Fielding Riazuddin, S. Amer Sci Data Data Descriptor Here we report next-generation based whole genome sequencing of two individuals (H1 and H2) from a family of Pakistani descent. The genomic DNA was used to prepare paired-end libraries for whole-genome sequencing. Deep sequencing yielded 706.49 and 778.12 million mapped reads corresponding to 70.64 and 77.81 Gb sequence data and 23× and 25× average coverage for H1 and H2, respectively. Notably, a total of 448,544 and 470,683 novel variants, not present in the single nucleotide polymorphism database (dbSNP), were identified in H1 and H2, respectively. Comparative analysis identified 2,415,852 variants common in both genomes including 240,181 variants absent in the dbSNP. Principal component analysis linked the ancestry of both genomes with South Asian populations. In conclusion, we report whole genome sequences of two individuals from a family of Pakistani descent. Nature Publishing Group 2018-09-11 /pmc/articles/PMC6137601/ /pubmed/30204152 http://dx.doi.org/10.1038/sdata.2018.174 Text en Copyright © 2018, The Author(s) http://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files made available in this article.
spellingShingle Data Descriptor
Khan, Shahid Y.
Kabir, Firoz
M’Hamdi, Oussama
Jiao, Xiaodong
Naeem, Muhammad Asif
Khan, Shaheen N.
Riazuddin, Sheikh
Hejtmancik, J. Fielding
Riazuddin, S. Amer
Whole genome sequencing data for two individuals of Pakistani descent
title Whole genome sequencing data for two individuals of Pakistani descent
title_full Whole genome sequencing data for two individuals of Pakistani descent
title_fullStr Whole genome sequencing data for two individuals of Pakistani descent
title_full_unstemmed Whole genome sequencing data for two individuals of Pakistani descent
title_short Whole genome sequencing data for two individuals of Pakistani descent
title_sort whole genome sequencing data for two individuals of pakistani descent
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6137601/
https://www.ncbi.nlm.nih.gov/pubmed/30204152
http://dx.doi.org/10.1038/sdata.2018.174
work_keys_str_mv AT khanshahidy wholegenomesequencingdatafortwoindividualsofpakistanidescent
AT kabirfiroz wholegenomesequencingdatafortwoindividualsofpakistanidescent
AT mhamdioussama wholegenomesequencingdatafortwoindividualsofpakistanidescent
AT jiaoxiaodong wholegenomesequencingdatafortwoindividualsofpakistanidescent
AT naeemmuhammadasif wholegenomesequencingdatafortwoindividualsofpakistanidescent
AT khanshaheenn wholegenomesequencingdatafortwoindividualsofpakistanidescent
AT riazuddinsheikh wholegenomesequencingdatafortwoindividualsofpakistanidescent
AT hejtmancikjfielding wholegenomesequencingdatafortwoindividualsofpakistanidescent
AT riazuddinsamer wholegenomesequencingdatafortwoindividualsofpakistanidescent