Cargando…
Whole genome sequencing data for two individuals of Pakistani descent
Here we report next-generation based whole genome sequencing of two individuals (H1 and H2) from a family of Pakistani descent. The genomic DNA was used to prepare paired-end libraries for whole-genome sequencing. Deep sequencing yielded 706.49 and 778.12 million mapped reads corresponding to 70.64...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6137601/ https://www.ncbi.nlm.nih.gov/pubmed/30204152 http://dx.doi.org/10.1038/sdata.2018.174 |
_version_ | 1783355202943844352 |
---|---|
author | Khan, Shahid Y. Kabir, Firoz M’Hamdi, Oussama Jiao, Xiaodong Naeem, Muhammad Asif Khan, Shaheen N. Riazuddin, Sheikh Hejtmancik, J. Fielding Riazuddin, S. Amer |
author_facet | Khan, Shahid Y. Kabir, Firoz M’Hamdi, Oussama Jiao, Xiaodong Naeem, Muhammad Asif Khan, Shaheen N. Riazuddin, Sheikh Hejtmancik, J. Fielding Riazuddin, S. Amer |
author_sort | Khan, Shahid Y. |
collection | PubMed |
description | Here we report next-generation based whole genome sequencing of two individuals (H1 and H2) from a family of Pakistani descent. The genomic DNA was used to prepare paired-end libraries for whole-genome sequencing. Deep sequencing yielded 706.49 and 778.12 million mapped reads corresponding to 70.64 and 77.81 Gb sequence data and 23× and 25× average coverage for H1 and H2, respectively. Notably, a total of 448,544 and 470,683 novel variants, not present in the single nucleotide polymorphism database (dbSNP), were identified in H1 and H2, respectively. Comparative analysis identified 2,415,852 variants common in both genomes including 240,181 variants absent in the dbSNP. Principal component analysis linked the ancestry of both genomes with South Asian populations. In conclusion, we report whole genome sequences of two individuals from a family of Pakistani descent. |
format | Online Article Text |
id | pubmed-6137601 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-61376012018-09-17 Whole genome sequencing data for two individuals of Pakistani descent Khan, Shahid Y. Kabir, Firoz M’Hamdi, Oussama Jiao, Xiaodong Naeem, Muhammad Asif Khan, Shaheen N. Riazuddin, Sheikh Hejtmancik, J. Fielding Riazuddin, S. Amer Sci Data Data Descriptor Here we report next-generation based whole genome sequencing of two individuals (H1 and H2) from a family of Pakistani descent. The genomic DNA was used to prepare paired-end libraries for whole-genome sequencing. Deep sequencing yielded 706.49 and 778.12 million mapped reads corresponding to 70.64 and 77.81 Gb sequence data and 23× and 25× average coverage for H1 and H2, respectively. Notably, a total of 448,544 and 470,683 novel variants, not present in the single nucleotide polymorphism database (dbSNP), were identified in H1 and H2, respectively. Comparative analysis identified 2,415,852 variants common in both genomes including 240,181 variants absent in the dbSNP. Principal component analysis linked the ancestry of both genomes with South Asian populations. In conclusion, we report whole genome sequences of two individuals from a family of Pakistani descent. Nature Publishing Group 2018-09-11 /pmc/articles/PMC6137601/ /pubmed/30204152 http://dx.doi.org/10.1038/sdata.2018.174 Text en Copyright © 2018, The Author(s) http://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files made available in this article. |
spellingShingle | Data Descriptor Khan, Shahid Y. Kabir, Firoz M’Hamdi, Oussama Jiao, Xiaodong Naeem, Muhammad Asif Khan, Shaheen N. Riazuddin, Sheikh Hejtmancik, J. Fielding Riazuddin, S. Amer Whole genome sequencing data for two individuals of Pakistani descent |
title | Whole genome sequencing data for two individuals of Pakistani descent |
title_full | Whole genome sequencing data for two individuals of Pakistani descent |
title_fullStr | Whole genome sequencing data for two individuals of Pakistani descent |
title_full_unstemmed | Whole genome sequencing data for two individuals of Pakistani descent |
title_short | Whole genome sequencing data for two individuals of Pakistani descent |
title_sort | whole genome sequencing data for two individuals of pakistani descent |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6137601/ https://www.ncbi.nlm.nih.gov/pubmed/30204152 http://dx.doi.org/10.1038/sdata.2018.174 |
work_keys_str_mv | AT khanshahidy wholegenomesequencingdatafortwoindividualsofpakistanidescent AT kabirfiroz wholegenomesequencingdatafortwoindividualsofpakistanidescent AT mhamdioussama wholegenomesequencingdatafortwoindividualsofpakistanidescent AT jiaoxiaodong wholegenomesequencingdatafortwoindividualsofpakistanidescent AT naeemmuhammadasif wholegenomesequencingdatafortwoindividualsofpakistanidescent AT khanshaheenn wholegenomesequencingdatafortwoindividualsofpakistanidescent AT riazuddinsheikh wholegenomesequencingdatafortwoindividualsofpakistanidescent AT hejtmancikjfielding wholegenomesequencingdatafortwoindividualsofpakistanidescent AT riazuddinsamer wholegenomesequencingdatafortwoindividualsofpakistanidescent |