Cargando…
Whole genome sequencing data of multiple individuals of Pakistani descent
Here we report whole genome sequencing of four individuals (H3, H4, H5, and H6) from a family of Pakistani descent. Whole genome sequencing yielded 1084.92, 894.73, 1068.62, and 1005.77 million mapped reads corresponding to 162.73, 134.21, 160.29, and 150.86 Gb sequence data and 52.49x, 43.29x, 51.7...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7555865/ https://www.ncbi.nlm.nih.gov/pubmed/33051442 http://dx.doi.org/10.1038/s41597-020-00664-2 |
_version_ | 1783594106626244608 |
---|---|
author | Khan, Shahid Y. Ali, Muhammad Lee, Mei-Chong W. Ma, Zhiwei Biswas, Pooja Khan, Asma A. Naeem, Muhammad Asif Riazuddin, Saima Riazuddin, Sheikh Ayyagari, Radha Hejtmancik, J. Fielding Riazuddin, S. Amer |
author_facet | Khan, Shahid Y. Ali, Muhammad Lee, Mei-Chong W. Ma, Zhiwei Biswas, Pooja Khan, Asma A. Naeem, Muhammad Asif Riazuddin, Saima Riazuddin, Sheikh Ayyagari, Radha Hejtmancik, J. Fielding Riazuddin, S. Amer |
author_sort | Khan, Shahid Y. |
collection | PubMed |
description | Here we report whole genome sequencing of four individuals (H3, H4, H5, and H6) from a family of Pakistani descent. Whole genome sequencing yielded 1084.92, 894.73, 1068.62, and 1005.77 million mapped reads corresponding to 162.73, 134.21, 160.29, and 150.86 Gb sequence data and 52.49x, 43.29x, 51.70x, and 48.66x average coverage for H3, H4, H5, and H6, respectively. We identified 3,529,659, 3,478,495, 3,407,895, and 3,426,862 variants in the genomes of H3, H4, H5, and H6, respectively, including 1,668,024 variants common in the four genomes. Further, we identified 42,422, 39,824, 28,599, and 35,206 novel variants in the genomes of H3, H4, H5, and H6, respectively. A major fraction of the variants identified in the four genomes reside within the intergenic regions of the genome. Single nucleotide polymorphism (SNP) genotype based comparative analysis with ethnic populations of 1000 Genomes database linked the ancestry of all four genomes with the South Asian populations, which was further supported by mitochondria based haplogroup analysis. In conclusion, we report whole genome sequencing of four individuals of Pakistani descent. |
format | Online Article Text |
id | pubmed-7555865 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-75558652020-10-19 Whole genome sequencing data of multiple individuals of Pakistani descent Khan, Shahid Y. Ali, Muhammad Lee, Mei-Chong W. Ma, Zhiwei Biswas, Pooja Khan, Asma A. Naeem, Muhammad Asif Riazuddin, Saima Riazuddin, Sheikh Ayyagari, Radha Hejtmancik, J. Fielding Riazuddin, S. Amer Sci Data Data Descriptor Here we report whole genome sequencing of four individuals (H3, H4, H5, and H6) from a family of Pakistani descent. Whole genome sequencing yielded 1084.92, 894.73, 1068.62, and 1005.77 million mapped reads corresponding to 162.73, 134.21, 160.29, and 150.86 Gb sequence data and 52.49x, 43.29x, 51.70x, and 48.66x average coverage for H3, H4, H5, and H6, respectively. We identified 3,529,659, 3,478,495, 3,407,895, and 3,426,862 variants in the genomes of H3, H4, H5, and H6, respectively, including 1,668,024 variants common in the four genomes. Further, we identified 42,422, 39,824, 28,599, and 35,206 novel variants in the genomes of H3, H4, H5, and H6, respectively. A major fraction of the variants identified in the four genomes reside within the intergenic regions of the genome. Single nucleotide polymorphism (SNP) genotype based comparative analysis with ethnic populations of 1000 Genomes database linked the ancestry of all four genomes with the South Asian populations, which was further supported by mitochondria based haplogroup analysis. In conclusion, we report whole genome sequencing of four individuals of Pakistani descent. Nature Publishing Group UK 2020-10-13 /pmc/articles/PMC7555865/ /pubmed/33051442 http://dx.doi.org/10.1038/s41597-020-00664-2 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article. |
spellingShingle | Data Descriptor Khan, Shahid Y. Ali, Muhammad Lee, Mei-Chong W. Ma, Zhiwei Biswas, Pooja Khan, Asma A. Naeem, Muhammad Asif Riazuddin, Saima Riazuddin, Sheikh Ayyagari, Radha Hejtmancik, J. Fielding Riazuddin, S. Amer Whole genome sequencing data of multiple individuals of Pakistani descent |
title | Whole genome sequencing data of multiple individuals of Pakistani descent |
title_full | Whole genome sequencing data of multiple individuals of Pakistani descent |
title_fullStr | Whole genome sequencing data of multiple individuals of Pakistani descent |
title_full_unstemmed | Whole genome sequencing data of multiple individuals of Pakistani descent |
title_short | Whole genome sequencing data of multiple individuals of Pakistani descent |
title_sort | whole genome sequencing data of multiple individuals of pakistani descent |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7555865/ https://www.ncbi.nlm.nih.gov/pubmed/33051442 http://dx.doi.org/10.1038/s41597-020-00664-2 |
work_keys_str_mv | AT khanshahidy wholegenomesequencingdataofmultipleindividualsofpakistanidescent AT alimuhammad wholegenomesequencingdataofmultipleindividualsofpakistanidescent AT leemeichongw wholegenomesequencingdataofmultipleindividualsofpakistanidescent AT mazhiwei wholegenomesequencingdataofmultipleindividualsofpakistanidescent AT biswaspooja wholegenomesequencingdataofmultipleindividualsofpakistanidescent AT khanasmaa wholegenomesequencingdataofmultipleindividualsofpakistanidescent AT naeemmuhammadasif wholegenomesequencingdataofmultipleindividualsofpakistanidescent AT riazuddinsaima wholegenomesequencingdataofmultipleindividualsofpakistanidescent AT riazuddinsheikh wholegenomesequencingdataofmultipleindividualsofpakistanidescent AT ayyagariradha wholegenomesequencingdataofmultipleindividualsofpakistanidescent AT hejtmancikjfielding wholegenomesequencingdataofmultipleindividualsofpakistanidescent AT riazuddinsamer wholegenomesequencingdataofmultipleindividualsofpakistanidescent |