Cargando…

Challenges, Solutions, and Quality Metrics of Personal Genome Assembly in Advancing Precision Medicine

Even though each of us shares more than 99% of the DNA sequences in our genome, there are millions of sequence codes or structure in small regions that differ between individuals, giving us different characteristics of appearance or responsiveness to medical treatments. Currently, genetic variants i...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiao, Wenming, Wu, Leihong, Yavas, Gokhan, Simonyan, Vahan, Ning, Baitang, Hong, Huixiao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4932478/
https://www.ncbi.nlm.nih.gov/pubmed/27110816
http://dx.doi.org/10.3390/pharmaceutics8020015
_version_ 1782441061760630784
author Xiao, Wenming
Wu, Leihong
Yavas, Gokhan
Simonyan, Vahan
Ning, Baitang
Hong, Huixiao
author_facet Xiao, Wenming
Wu, Leihong
Yavas, Gokhan
Simonyan, Vahan
Ning, Baitang
Hong, Huixiao
author_sort Xiao, Wenming
collection PubMed
description Even though each of us shares more than 99% of the DNA sequences in our genome, there are millions of sequence codes or structure in small regions that differ between individuals, giving us different characteristics of appearance or responsiveness to medical treatments. Currently, genetic variants in diseased tissues, such as tumors, are uncovered by exploring the differences between the reference genome and the sequences detected in the diseased tissue. However, the public reference genome was derived with the DNA from multiple individuals. As a result of this, the reference genome is incomplete and may misrepresent the sequence variants of the general population. The more reliable solution is to compare sequences of diseased tissue with its own genome sequence derived from tissue in a normal state. As the price to sequence the human genome has dropped dramatically to around $1000, it shows a promising future of documenting the personal genome for every individual. However, de novo assembly of individual genomes at an affordable cost is still challenging. Thus, till now, only a few human genomes have been fully assembled. In this review, we introduce the history of human genome sequencing and the evolution of sequencing platforms, from Sanger sequencing to emerging “third generation sequencing” technologies. We present the currently available de novo assembly and post-assembly software packages for human genome assembly and their requirements for computational infrastructures. We recommend that a combined hybrid assembly with long and short reads would be a promising way to generate good quality human genome assemblies and specify parameters for the quality assessment of assembly outcomes. We provide a perspective view of the benefit of using personal genomes as references and suggestions for obtaining a quality personal genome. Finally, we discuss the usage of the personal genome in aiding vaccine design and development, monitoring host immune-response, tailoring drug therapy and detecting tumors. We believe the precision medicine would largely benefit from bioinformatics solutions, particularly for personal genome assembly.
format Online
Article
Text
id pubmed-4932478
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-49324782016-07-13 Challenges, Solutions, and Quality Metrics of Personal Genome Assembly in Advancing Precision Medicine Xiao, Wenming Wu, Leihong Yavas, Gokhan Simonyan, Vahan Ning, Baitang Hong, Huixiao Pharmaceutics Review Even though each of us shares more than 99% of the DNA sequences in our genome, there are millions of sequence codes or structure in small regions that differ between individuals, giving us different characteristics of appearance or responsiveness to medical treatments. Currently, genetic variants in diseased tissues, such as tumors, are uncovered by exploring the differences between the reference genome and the sequences detected in the diseased tissue. However, the public reference genome was derived with the DNA from multiple individuals. As a result of this, the reference genome is incomplete and may misrepresent the sequence variants of the general population. The more reliable solution is to compare sequences of diseased tissue with its own genome sequence derived from tissue in a normal state. As the price to sequence the human genome has dropped dramatically to around $1000, it shows a promising future of documenting the personal genome for every individual. However, de novo assembly of individual genomes at an affordable cost is still challenging. Thus, till now, only a few human genomes have been fully assembled. In this review, we introduce the history of human genome sequencing and the evolution of sequencing platforms, from Sanger sequencing to emerging “third generation sequencing” technologies. We present the currently available de novo assembly and post-assembly software packages for human genome assembly and their requirements for computational infrastructures. We recommend that a combined hybrid assembly with long and short reads would be a promising way to generate good quality human genome assemblies and specify parameters for the quality assessment of assembly outcomes. We provide a perspective view of the benefit of using personal genomes as references and suggestions for obtaining a quality personal genome. Finally, we discuss the usage of the personal genome in aiding vaccine design and development, monitoring host immune-response, tailoring drug therapy and detecting tumors. We believe the precision medicine would largely benefit from bioinformatics solutions, particularly for personal genome assembly. MDPI 2016-04-22 /pmc/articles/PMC4932478/ /pubmed/27110816 http://dx.doi.org/10.3390/pharmaceutics8020015 Text en © 2016 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
Xiao, Wenming
Wu, Leihong
Yavas, Gokhan
Simonyan, Vahan
Ning, Baitang
Hong, Huixiao
Challenges, Solutions, and Quality Metrics of Personal Genome Assembly in Advancing Precision Medicine
title Challenges, Solutions, and Quality Metrics of Personal Genome Assembly in Advancing Precision Medicine
title_full Challenges, Solutions, and Quality Metrics of Personal Genome Assembly in Advancing Precision Medicine
title_fullStr Challenges, Solutions, and Quality Metrics of Personal Genome Assembly in Advancing Precision Medicine
title_full_unstemmed Challenges, Solutions, and Quality Metrics of Personal Genome Assembly in Advancing Precision Medicine
title_short Challenges, Solutions, and Quality Metrics of Personal Genome Assembly in Advancing Precision Medicine
title_sort challenges, solutions, and quality metrics of personal genome assembly in advancing precision medicine
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4932478/
https://www.ncbi.nlm.nih.gov/pubmed/27110816
http://dx.doi.org/10.3390/pharmaceutics8020015
work_keys_str_mv AT xiaowenming challengessolutionsandqualitymetricsofpersonalgenomeassemblyinadvancingprecisionmedicine
AT wuleihong challengessolutionsandqualitymetricsofpersonalgenomeassemblyinadvancingprecisionmedicine
AT yavasgokhan challengessolutionsandqualitymetricsofpersonalgenomeassemblyinadvancingprecisionmedicine
AT simonyanvahan challengessolutionsandqualitymetricsofpersonalgenomeassemblyinadvancingprecisionmedicine
AT ningbaitang challengessolutionsandqualitymetricsofpersonalgenomeassemblyinadvancingprecisionmedicine
AT honghuixiao challengessolutionsandqualitymetricsofpersonalgenomeassemblyinadvancingprecisionmedicine