Cargando…

Computational pan-genomics: status, promises and challenges

Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply...

Descripción completa

Detalles Bibliográficos
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2016
Materias:	Papers
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5862344/ https://www.ncbi.nlm.nih.gov/pubmed/27769991 http://dx.doi.org/10.1093/bib/bbw089

_version_	1783308210245992448
collection	PubMed
description	Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply scaling up established bioinformatics pipelines will not be sufficient for leveraging the full potential of such rich genomic data sets. Instead, novel, qualitatively different computational methods and paradigms are needed. We will witness the rapid extension of computational pan-genomics, a new sub-area of research in computational biology. In this article, we generalize existing definitions and understand a pan-genome as any collection of genomic sequences to be analyzed jointly or to be used as a reference. We examine already available approaches to construct and use pan-genomes, discuss the potential benefits of future technologies and methodologies and review open challenges from the vantage point of the above-mentioned biological disciplines. As a prominent example for a computational paradigm shift, we particularly highlight the transition from the representation of reference genomes as strings to representations as graphs. We outline how this and other challenges from different application domains translate into common computational problems, point out relevant bioinformatics techniques and identify open problems in computer science. With this review, we aim to increase awareness that a joint approach to computational pan-genomics can help address many of the problems currently faced in various domains.
format	Online Article Text
id	pubmed-5862344
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-58623442018-07-10 Computational pan-genomics: status, promises and challenges Brief Bioinform Papers Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply scaling up established bioinformatics pipelines will not be sufficient for leveraging the full potential of such rich genomic data sets. Instead, novel, qualitatively different computational methods and paradigms are needed. We will witness the rapid extension of computational pan-genomics, a new sub-area of research in computational biology. In this article, we generalize existing definitions and understand a pan-genome as any collection of genomic sequences to be analyzed jointly or to be used as a reference. We examine already available approaches to construct and use pan-genomes, discuss the potential benefits of future technologies and methodologies and review open challenges from the vantage point of the above-mentioned biological disciplines. As a prominent example for a computational paradigm shift, we particularly highlight the transition from the representation of reference genomes as strings to representations as graphs. We outline how this and other challenges from different application domains translate into common computational problems, point out relevant bioinformatics techniques and identify open problems in computer science. With this review, we aim to increase awareness that a joint approach to computational pan-genomics can help address many of the problems currently faced in various domains. Oxford University Press 2016-10-21 /pmc/articles/PMC5862344/ /pubmed/27769991 http://dx.doi.org/10.1093/bib/bbw089 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Papers Computational pan-genomics: status, promises and challenges
title	Computational pan-genomics: status, promises and challenges
title_full	Computational pan-genomics: status, promises and challenges
title_fullStr	Computational pan-genomics: status, promises and challenges
title_full_unstemmed	Computational pan-genomics: status, promises and challenges
title_short	Computational pan-genomics: status, promises and challenges
title_sort	computational pan-genomics: status, promises and challenges
topic	Papers
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5862344/ https://www.ncbi.nlm.nih.gov/pubmed/27769991 http://dx.doi.org/10.1093/bib/bbw089
work_keys_str_mv	AT computationalpangenomicsstatuspromisesandchallenges

Computational pan-genomics: status, promises and challenges

Ejemplares similares