Cargando…

PolyCat: A Resource for Genome Categorization of Sequencing Reads From Allopolyploid Organisms

Read mapping is a fundamental part of next-generation genomic research but is complicated by genome duplication in many plants. Categorizing DNA sequence reads into their respective genomes enables current methods to analyze polyploid genomes as if they were diploid. We present PolyCat—a pipeline fo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Page, Justin T., Gingle, Alan R., Udall, Joshua A.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Genetics Society of America 2013
Materias:	Investigations
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3583458/ https://www.ncbi.nlm.nih.gov/pubmed/23450226 http://dx.doi.org/10.1534/g3.112.005298

_version_	1782475427783114752
author	Page, Justin T. Gingle, Alan R. Udall, Joshua A.
author_facet	Page, Justin T. Gingle, Alan R. Udall, Joshua A.
author_sort	Page, Justin T.
collection	PubMed
description	Read mapping is a fundamental part of next-generation genomic research but is complicated by genome duplication in many plants. Categorizing DNA sequence reads into their respective genomes enables current methods to analyze polyploid genomes as if they were diploid. We present PolyCat—a pipeline for mapping and categorizing all types of next-generation sequence data produced from allopolyploid organisms. PolyCat uses GSNAP’s single-nucleotide polymorphism (SNP)-tolerant mapping to minimize the mapping efficiency bias caused by SNPs between genomes. PolyCat then uses SNPs between genomes to categorize reads according to their respective genomes. Bisulfite-treated reads have a significant reduction in nucleotide complexity because nucleotide conversion events are confounded with transition substitutions. PolyCat includes special provisions to properly handle bisulfite-treated data. We demonstrate the functionality of PolyCat on allotetraploid cotton, Gossypium hirsutum, and create a functional SNP index for efficiently mapping sequence reads to the D-genome sequence of G. raimondii. PolyCat is appropriate for all allopolyploids and all types of next-generation genome analysis, including differential expression (RNA sequencing), differential methylation (bisulfite sequencing), differential DNA-protein binding (chromatin immunoprecipitation sequencing), and population diversity.
format	Online Article Text
id	pubmed-3583458
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	Genetics Society of America
record_format	MEDLINE/PubMed
spelling	pubmed-35834582013-03-01 PolyCat: A Resource for Genome Categorization of Sequencing Reads From Allopolyploid Organisms Page, Justin T. Gingle, Alan R. Udall, Joshua A. G3 (Bethesda) Investigations Read mapping is a fundamental part of next-generation genomic research but is complicated by genome duplication in many plants. Categorizing DNA sequence reads into their respective genomes enables current methods to analyze polyploid genomes as if they were diploid. We present PolyCat—a pipeline for mapping and categorizing all types of next-generation sequence data produced from allopolyploid organisms. PolyCat uses GSNAP’s single-nucleotide polymorphism (SNP)-tolerant mapping to minimize the mapping efficiency bias caused by SNPs between genomes. PolyCat then uses SNPs between genomes to categorize reads according to their respective genomes. Bisulfite-treated reads have a significant reduction in nucleotide complexity because nucleotide conversion events are confounded with transition substitutions. PolyCat includes special provisions to properly handle bisulfite-treated data. We demonstrate the functionality of PolyCat on allotetraploid cotton, Gossypium hirsutum, and create a functional SNP index for efficiently mapping sequence reads to the D-genome sequence of G. raimondii. PolyCat is appropriate for all allopolyploids and all types of next-generation genome analysis, including differential expression (RNA sequencing), differential methylation (bisulfite sequencing), differential DNA-protein binding (chromatin immunoprecipitation sequencing), and population diversity. Genetics Society of America 2013-03-01 /pmc/articles/PMC3583458/ /pubmed/23450226 http://dx.doi.org/10.1534/g3.112.005298 Text en Copyright © 2013 Page et al. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution Unported License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Investigations Page, Justin T. Gingle, Alan R. Udall, Joshua A. PolyCat: A Resource for Genome Categorization of Sequencing Reads From Allopolyploid Organisms
title	PolyCat: A Resource for Genome Categorization of Sequencing Reads From Allopolyploid Organisms
title_full	PolyCat: A Resource for Genome Categorization of Sequencing Reads From Allopolyploid Organisms
title_fullStr	PolyCat: A Resource for Genome Categorization of Sequencing Reads From Allopolyploid Organisms
title_full_unstemmed	PolyCat: A Resource for Genome Categorization of Sequencing Reads From Allopolyploid Organisms
title_short	PolyCat: A Resource for Genome Categorization of Sequencing Reads From Allopolyploid Organisms
title_sort	polycat: a resource for genome categorization of sequencing reads from allopolyploid organisms
topic	Investigations
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3583458/ https://www.ncbi.nlm.nih.gov/pubmed/23450226 http://dx.doi.org/10.1534/g3.112.005298
work_keys_str_mv	AT pagejustint polycataresourceforgenomecategorizationofsequencingreadsfromallopolyploidorganisms AT ginglealanr polycataresourceforgenomecategorizationofsequencingreadsfromallopolyploidorganisms AT udalljoshuaa polycataresourceforgenomecategorizationofsequencingreadsfromallopolyploidorganisms

PolyCat: A Resource for Genome Categorization of Sequencing Reads From Allopolyploid Organisms

Ejemplares similares