The Genomic Scrapheap Challenge; Extracting Relevant Data from Unmapped Whole Genome Sequencing Reads, Including Strain Specific Genomic Segments, in Rats

Unmapped next-generation sequencing reads are typically ignored while they contain biologically relevant information. We systematically analyzed unmapped reads from whole genome sequencing of 33 inbred rat strains. High quality reads were selected and enriched for biologically relevant sequences; si...

Bibliographic Details
Main authors: van der Weide, Robin H., Simonis, Marieke, Hermsen, Roel, Toonen, Pim, Cuppen, Edwin, de Ligt, Joep
Format: Online Article Text
Language: English
Published: Public Library of Science 2016
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4976967/
https://www.ncbi.nlm.nih.gov/pubmed/27501045
http://dx.doi.org/10.1371/journal.pone.0160036
author van der Weide, Robin H.
Simonis, Marieke
Hermsen, Roel
Toonen, Pim
Cuppen, Edwin
de Ligt, Joep
collection PubMed
description Unmapped next-generation sequencing reads are typically ignored while they contain biologically relevant information. We systematically analyzed unmapped reads from whole genome sequencing of 33 inbred rat strains. High quality reads were selected and enriched for biologically relevant sequences; similarity-based analysis revealed clustering similar to previously reported phylogenetic trees. Our results demonstrate that on average 20% of all unmapped reads harbor sequences that can be used to improve reference genomes and generate hypotheses on potential genotype-phenotype relationships. Analysis pipelines would benefit from incorporating the described methods and reference genomes would benefit from inclusion of the genomic segments obtained through these efforts.
format Online
Article
Text
id pubmed-4976967
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-4976967 2016-08-25
The Genomic Scrapheap Challenge; Extracting Relevant Data from Unmapped Whole Genome Sequencing Reads, Including Strain Specific Genomic Segments, in Rats
van der Weide, Robin H.
Simonis, Marieke
Hermsen, Roel
Toonen, Pim
Cuppen, Edwin
de Ligt, Joep
PLoS One
Research Article
Unmapped next-generation sequencing reads are typically ignored while they contain biologically relevant information. We systematically analyzed unmapped reads from whole genome sequencing of 33 inbred rat strains. High quality reads were selected and enriched for biologically relevant sequences; similarity-based analysis revealed clustering similar to previously reported phylogenetic trees. Our results demonstrate that on average 20% of all unmapped reads harbor sequences that can be used to improve reference genomes and generate hypotheses on potential genotype-phenotype relationships. Analysis pipelines would benefit from incorporating the described methods and reference genomes would benefit from inclusion of the genomic segments obtained through these efforts.
Public Library of Science 2016-08-08
/pmc/articles/PMC4976967/
/pubmed/27501045
http://dx.doi.org/10.1371/journal.pone.0160036
Text en
© 2016 van der Weide et al
http://creativecommons.org/licenses/by/4.0/
This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title The Genomic Scrapheap Challenge; Extracting Relevant Data from Unmapped Whole Genome Sequencing Reads, Including Strain Specific Genomic Segments, in Rats
topic Research Article