Cargando…
The Genomic Scrapheap Challenge; Extracting Relevant Data from Unmapped Whole Genome Sequencing Reads, Including Strain Specific Genomic Segments, in Rats
Unmapped next-generation sequencing reads are typically ignored while they contain biologically relevant information. We systematically analyzed unmapped reads from whole genome sequencing of 33 inbred rat strains. High quality reads were selected and enriched for biologically relevant sequences; si...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4976967/ https://www.ncbi.nlm.nih.gov/pubmed/27501045 http://dx.doi.org/10.1371/journal.pone.0160036 |
_version_ | 1782446946627092480 |
---|---|
author | van der Weide, Robin H. Simonis, Marieke Hermsen, Roel Toonen, Pim Cuppen, Edwin de Ligt, Joep |
author_facet | van der Weide, Robin H. Simonis, Marieke Hermsen, Roel Toonen, Pim Cuppen, Edwin de Ligt, Joep |
author_sort | van der Weide, Robin H. |
collection | PubMed |
description | Unmapped next-generation sequencing reads are typically ignored while they contain biologically relevant information. We systematically analyzed unmapped reads from whole genome sequencing of 33 inbred rat strains. High quality reads were selected and enriched for biologically relevant sequences; similarity-based analysis revealed clustering similar to previously reported phylogenetic trees. Our results demonstrate that on average 20% of all unmapped reads harbor sequences that can be used to improve reference genomes and generate hypotheses on potential genotype-phenotype relationships. Analysis pipelines would benefit from incorporating the described methods and reference genomes would benefit from inclusion of the genomic segments obtained through these efforts. |
format | Online Article Text |
id | pubmed-4976967 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-49769672016-08-25 The Genomic Scrapheap Challenge; Extracting Relevant Data from Unmapped Whole Genome Sequencing Reads, Including Strain Specific Genomic Segments, in Rats van der Weide, Robin H. Simonis, Marieke Hermsen, Roel Toonen, Pim Cuppen, Edwin de Ligt, Joep PLoS One Research Article Unmapped next-generation sequencing reads are typically ignored while they contain biologically relevant information. We systematically analyzed unmapped reads from whole genome sequencing of 33 inbred rat strains. High quality reads were selected and enriched for biologically relevant sequences; similarity-based analysis revealed clustering similar to previously reported phylogenetic trees. Our results demonstrate that on average 20% of all unmapped reads harbor sequences that can be used to improve reference genomes and generate hypotheses on potential genotype-phenotype relationships. Analysis pipelines would benefit from incorporating the described methods and reference genomes would benefit from inclusion of the genomic segments obtained through these efforts. Public Library of Science 2016-08-08 /pmc/articles/PMC4976967/ /pubmed/27501045 http://dx.doi.org/10.1371/journal.pone.0160036 Text en © 2016 van der Weide et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article van der Weide, Robin H. Simonis, Marieke Hermsen, Roel Toonen, Pim Cuppen, Edwin de Ligt, Joep The Genomic Scrapheap Challenge; Extracting Relevant Data from Unmapped Whole Genome Sequencing Reads, Including Strain Specific Genomic Segments, in Rats |
title | The Genomic Scrapheap Challenge; Extracting Relevant Data from Unmapped Whole Genome Sequencing Reads, Including Strain Specific Genomic Segments, in Rats |
title_full | The Genomic Scrapheap Challenge; Extracting Relevant Data from Unmapped Whole Genome Sequencing Reads, Including Strain Specific Genomic Segments, in Rats |
title_fullStr | The Genomic Scrapheap Challenge; Extracting Relevant Data from Unmapped Whole Genome Sequencing Reads, Including Strain Specific Genomic Segments, in Rats |
title_full_unstemmed | The Genomic Scrapheap Challenge; Extracting Relevant Data from Unmapped Whole Genome Sequencing Reads, Including Strain Specific Genomic Segments, in Rats |
title_short | The Genomic Scrapheap Challenge; Extracting Relevant Data from Unmapped Whole Genome Sequencing Reads, Including Strain Specific Genomic Segments, in Rats |
title_sort | genomic scrapheap challenge; extracting relevant data from unmapped whole genome sequencing reads, including strain specific genomic segments, in rats |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4976967/ https://www.ncbi.nlm.nih.gov/pubmed/27501045 http://dx.doi.org/10.1371/journal.pone.0160036 |
work_keys_str_mv | AT vanderweiderobinh thegenomicscrapheapchallengeextractingrelevantdatafromunmappedwholegenomesequencingreadsincludingstrainspecificgenomicsegmentsinrats AT simonismarieke thegenomicscrapheapchallengeextractingrelevantdatafromunmappedwholegenomesequencingreadsincludingstrainspecificgenomicsegmentsinrats AT hermsenroel thegenomicscrapheapchallengeextractingrelevantdatafromunmappedwholegenomesequencingreadsincludingstrainspecificgenomicsegmentsinrats AT toonenpim thegenomicscrapheapchallengeextractingrelevantdatafromunmappedwholegenomesequencingreadsincludingstrainspecificgenomicsegmentsinrats AT cuppenedwin thegenomicscrapheapchallengeextractingrelevantdatafromunmappedwholegenomesequencingreadsincludingstrainspecificgenomicsegmentsinrats AT deligtjoep thegenomicscrapheapchallengeextractingrelevantdatafromunmappedwholegenomesequencingreadsincludingstrainspecificgenomicsegmentsinrats AT vanderweiderobinh genomicscrapheapchallengeextractingrelevantdatafromunmappedwholegenomesequencingreadsincludingstrainspecificgenomicsegmentsinrats AT simonismarieke genomicscrapheapchallengeextractingrelevantdatafromunmappedwholegenomesequencingreadsincludingstrainspecificgenomicsegmentsinrats AT hermsenroel genomicscrapheapchallengeextractingrelevantdatafromunmappedwholegenomesequencingreadsincludingstrainspecificgenomicsegmentsinrats AT toonenpim genomicscrapheapchallengeextractingrelevantdatafromunmappedwholegenomesequencingreadsincludingstrainspecificgenomicsegmentsinrats AT cuppenedwin genomicscrapheapchallengeextractingrelevantdatafromunmappedwholegenomesequencingreadsincludingstrainspecificgenomicsegmentsinrats AT deligtjoep genomicscrapheapchallengeextractingrelevantdatafromunmappedwholegenomesequencingreadsincludingstrainspecificgenomicsegmentsinrats |