Cargando…
Genomic landscape of rat strain and substrain variation
BACKGROUND: Since the completion of the rat reference genome in 2003, whole-genome sequencing data from more than 40 rat strains have become available. These data represent the broad range of strains that are used in rat research including commonly used substrains. Currently, this wealth of informat...
Autores principales: | , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4422378/ https://www.ncbi.nlm.nih.gov/pubmed/25943489 http://dx.doi.org/10.1186/s12864-015-1594-1 |
_version_ | 1782370043699396608 |
---|---|
author | Hermsen, Roel de Ligt, Joep Spee, Wim Blokzijl, Francis Schäfer, Sebastian Adami, Eleonora Boymans, Sander Flink, Stephen van Boxtel, Ruben van der Weide, Robin H Aitman, Tim Hübner, Norbert Simonis, Marieke Tabakoff, Boris Guryev, Victor Cuppen, Edwin |
author_facet | Hermsen, Roel de Ligt, Joep Spee, Wim Blokzijl, Francis Schäfer, Sebastian Adami, Eleonora Boymans, Sander Flink, Stephen van Boxtel, Ruben van der Weide, Robin H Aitman, Tim Hübner, Norbert Simonis, Marieke Tabakoff, Boris Guryev, Victor Cuppen, Edwin |
author_sort | Hermsen, Roel |
collection | PubMed |
description | BACKGROUND: Since the completion of the rat reference genome in 2003, whole-genome sequencing data from more than 40 rat strains have become available. These data represent the broad range of strains that are used in rat research including commonly used substrains. Currently, this wealth of information cannot be used to its full extent, because the variety of different variant calling algorithms employed by different groups impairs comparison between strains. In addition, all rat whole genome sequencing studies to date used an outdated reference genome for analysis (RGSC3.4 released in 2004). RESULTS: Here we present a comprehensive, multi-sample and uniformly called set of genetic variants in 40 rat strains, including 19 substrains. We reanalyzed all primary data using a recent version of the rat reference assembly (RGSC5.0 released in 2012) and identified over 12 million genomic variants (SNVs, indels and structural variants) among the 40 strains. 28,318 SNVs are specific to individual substrains, which may be explained by introgression from other unsequenced strains and ongoing evolution by genetic drift. Substrain SNVs may have a larger predicted functional impact compared to older shared SNVs. CONCLUSIONS: In summary we present a comprehensive catalog of uniformly analyzed genetic variants among 40 widely used rat inbred strains based on the RGSC5.0 assembly. This represents a valuable resource, which will facilitate rat functional genomic research. In line with previous observations, our genome-wide analyses do not show evidence for contribution of multiple ancestral founder rat subspecies to the currently used rat inbred strains, as is the case for mouse. In addition, we find that the degree of substrain variation is highly variable between strains, which is of importance for the correct interpretation of experimental data from different labs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1594-1) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4422378 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44223782015-05-07 Genomic landscape of rat strain and substrain variation Hermsen, Roel de Ligt, Joep Spee, Wim Blokzijl, Francis Schäfer, Sebastian Adami, Eleonora Boymans, Sander Flink, Stephen van Boxtel, Ruben van der Weide, Robin H Aitman, Tim Hübner, Norbert Simonis, Marieke Tabakoff, Boris Guryev, Victor Cuppen, Edwin BMC Genomics Research Article BACKGROUND: Since the completion of the rat reference genome in 2003, whole-genome sequencing data from more than 40 rat strains have become available. These data represent the broad range of strains that are used in rat research including commonly used substrains. Currently, this wealth of information cannot be used to its full extent, because the variety of different variant calling algorithms employed by different groups impairs comparison between strains. In addition, all rat whole genome sequencing studies to date used an outdated reference genome for analysis (RGSC3.4 released in 2004). RESULTS: Here we present a comprehensive, multi-sample and uniformly called set of genetic variants in 40 rat strains, including 19 substrains. We reanalyzed all primary data using a recent version of the rat reference assembly (RGSC5.0 released in 2012) and identified over 12 million genomic variants (SNVs, indels and structural variants) among the 40 strains. 28,318 SNVs are specific to individual substrains, which may be explained by introgression from other unsequenced strains and ongoing evolution by genetic drift. Substrain SNVs may have a larger predicted functional impact compared to older shared SNVs. CONCLUSIONS: In summary we present a comprehensive catalog of uniformly analyzed genetic variants among 40 widely used rat inbred strains based on the RGSC5.0 assembly. This represents a valuable resource, which will facilitate rat functional genomic research. In line with previous observations, our genome-wide analyses do not show evidence for contribution of multiple ancestral founder rat subspecies to the currently used rat inbred strains, as is the case for mouse. In addition, we find that the degree of substrain variation is highly variable between strains, which is of importance for the correct interpretation of experimental data from different labs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1594-1) contains supplementary material, which is available to authorized users. BioMed Central 2015-05-06 /pmc/articles/PMC4422378/ /pubmed/25943489 http://dx.doi.org/10.1186/s12864-015-1594-1 Text en © Hermsen et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Hermsen, Roel de Ligt, Joep Spee, Wim Blokzijl, Francis Schäfer, Sebastian Adami, Eleonora Boymans, Sander Flink, Stephen van Boxtel, Ruben van der Weide, Robin H Aitman, Tim Hübner, Norbert Simonis, Marieke Tabakoff, Boris Guryev, Victor Cuppen, Edwin Genomic landscape of rat strain and substrain variation |
title | Genomic landscape of rat strain and substrain variation |
title_full | Genomic landscape of rat strain and substrain variation |
title_fullStr | Genomic landscape of rat strain and substrain variation |
title_full_unstemmed | Genomic landscape of rat strain and substrain variation |
title_short | Genomic landscape of rat strain and substrain variation |
title_sort | genomic landscape of rat strain and substrain variation |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4422378/ https://www.ncbi.nlm.nih.gov/pubmed/25943489 http://dx.doi.org/10.1186/s12864-015-1594-1 |
work_keys_str_mv | AT hermsenroel genomiclandscapeofratstrainandsubstrainvariation AT deligtjoep genomiclandscapeofratstrainandsubstrainvariation AT speewim genomiclandscapeofratstrainandsubstrainvariation AT blokzijlfrancis genomiclandscapeofratstrainandsubstrainvariation AT schafersebastian genomiclandscapeofratstrainandsubstrainvariation AT adamieleonora genomiclandscapeofratstrainandsubstrainvariation AT boymanssander genomiclandscapeofratstrainandsubstrainvariation AT flinkstephen genomiclandscapeofratstrainandsubstrainvariation AT vanboxtelruben genomiclandscapeofratstrainandsubstrainvariation AT vanderweiderobinh genomiclandscapeofratstrainandsubstrainvariation AT aitmantim genomiclandscapeofratstrainandsubstrainvariation AT hubnernorbert genomiclandscapeofratstrainandsubstrainvariation AT simonismarieke genomiclandscapeofratstrainandsubstrainvariation AT tabakoffboris genomiclandscapeofratstrainandsubstrainvariation AT guryevvictor genomiclandscapeofratstrainandsubstrainvariation AT cuppenedwin genomiclandscapeofratstrainandsubstrainvariation |