Cargando…

Comparison of sequence variants in transcriptomic control regions across 17 mouse genomes

The laboratory mouse is the most widely used mammalian model organism in biomedical research, so a thorough annotation of functional variation in the mouse genome would be of significant value. In this study, we compared sequence variation in a comprehensive list of functional elements (e.g. promote...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Cao, Baten, Abdul, Morahan, Grant
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3958616/
https://www.ncbi.nlm.nih.gov/pubmed/24647628
http://dx.doi.org/10.1093/database/bau020
_version_ 1782307906062909440
author Nguyen, Cao
Baten, Abdul
Morahan, Grant
author_facet Nguyen, Cao
Baten, Abdul
Morahan, Grant
author_sort Nguyen, Cao
collection PubMed
description The laboratory mouse is the most widely used mammalian model organism in biomedical research, so a thorough annotation of functional variation in the mouse genome would be of significant value. In this study, we compared sequence variation in a comprehensive list of functional elements (e.g. promoters, enhancers and CTCF binding sites) across 17 inbred mouse strains. Sequences were derived for ∼300 000 functional elements experimentally identified by the mouse ENCODE project as regulating gene expression in 19 different tissue sources. We aligned sequences for each predicted cis-regulatory element to genomes of 17 mouse strains. This yielded a database comprising ∼5 million aligned sequences, allowing interrogation of sequence variation of functional elements for each of the 19 tissues/cell types in commonly used mouse strains. We also developed an online tool to visualize the genome around each predicted cis-regulatory element in each tissue context and which allows efficient comparison of variation between any two sets of strains. This will be particularly useful in the context of the Collaborative Cross (CC), which was conceived as a powerful new systems genetics resource to accelerate gene discovery. Comprising a large number of inbred strains derived from eight genetically diverse founders, the CC offers rapid mapping and identification of genes that mediate complex traits. We show that, among the 17 sequenced strains, the set of CC founder strains captures the most variability in the ENCODE elements, further emphasizing the value of this resource. Database URL: www.sysgen.org/ecco
format Online
Article
Text
id pubmed-3958616
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-39586162014-03-20 Comparison of sequence variants in transcriptomic control regions across 17 mouse genomes Nguyen, Cao Baten, Abdul Morahan, Grant Database (Oxford) Database Tool The laboratory mouse is the most widely used mammalian model organism in biomedical research, so a thorough annotation of functional variation in the mouse genome would be of significant value. In this study, we compared sequence variation in a comprehensive list of functional elements (e.g. promoters, enhancers and CTCF binding sites) across 17 inbred mouse strains. Sequences were derived for ∼300 000 functional elements experimentally identified by the mouse ENCODE project as regulating gene expression in 19 different tissue sources. We aligned sequences for each predicted cis-regulatory element to genomes of 17 mouse strains. This yielded a database comprising ∼5 million aligned sequences, allowing interrogation of sequence variation of functional elements for each of the 19 tissues/cell types in commonly used mouse strains. We also developed an online tool to visualize the genome around each predicted cis-regulatory element in each tissue context and which allows efficient comparison of variation between any two sets of strains. This will be particularly useful in the context of the Collaborative Cross (CC), which was conceived as a powerful new systems genetics resource to accelerate gene discovery. Comprising a large number of inbred strains derived from eight genetically diverse founders, the CC offers rapid mapping and identification of genes that mediate complex traits. We show that, among the 17 sequenced strains, the set of CC founder strains captures the most variability in the ENCODE elements, further emphasizing the value of this resource. Database URL: www.sysgen.org/ecco Oxford University Press 2014-03-18 /pmc/articles/PMC3958616/ /pubmed/24647628 http://dx.doi.org/10.1093/database/bau020 Text en © The Author(s) 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Tool
Nguyen, Cao
Baten, Abdul
Morahan, Grant
Comparison of sequence variants in transcriptomic control regions across 17 mouse genomes
title Comparison of sequence variants in transcriptomic control regions across 17 mouse genomes
title_full Comparison of sequence variants in transcriptomic control regions across 17 mouse genomes
title_fullStr Comparison of sequence variants in transcriptomic control regions across 17 mouse genomes
title_full_unstemmed Comparison of sequence variants in transcriptomic control regions across 17 mouse genomes
title_short Comparison of sequence variants in transcriptomic control regions across 17 mouse genomes
title_sort comparison of sequence variants in transcriptomic control regions across 17 mouse genomes
topic Database Tool
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3958616/
https://www.ncbi.nlm.nih.gov/pubmed/24647628
http://dx.doi.org/10.1093/database/bau020
work_keys_str_mv AT nguyencao comparisonofsequencevariantsintranscriptomiccontrolregionsacross17mousegenomes
AT batenabdul comparisonofsequencevariantsintranscriptomiccontrolregionsacross17mousegenomes
AT morahangrant comparisonofsequencevariantsintranscriptomiccontrolregionsacross17mousegenomes