Cargando…

Satellog: A database for the identification and prioritization of satellite repeats in disease association studies

BACKGROUND: To date, 35 human diseases, some of which also exhibit anticipation, have been associated with unstable repeats. Anticipation has been reported in a number of diseases in which repeat expansion may have a role in etiology. Despite the growing importance of unstable repeats in disease, cu...

Descripción completa

Detalles Bibliográficos
Autores principales: Missirlis, Perseus I, Mead, Carri-Lyn R, Butland, Stefanie L, Ouellette, BF Francis, Devon, Rebecca S, Leavitt, Blair R, Holt, Robert A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1181805/
https://www.ncbi.nlm.nih.gov/pubmed/15949044
http://dx.doi.org/10.1186/1471-2105-6-145
_version_ 1782124631157637120
author Missirlis, Perseus I
Mead, Carri-Lyn R
Butland, Stefanie L
Ouellette, BF Francis
Devon, Rebecca S
Leavitt, Blair R
Holt, Robert A
author_facet Missirlis, Perseus I
Mead, Carri-Lyn R
Butland, Stefanie L
Ouellette, BF Francis
Devon, Rebecca S
Leavitt, Blair R
Holt, Robert A
author_sort Missirlis, Perseus I
collection PubMed
description BACKGROUND: To date, 35 human diseases, some of which also exhibit anticipation, have been associated with unstable repeats. Anticipation has been reported in a number of diseases in which repeat expansion may have a role in etiology. Despite the growing importance of unstable repeats in disease, currently no resource exists for the prioritization of repeats. Here we present Satellog, a database that catalogs all pure 1–16 repeat unit satellite repeats in the human genome along with supplementary data. Satellog analyzes each pure repeat in UniGene clusters for evidence of repeat polymorphism. RESULTS: A total of 5,546 such repeats were identified, providing the first indication of many novel polymorphic sites in the genome. Overall, polymorphic repeats were over-represented within 3'-UTR sequence relative to 5'-UTR and coding sequence. Interestingly, we observed that repeat polymorphism within coding sequence is restricted to trinucleotide repeats whereas UTR sequence tolerated a wider range of repeat period polymorphisms. For each pure repeat we also calculate its repeat length percentile rank, its location either within or adjacent to EnsEMBL genes, and its expression profile in normal tissues according to the GeneNote database. CONCLUSION: Satellog provides the ability to dynamically prioritize repeats based on any of their characteristics (i.e. repeat unit, class, period, length, repeat length percentile rank, genomic co-ordinates), polymorphism profile within UniGene, proximity to or presence within gene regions (i.e. cds, UTR, 15 kb upstream etc.), metadata of the genes they are detected within and gene expression profiles within normal human tissues. Unstable repeats associated with 31 diseases were analyzed in Satellog to evaluate their common repeat properties. The utility of Satellog was highlighted by prioritizing repeats for Huntington's disease and schizophrenia. Satellog is available online at .
format Text
id pubmed-1181805
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-11818052005-07-30 Satellog: A database for the identification and prioritization of satellite repeats in disease association studies Missirlis, Perseus I Mead, Carri-Lyn R Butland, Stefanie L Ouellette, BF Francis Devon, Rebecca S Leavitt, Blair R Holt, Robert A BMC Bioinformatics Database BACKGROUND: To date, 35 human diseases, some of which also exhibit anticipation, have been associated with unstable repeats. Anticipation has been reported in a number of diseases in which repeat expansion may have a role in etiology. Despite the growing importance of unstable repeats in disease, currently no resource exists for the prioritization of repeats. Here we present Satellog, a database that catalogs all pure 1–16 repeat unit satellite repeats in the human genome along with supplementary data. Satellog analyzes each pure repeat in UniGene clusters for evidence of repeat polymorphism. RESULTS: A total of 5,546 such repeats were identified, providing the first indication of many novel polymorphic sites in the genome. Overall, polymorphic repeats were over-represented within 3'-UTR sequence relative to 5'-UTR and coding sequence. Interestingly, we observed that repeat polymorphism within coding sequence is restricted to trinucleotide repeats whereas UTR sequence tolerated a wider range of repeat period polymorphisms. For each pure repeat we also calculate its repeat length percentile rank, its location either within or adjacent to EnsEMBL genes, and its expression profile in normal tissues according to the GeneNote database. CONCLUSION: Satellog provides the ability to dynamically prioritize repeats based on any of their characteristics (i.e. repeat unit, class, period, length, repeat length percentile rank, genomic co-ordinates), polymorphism profile within UniGene, proximity to or presence within gene regions (i.e. cds, UTR, 15 kb upstream etc.), metadata of the genes they are detected within and gene expression profiles within normal human tissues. Unstable repeats associated with 31 diseases were analyzed in Satellog to evaluate their common repeat properties. The utility of Satellog was highlighted by prioritizing repeats for Huntington's disease and schizophrenia. Satellog is available online at . BioMed Central 2005-06-10 /pmc/articles/PMC1181805/ /pubmed/15949044 http://dx.doi.org/10.1186/1471-2105-6-145 Text en Copyright © 2005 Missirlis et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database
Missirlis, Perseus I
Mead, Carri-Lyn R
Butland, Stefanie L
Ouellette, BF Francis
Devon, Rebecca S
Leavitt, Blair R
Holt, Robert A
Satellog: A database for the identification and prioritization of satellite repeats in disease association studies
title Satellog: A database for the identification and prioritization of satellite repeats in disease association studies
title_full Satellog: A database for the identification and prioritization of satellite repeats in disease association studies
title_fullStr Satellog: A database for the identification and prioritization of satellite repeats in disease association studies
title_full_unstemmed Satellog: A database for the identification and prioritization of satellite repeats in disease association studies
title_short Satellog: A database for the identification and prioritization of satellite repeats in disease association studies
title_sort satellog: a database for the identification and prioritization of satellite repeats in disease association studies
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1181805/
https://www.ncbi.nlm.nih.gov/pubmed/15949044
http://dx.doi.org/10.1186/1471-2105-6-145
work_keys_str_mv AT missirlisperseusi satellogadatabasefortheidentificationandprioritizationofsatelliterepeatsindiseaseassociationstudies
AT meadcarrilynr satellogadatabasefortheidentificationandprioritizationofsatelliterepeatsindiseaseassociationstudies
AT butlandstefaniel satellogadatabasefortheidentificationandprioritizationofsatelliterepeatsindiseaseassociationstudies
AT ouellettebffrancis satellogadatabasefortheidentificationandprioritizationofsatelliterepeatsindiseaseassociationstudies
AT devonrebeccas satellogadatabasefortheidentificationandprioritizationofsatelliterepeatsindiseaseassociationstudies
AT leavittblairr satellogadatabasefortheidentificationandprioritizationofsatelliterepeatsindiseaseassociationstudies
AT holtroberta satellogadatabasefortheidentificationandprioritizationofsatelliterepeatsindiseaseassociationstudies