Cargando…
Genome-wide analysis of mobile genetic element insertion sites
Mobile genetic elements (MGEs) account for a significant fraction of eukaryotic genomes and are implicated in altered gene expression and disease. We present an efficient computational protocol for MGE insertion site analysis. ELAN, the suite of tools described here uses standard techniques to ident...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3167599/ https://www.ncbi.nlm.nih.gov/pubmed/21609951 http://dx.doi.org/10.1093/nar/gkr337 |
_version_ | 1782211260655337472 |
---|---|
author | Rawal, Kamal Ramaswamy, Ram |
author_facet | Rawal, Kamal Ramaswamy, Ram |
author_sort | Rawal, Kamal |
collection | PubMed |
description | Mobile genetic elements (MGEs) account for a significant fraction of eukaryotic genomes and are implicated in altered gene expression and disease. We present an efficient computational protocol for MGE insertion site analysis. ELAN, the suite of tools described here uses standard techniques to identify different MGEs and their distribution on the genome. One component, DNASCANNER analyses known insertion sites of MGEs for the presence of signals that are based on a combination of local physical and chemical properties. ISF (insertion site finder) is a machine-learning tool that incorporates information derived from DNASCANNER. ISF permits classification of a given DNA sequence as a potential insertion site or not, using a support vector machine. We have studied the genomes of Homo sapiens, Mus musculus, Drosophila melanogaster and Entamoeba histolytica via a protocol whereby DNASCANNER is used to identify a common set of statistically important signals flanking the insertion sites in the various genomes. These are used in ISF for insertion site prediction, and the current accuracy of the tool is over 65%. We find similar signals at gene boundaries and splice sites. Together, these data are suggestive of a common insertion mechanism that operates in a variety of eukaryotes. |
format | Online Article Text |
id | pubmed-3167599 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-31675992011-09-06 Genome-wide analysis of mobile genetic element insertion sites Rawal, Kamal Ramaswamy, Ram Nucleic Acids Res Computational Biology Mobile genetic elements (MGEs) account for a significant fraction of eukaryotic genomes and are implicated in altered gene expression and disease. We present an efficient computational protocol for MGE insertion site analysis. ELAN, the suite of tools described here uses standard techniques to identify different MGEs and their distribution on the genome. One component, DNASCANNER analyses known insertion sites of MGEs for the presence of signals that are based on a combination of local physical and chemical properties. ISF (insertion site finder) is a machine-learning tool that incorporates information derived from DNASCANNER. ISF permits classification of a given DNA sequence as a potential insertion site or not, using a support vector machine. We have studied the genomes of Homo sapiens, Mus musculus, Drosophila melanogaster and Entamoeba histolytica via a protocol whereby DNASCANNER is used to identify a common set of statistically important signals flanking the insertion sites in the various genomes. These are used in ISF for insertion site prediction, and the current accuracy of the tool is over 65%. We find similar signals at gene boundaries and splice sites. Together, these data are suggestive of a common insertion mechanism that operates in a variety of eukaryotes. Oxford University Press 2011-09 2011-05-23 /pmc/articles/PMC3167599/ /pubmed/21609951 http://dx.doi.org/10.1093/nar/gkr337 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Rawal, Kamal Ramaswamy, Ram Genome-wide analysis of mobile genetic element insertion sites |
title | Genome-wide analysis of mobile genetic element insertion sites |
title_full | Genome-wide analysis of mobile genetic element insertion sites |
title_fullStr | Genome-wide analysis of mobile genetic element insertion sites |
title_full_unstemmed | Genome-wide analysis of mobile genetic element insertion sites |
title_short | Genome-wide analysis of mobile genetic element insertion sites |
title_sort | genome-wide analysis of mobile genetic element insertion sites |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3167599/ https://www.ncbi.nlm.nih.gov/pubmed/21609951 http://dx.doi.org/10.1093/nar/gkr337 |
work_keys_str_mv | AT rawalkamal genomewideanalysisofmobilegeneticelementinsertionsites AT ramaswamyram genomewideanalysisofmobilegeneticelementinsertionsites |