Cargando…
Computational identification of harmful mutation regions to the activity of transposable elements
BACKGROUND: Transposable elements (TEs) are interspersed DNA sequences that can move or copy to new positions within a genome. TEs are believed to promote speciation and their activities play a significant role in human disease. In the human genome, the 22 AluY and 6 AluS TE subfamilies have been th...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5773891/ https://www.ncbi.nlm.nih.gov/pubmed/29219079 http://dx.doi.org/10.1186/s12864-017-4227-z |
_version_ | 1783293658677641216 |
---|---|
author | Jin, Lingling McQuillan, Ian Li, Longhai |
author_facet | Jin, Lingling McQuillan, Ian Li, Longhai |
author_sort | Jin, Lingling |
collection | PubMed |
description | BACKGROUND: Transposable elements (TEs) are interspersed DNA sequences that can move or copy to new positions within a genome. TEs are believed to promote speciation and their activities play a significant role in human disease. In the human genome, the 22 AluY and 6 AluS TE subfamilies have been the most recently active, and their transposition has been implicated in many inherited human diseases and in various forms of cancer. Therefore, understanding their transposition activity is very important and identifying the factors that affect their transpositional activity is of great interest. Recently, there has been some work done to quantify the activity levels of active Alu TEs based on variation in the sequence. Given this activity data, an analysis of TE activity based on the position of mutations is conducted. RESULTS: A method/simulation is created to computationally predict so-called harmful mutation regions in the consensus sequence of a TE; that is, mutations that occur in these regions decrease the transpositional activity dramatically. The methods are applied to the most active subfamily, AluY, to identify the harmful regions, and seven harmful regions are identified within the AluY consensus with q-values less than 0.05. A supplementary simulation also shows that the identified harmful regions covering the AluYa5 RNA functional regions are not occurring by chance. This method is then applied to two additional TE families: the Alu family and the L1 family, to computationally detect the harmful regions in these elements. CONCLUSIONS: We use a computational method to identify a set of harmful mutation regions. Mutations within the identified harmful regions decrease the transpositional activity of active elements. The correlation between the mutations within these regions and the transpositional activity of TEs are shown to be statistically significant. Verifications are presented using the activity of AluY elements and the secondary structure of the AluYa5 RNA, providing evidence that the method is successfully identifying harmful mutation regions. |
format | Online Article Text |
id | pubmed-5773891 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-57738912018-01-26 Computational identification of harmful mutation regions to the activity of transposable elements Jin, Lingling McQuillan, Ian Li, Longhai BMC Genomics Research BACKGROUND: Transposable elements (TEs) are interspersed DNA sequences that can move or copy to new positions within a genome. TEs are believed to promote speciation and their activities play a significant role in human disease. In the human genome, the 22 AluY and 6 AluS TE subfamilies have been the most recently active, and their transposition has been implicated in many inherited human diseases and in various forms of cancer. Therefore, understanding their transposition activity is very important and identifying the factors that affect their transpositional activity is of great interest. Recently, there has been some work done to quantify the activity levels of active Alu TEs based on variation in the sequence. Given this activity data, an analysis of TE activity based on the position of mutations is conducted. RESULTS: A method/simulation is created to computationally predict so-called harmful mutation regions in the consensus sequence of a TE; that is, mutations that occur in these regions decrease the transpositional activity dramatically. The methods are applied to the most active subfamily, AluY, to identify the harmful regions, and seven harmful regions are identified within the AluY consensus with q-values less than 0.05. A supplementary simulation also shows that the identified harmful regions covering the AluYa5 RNA functional regions are not occurring by chance. This method is then applied to two additional TE families: the Alu family and the L1 family, to computationally detect the harmful regions in these elements. CONCLUSIONS: We use a computational method to identify a set of harmful mutation regions. Mutations within the identified harmful regions decrease the transpositional activity of active elements. The correlation between the mutations within these regions and the transpositional activity of TEs are shown to be statistically significant. Verifications are presented using the activity of AluY elements and the secondary structure of the AluYa5 RNA, providing evidence that the method is successfully identifying harmful mutation regions. BioMed Central 2017-11-17 /pmc/articles/PMC5773891/ /pubmed/29219079 http://dx.doi.org/10.1186/s12864-017-4227-z Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Jin, Lingling McQuillan, Ian Li, Longhai Computational identification of harmful mutation regions to the activity of transposable elements |
title | Computational identification of harmful mutation regions to the activity of transposable elements |
title_full | Computational identification of harmful mutation regions to the activity of transposable elements |
title_fullStr | Computational identification of harmful mutation regions to the activity of transposable elements |
title_full_unstemmed | Computational identification of harmful mutation regions to the activity of transposable elements |
title_short | Computational identification of harmful mutation regions to the activity of transposable elements |
title_sort | computational identification of harmful mutation regions to the activity of transposable elements |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5773891/ https://www.ncbi.nlm.nih.gov/pubmed/29219079 http://dx.doi.org/10.1186/s12864-017-4227-z |
work_keys_str_mv | AT jinlingling computationalidentificationofharmfulmutationregionstotheactivityoftransposableelements AT mcquillanian computationalidentificationofharmfulmutationregionstotheactivityoftransposableelements AT lilonghai computationalidentificationofharmfulmutationregionstotheactivityoftransposableelements |