Cargando…

Computational identification of harmful mutation regions to the activity of transposable elements

BACKGROUND: Transposable elements (TEs) are interspersed DNA sequences that can move or copy to new positions within a genome. TEs are believed to promote speciation and their activities play a significant role in human disease. In the human genome, the 22 AluY and 6 AluS TE subfamilies have been th...

Descripción completa

Detalles Bibliográficos
Autores principales: Jin, Lingling, McQuillan, Ian, Li, Longhai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5773891/
https://www.ncbi.nlm.nih.gov/pubmed/29219079
http://dx.doi.org/10.1186/s12864-017-4227-z
_version_ 1783293658677641216
author Jin, Lingling
McQuillan, Ian
Li, Longhai
author_facet Jin, Lingling
McQuillan, Ian
Li, Longhai
author_sort Jin, Lingling
collection PubMed
description BACKGROUND: Transposable elements (TEs) are interspersed DNA sequences that can move or copy to new positions within a genome. TEs are believed to promote speciation and their activities play a significant role in human disease. In the human genome, the 22 AluY and 6 AluS TE subfamilies have been the most recently active, and their transposition has been implicated in many inherited human diseases and in various forms of cancer. Therefore, understanding their transposition activity is very important and identifying the factors that affect their transpositional activity is of great interest. Recently, there has been some work done to quantify the activity levels of active Alu TEs based on variation in the sequence. Given this activity data, an analysis of TE activity based on the position of mutations is conducted. RESULTS: A method/simulation is created to computationally predict so-called harmful mutation regions in the consensus sequence of a TE; that is, mutations that occur in these regions decrease the transpositional activity dramatically. The methods are applied to the most active subfamily, AluY, to identify the harmful regions, and seven harmful regions are identified within the AluY consensus with q-values less than 0.05. A supplementary simulation also shows that the identified harmful regions covering the AluYa5 RNA functional regions are not occurring by chance. This method is then applied to two additional TE families: the Alu family and the L1 family, to computationally detect the harmful regions in these elements. CONCLUSIONS: We use a computational method to identify a set of harmful mutation regions. Mutations within the identified harmful regions decrease the transpositional activity of active elements. The correlation between the mutations within these regions and the transpositional activity of TEs are shown to be statistically significant. Verifications are presented using the activity of AluY elements and the secondary structure of the AluYa5 RNA, providing evidence that the method is successfully identifying harmful mutation regions.
format Online
Article
Text
id pubmed-5773891
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57738912018-01-26 Computational identification of harmful mutation regions to the activity of transposable elements Jin, Lingling McQuillan, Ian Li, Longhai BMC Genomics Research BACKGROUND: Transposable elements (TEs) are interspersed DNA sequences that can move or copy to new positions within a genome. TEs are believed to promote speciation and their activities play a significant role in human disease. In the human genome, the 22 AluY and 6 AluS TE subfamilies have been the most recently active, and their transposition has been implicated in many inherited human diseases and in various forms of cancer. Therefore, understanding their transposition activity is very important and identifying the factors that affect their transpositional activity is of great interest. Recently, there has been some work done to quantify the activity levels of active Alu TEs based on variation in the sequence. Given this activity data, an analysis of TE activity based on the position of mutations is conducted. RESULTS: A method/simulation is created to computationally predict so-called harmful mutation regions in the consensus sequence of a TE; that is, mutations that occur in these regions decrease the transpositional activity dramatically. The methods are applied to the most active subfamily, AluY, to identify the harmful regions, and seven harmful regions are identified within the AluY consensus with q-values less than 0.05. A supplementary simulation also shows that the identified harmful regions covering the AluYa5 RNA functional regions are not occurring by chance. This method is then applied to two additional TE families: the Alu family and the L1 family, to computationally detect the harmful regions in these elements. CONCLUSIONS: We use a computational method to identify a set of harmful mutation regions. Mutations within the identified harmful regions decrease the transpositional activity of active elements. The correlation between the mutations within these regions and the transpositional activity of TEs are shown to be statistically significant. Verifications are presented using the activity of AluY elements and the secondary structure of the AluYa5 RNA, providing evidence that the method is successfully identifying harmful mutation regions. BioMed Central 2017-11-17 /pmc/articles/PMC5773891/ /pubmed/29219079 http://dx.doi.org/10.1186/s12864-017-4227-z Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Jin, Lingling
McQuillan, Ian
Li, Longhai
Computational identification of harmful mutation regions to the activity of transposable elements
title Computational identification of harmful mutation regions to the activity of transposable elements
title_full Computational identification of harmful mutation regions to the activity of transposable elements
title_fullStr Computational identification of harmful mutation regions to the activity of transposable elements
title_full_unstemmed Computational identification of harmful mutation regions to the activity of transposable elements
title_short Computational identification of harmful mutation regions to the activity of transposable elements
title_sort computational identification of harmful mutation regions to the activity of transposable elements
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5773891/
https://www.ncbi.nlm.nih.gov/pubmed/29219079
http://dx.doi.org/10.1186/s12864-017-4227-z
work_keys_str_mv AT jinlingling computationalidentificationofharmfulmutationregionstotheactivityoftransposableelements
AT mcquillanian computationalidentificationofharmfulmutationregionstotheactivityoftransposableelements
AT lilonghai computationalidentificationofharmfulmutationregionstotheactivityoftransposableelements