Cargando…

New bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome

Endogenous retroviruses (ERVs) and LTR retrotransposons (LRs) occupy ∼8% of human genome. Deep sequencing technologies provide clues to understanding of functional relevance of individual ERVs/LRs by enabling direct identification of transcription factor binding sites (TFBS) and other landmarks of f...

Descripción completa

Detalles Bibliográficos
Autores principales: Garazha, Andrew, Ivanova, Alena, Suntsova, Maria, Malakhova, Galina, Roumiantsev, Sergey, Zhavoronkov, Alex, Buzdin, Anton
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Taylor & Francis 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4612461/
https://www.ncbi.nlm.nih.gov/pubmed/25853282
http://dx.doi.org/10.1080/15384101.2015.1022696
_version_ 1782396174050787328
author Garazha, Andrew
Ivanova, Alena
Suntsova, Maria
Malakhova, Galina
Roumiantsev, Sergey
Zhavoronkov, Alex
Buzdin, Anton
author_facet Garazha, Andrew
Ivanova, Alena
Suntsova, Maria
Malakhova, Galina
Roumiantsev, Sergey
Zhavoronkov, Alex
Buzdin, Anton
author_sort Garazha, Andrew
collection PubMed
description Endogenous retroviruses (ERVs) and LTR retrotransposons (LRs) occupy ∼8% of human genome. Deep sequencing technologies provide clues to understanding of functional relevance of individual ERVs/LRs by enabling direct identification of transcription factor binding sites (TFBS) and other landmarks of functional genomic elements. Here, we performed the genome-wide identification of human ERVs/LRs containing TFBS according to the ENCODE project. We created the first interactive ERV/LRs database that groups the individual inserts according to their familial nomenclature, number of mapped TFBS and divergence from their consensus sequence. Information on any particular element can be easily extracted by the user. We also created a genome browser tool, which enables quick mapping of any ERV/LR insert according to genomic coordinates, known human genes and TFBS. These tools can be used to easily explore functionally relevant individual ERV/LRs, and for studying their impact on the regulation of human genes. Overall, we identified ∼110,000 ERV/LR genomic elements having TFBS. We propose a hypothesis of “domestication” of ERV/LR TFBS by the genome milieu including subsequent stages of initial epigenetic repression, partial functional release, and further mutation-driven reshaping of TFBS in tight coevolution with the enclosing genomic loci.
format Online
Article
Text
id pubmed-4612461
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Taylor & Francis
record_format MEDLINE/PubMed
spelling pubmed-46124612016-02-03 New bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome Garazha, Andrew Ivanova, Alena Suntsova, Maria Malakhova, Galina Roumiantsev, Sergey Zhavoronkov, Alex Buzdin, Anton Cell Cycle Report Endogenous retroviruses (ERVs) and LTR retrotransposons (LRs) occupy ∼8% of human genome. Deep sequencing technologies provide clues to understanding of functional relevance of individual ERVs/LRs by enabling direct identification of transcription factor binding sites (TFBS) and other landmarks of functional genomic elements. Here, we performed the genome-wide identification of human ERVs/LRs containing TFBS according to the ENCODE project. We created the first interactive ERV/LRs database that groups the individual inserts according to their familial nomenclature, number of mapped TFBS and divergence from their consensus sequence. Information on any particular element can be easily extracted by the user. We also created a genome browser tool, which enables quick mapping of any ERV/LR insert according to genomic coordinates, known human genes and TFBS. These tools can be used to easily explore functionally relevant individual ERV/LRs, and for studying their impact on the regulation of human genes. Overall, we identified ∼110,000 ERV/LR genomic elements having TFBS. We propose a hypothesis of “domestication” of ERV/LR TFBS by the genome milieu including subsequent stages of initial epigenetic repression, partial functional release, and further mutation-driven reshaping of TFBS in tight coevolution with the enclosing genomic loci. Taylor & Francis 2015-04-08 /pmc/articles/PMC4612461/ /pubmed/25853282 http://dx.doi.org/10.1080/15384101.2015.1022696 Text en © 2015 The Author(s). Published with license by Taylor & Francis http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution-Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. The moral rights of the named author(s) have been asserted.
spellingShingle Report
Garazha, Andrew
Ivanova, Alena
Suntsova, Maria
Malakhova, Galina
Roumiantsev, Sergey
Zhavoronkov, Alex
Buzdin, Anton
New bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome
title New bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome
title_full New bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome
title_fullStr New bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome
title_full_unstemmed New bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome
title_short New bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome
title_sort new bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome
topic Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4612461/
https://www.ncbi.nlm.nih.gov/pubmed/25853282
http://dx.doi.org/10.1080/15384101.2015.1022696
work_keys_str_mv AT garazhaandrew newbioinformatictoolforquickidentificationoffunctionallyrelevantendogenousretroviralinsertsinhumangenome
AT ivanovaalena newbioinformatictoolforquickidentificationoffunctionallyrelevantendogenousretroviralinsertsinhumangenome
AT suntsovamaria newbioinformatictoolforquickidentificationoffunctionallyrelevantendogenousretroviralinsertsinhumangenome
AT malakhovagalina newbioinformatictoolforquickidentificationoffunctionallyrelevantendogenousretroviralinsertsinhumangenome
AT roumiantsevsergey newbioinformatictoolforquickidentificationoffunctionallyrelevantendogenousretroviralinsertsinhumangenome
AT zhavoronkovalex newbioinformatictoolforquickidentificationoffunctionallyrelevantendogenousretroviralinsertsinhumangenome
AT buzdinanton newbioinformatictoolforquickidentificationoffunctionallyrelevantendogenousretroviralinsertsinhumangenome