
Assessing the Expression of Long INterspersed Elements (LINEs) via Long-Read Sequencing in Diverse Human Tissues and Cell Lines

Transposable elements, such as Long INterspersed Elements (LINEs), are DNA sequences that can replicate within genomes. LINEs replicate using an RNA intermediate followed by reverse transcription and are typically a few kilobases in length. LINE activity creates genomic structural variants in human...


Bibliographic Details
Main authors: Rybacki, Karleena; Xia, Mingyi; Ahsan, Mian Umair; Xing, Jinchuan; Wang, Kai
Format: Online Article Text
Language: English
Published: MDPI 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10606529/
https://www.ncbi.nlm.nih.gov/pubmed/37895242
http://dx.doi.org/10.3390/genes14101893
author Rybacki, Karleena
Xia, Mingyi
Ahsan, Mian Umair
Xing, Jinchuan
Wang, Kai
collection PubMed
description Transposable elements, such as Long INterspersed Elements (LINEs), are DNA sequences that can replicate within genomes. LINEs replicate using an RNA intermediate followed by reverse transcription and are typically a few kilobases in length. LINE activity creates genomic structural variants in human populations and leads to somatic alterations in cancer genomes. Long-read RNA sequencing technologies, including Oxford Nanopore and PacBio, can directly sequence relatively long transcripts, thus providing the opportunity to examine full-length LINE transcripts. This study focuses on the development of a new bioinformatics pipeline for the identification and quantification of active, full-length LINE transcripts in diverse human tissues and cell lines. In our pipeline, we utilized RepeatMasker to identify LINE-1 (L1) transcripts from long-read transcriptome data and incorporated several criteria, such as transcript start position, divergence, and length, to remove likely false positives. Comparisons between cancerous and normal cell lines, as well as human tissue samples, revealed elevated expression levels of young LINEs in cancer, particularly at intact L1 loci. By employing bioinformatics methodologies on long-read transcriptome data, this study demonstrates the landscape of L1 expression in tissues and cell lines.
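The filtering step named in the description (start position, divergence, and length criteria applied to RepeatMasker L1 annotations) can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the record fields, the threshold values, and the reading of "transcript start position" as the offset of the L1 match within the read are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class L1Annotation:
    """One hypothetical RepeatMasker-style annotation on a long read."""
    read_id: str
    repeat_name: str       # e.g. "L1HS", "L1PA2"
    divergence_pct: float  # percent divergence from the repeat consensus
    read_offset: int       # 0-based position in the read where the L1 match begins
    match_length: int      # length of the L1 match in bases


# Hypothetical thresholds, chosen only for illustration (not from the paper).
MAX_DIVERGENCE_PCT = 5.0   # keep "young" elements close to consensus
MAX_READ_OFFSET = 100      # L1 should begin near the start of the transcript
MIN_MATCH_LENGTH = 5000    # approximate a full-length, multi-kilobase element


def is_candidate(ann: L1Annotation) -> bool:
    """Return True if the annotation passes all three illustrative filters."""
    return (
        ann.repeat_name.startswith("L1")
        and ann.divergence_pct <= MAX_DIVERGENCE_PCT
        and ann.read_offset <= MAX_READ_OFFSET
        and ann.match_length >= MIN_MATCH_LENGTH
    )


def filter_candidates(annotations: Iterable[L1Annotation]) -> List[L1Annotation]:
    """Drop likely false positives, keeping only annotations that pass every filter."""
    return [ann for ann in annotations if is_candidate(ann)]
```

Applying all three criteria together is the point of the sketch: a read with a short or highly diverged L1 match, or one where the L1 sequence begins far into the read, is treated as a likely false positive and discarded.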
format Online
Article
Text
id pubmed-10606529
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-10606529 2023-10-28 Genes (Basel) Article MDPI 2023-09-29 /pmc/articles/PMC10606529/ /pubmed/37895242 http://dx.doi.org/10.3390/genes14101893 Text en © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title Assessing the Expression of Long INterspersed Elements (LINEs) via Long-Read Sequencing in Diverse Human Tissues and Cell Lines
topic Article