Cargando…

AgIn: measuring the landscape of CpG methylation of individual repetitive elements

Motivation: Determining the methylation state of regions with high copy numbers is challenging for second-generation sequencing, because the read length is insufficient to map reads uniquely, especially when repetitive regions are long and nearly identical to each other. Single-molecule real-time (S...

Descripción completa

Detalles Bibliográficos
Autores principales: Suzuki, Yuta, Korlach, Jonas, Turner, Stephen W., Tsukahara, Tatsuya, Taniguchi, Junko, Qu, Wei, Ichikawa, Kazuki, Yoshimura, Jun, Yurino, Hideaki, Takahashi, Yuji, Mitsui, Jun, Ishiura, Hiroyuki, Tsuji, Shoji, Takeda, Hiroyuki, Morishita, Shinichi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5039925/
https://www.ncbi.nlm.nih.gov/pubmed/27318202
http://dx.doi.org/10.1093/bioinformatics/btw360
_version_ 1782456150496641024
author Suzuki, Yuta
Korlach, Jonas
Turner, Stephen W.
Tsukahara, Tatsuya
Taniguchi, Junko
Qu, Wei
Ichikawa, Kazuki
Yoshimura, Jun
Yurino, Hideaki
Takahashi, Yuji
Mitsui, Jun
Ishiura, Hiroyuki
Tsuji, Shoji
Takeda, Hiroyuki
Morishita, Shinichi
author_facet Suzuki, Yuta
Korlach, Jonas
Turner, Stephen W.
Tsukahara, Tatsuya
Taniguchi, Junko
Qu, Wei
Ichikawa, Kazuki
Yoshimura, Jun
Yurino, Hideaki
Takahashi, Yuji
Mitsui, Jun
Ishiura, Hiroyuki
Tsuji, Shoji
Takeda, Hiroyuki
Morishita, Shinichi
author_sort Suzuki, Yuta
collection PubMed
description Motivation: Determining the methylation state of regions with high copy numbers is challenging for second-generation sequencing, because the read length is insufficient to map reads uniquely, especially when repetitive regions are long and nearly identical to each other. Single-molecule real-time (SMRT) sequencing is a promising method for observing such regions, because it is not vulnerable to GC bias, it produces long read lengths, and its kinetic information is sensitive to DNA modifications. Results: We propose a novel linear-time algorithm that combines the kinetic information for neighboring CpG sites and increases the confidence in identifying the methylation states of those sites. Using a practical read coverage of ∼30-fold from an inbred strain medaka (Oryzias latipes), we observed that both the sensitivity and precision of our method on individual CpG sites were ∼93.7%. We also observed a high correlation coefficient (R = 0.884) between our method and bisulfite sequencing, and for 92.0% of CpG sites, methylation levels ranging over [0,1] were in concordance within an acceptable difference 0.25. Using this method, we characterized the landscape of the methylation status of repetitive elements, such as LINEs, in the human genome, thereby revealing the strong correlation between CpG density and hypomethylation and detecting hypomethylation hot spots of LTRs and LINEs. We uncovered the methylation states for nearly identical active transposons, two novel LINE insertions of identity ∼99% and length 6050 base pairs (bp) in the human genome, and 16 Tol2 elements of identity >99.8% and length 4682 bp in the medaka genome. Availability and Implementation: AgIn (Aggregate on Intervals) is available at: https://github.com/hacone/AgIn Contact: ysuzuki@cb.k.u-tokyo.ac.jp or moris@cb.k.u-tokyo.ac.jp Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5039925
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-50399252016-09-29 AgIn: measuring the landscape of CpG methylation of individual repetitive elements Suzuki, Yuta Korlach, Jonas Turner, Stephen W. Tsukahara, Tatsuya Taniguchi, Junko Qu, Wei Ichikawa, Kazuki Yoshimura, Jun Yurino, Hideaki Takahashi, Yuji Mitsui, Jun Ishiura, Hiroyuki Tsuji, Shoji Takeda, Hiroyuki Morishita, Shinichi Bioinformatics Original Papers Motivation: Determining the methylation state of regions with high copy numbers is challenging for second-generation sequencing, because the read length is insufficient to map reads uniquely, especially when repetitive regions are long and nearly identical to each other. Single-molecule real-time (SMRT) sequencing is a promising method for observing such regions, because it is not vulnerable to GC bias, it produces long read lengths, and its kinetic information is sensitive to DNA modifications. Results: We propose a novel linear-time algorithm that combines the kinetic information for neighboring CpG sites and increases the confidence in identifying the methylation states of those sites. Using a practical read coverage of ∼30-fold from an inbred strain medaka (Oryzias latipes), we observed that both the sensitivity and precision of our method on individual CpG sites were ∼93.7%. We also observed a high correlation coefficient (R = 0.884) between our method and bisulfite sequencing, and for 92.0% of CpG sites, methylation levels ranging over [0,1] were in concordance within an acceptable difference 0.25. Using this method, we characterized the landscape of the methylation status of repetitive elements, such as LINEs, in the human genome, thereby revealing the strong correlation between CpG density and hypomethylation and detecting hypomethylation hot spots of LTRs and LINEs. We uncovered the methylation states for nearly identical active transposons, two novel LINE insertions of identity ∼99% and length 6050 base pairs (bp) in the human genome, and 16 Tol2 elements of identity >99.8% and length 4682 bp in the medaka genome. Availability and Implementation: AgIn (Aggregate on Intervals) is available at: https://github.com/hacone/AgIn Contact: ysuzuki@cb.k.u-tokyo.ac.jp or moris@cb.k.u-tokyo.ac.jp Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-10-01 2016-06-17 /pmc/articles/PMC5039925/ /pubmed/27318202 http://dx.doi.org/10.1093/bioinformatics/btw360 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Suzuki, Yuta
Korlach, Jonas
Turner, Stephen W.
Tsukahara, Tatsuya
Taniguchi, Junko
Qu, Wei
Ichikawa, Kazuki
Yoshimura, Jun
Yurino, Hideaki
Takahashi, Yuji
Mitsui, Jun
Ishiura, Hiroyuki
Tsuji, Shoji
Takeda, Hiroyuki
Morishita, Shinichi
AgIn: measuring the landscape of CpG methylation of individual repetitive elements
title AgIn: measuring the landscape of CpG methylation of individual repetitive elements
title_full AgIn: measuring the landscape of CpG methylation of individual repetitive elements
title_fullStr AgIn: measuring the landscape of CpG methylation of individual repetitive elements
title_full_unstemmed AgIn: measuring the landscape of CpG methylation of individual repetitive elements
title_short AgIn: measuring the landscape of CpG methylation of individual repetitive elements
title_sort agin: measuring the landscape of cpg methylation of individual repetitive elements
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5039925/
https://www.ncbi.nlm.nih.gov/pubmed/27318202
http://dx.doi.org/10.1093/bioinformatics/btw360
work_keys_str_mv AT suzukiyuta aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements
AT korlachjonas aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements
AT turnerstephenw aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements
AT tsukaharatatsuya aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements
AT taniguchijunko aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements
AT quwei aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements
AT ichikawakazuki aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements
AT yoshimurajun aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements
AT yurinohideaki aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements
AT takahashiyuji aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements
AT mitsuijun aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements
AT ishiurahiroyuki aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements
AT tsujishoji aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements
AT takedahiroyuki aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements
AT morishitashinichi aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements