Cargando…
AgIn: measuring the landscape of CpG methylation of individual repetitive elements
Motivation: Determining the methylation state of regions with high copy numbers is challenging for second-generation sequencing, because the read length is insufficient to map reads uniquely, especially when repetitive regions are long and nearly identical to each other. Single-molecule real-time (S...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5039925/ https://www.ncbi.nlm.nih.gov/pubmed/27318202 http://dx.doi.org/10.1093/bioinformatics/btw360 |
_version_ | 1782456150496641024 |
---|---|
author | Suzuki, Yuta Korlach, Jonas Turner, Stephen W. Tsukahara, Tatsuya Taniguchi, Junko Qu, Wei Ichikawa, Kazuki Yoshimura, Jun Yurino, Hideaki Takahashi, Yuji Mitsui, Jun Ishiura, Hiroyuki Tsuji, Shoji Takeda, Hiroyuki Morishita, Shinichi |
author_facet | Suzuki, Yuta Korlach, Jonas Turner, Stephen W. Tsukahara, Tatsuya Taniguchi, Junko Qu, Wei Ichikawa, Kazuki Yoshimura, Jun Yurino, Hideaki Takahashi, Yuji Mitsui, Jun Ishiura, Hiroyuki Tsuji, Shoji Takeda, Hiroyuki Morishita, Shinichi |
author_sort | Suzuki, Yuta |
collection | PubMed |
description | Motivation: Determining the methylation state of regions with high copy numbers is challenging for second-generation sequencing, because the read length is insufficient to map reads uniquely, especially when repetitive regions are long and nearly identical to each other. Single-molecule real-time (SMRT) sequencing is a promising method for observing such regions, because it is not vulnerable to GC bias, it produces long read lengths, and its kinetic information is sensitive to DNA modifications. Results: We propose a novel linear-time algorithm that combines the kinetic information for neighboring CpG sites and increases the confidence in identifying the methylation states of those sites. Using a practical read coverage of ∼30-fold from an inbred strain medaka (Oryzias latipes), we observed that both the sensitivity and precision of our method on individual CpG sites were ∼93.7%. We also observed a high correlation coefficient (R = 0.884) between our method and bisulfite sequencing, and for 92.0% of CpG sites, methylation levels ranging over [0,1] were in concordance within an acceptable difference 0.25. Using this method, we characterized the landscape of the methylation status of repetitive elements, such as LINEs, in the human genome, thereby revealing the strong correlation between CpG density and hypomethylation and detecting hypomethylation hot spots of LTRs and LINEs. We uncovered the methylation states for nearly identical active transposons, two novel LINE insertions of identity ∼99% and length 6050 base pairs (bp) in the human genome, and 16 Tol2 elements of identity >99.8% and length 4682 bp in the medaka genome. Availability and Implementation: AgIn (Aggregate on Intervals) is available at: https://github.com/hacone/AgIn Contact: ysuzuki@cb.k.u-tokyo.ac.jp or moris@cb.k.u-tokyo.ac.jp Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-5039925 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-50399252016-09-29 AgIn: measuring the landscape of CpG methylation of individual repetitive elements Suzuki, Yuta Korlach, Jonas Turner, Stephen W. Tsukahara, Tatsuya Taniguchi, Junko Qu, Wei Ichikawa, Kazuki Yoshimura, Jun Yurino, Hideaki Takahashi, Yuji Mitsui, Jun Ishiura, Hiroyuki Tsuji, Shoji Takeda, Hiroyuki Morishita, Shinichi Bioinformatics Original Papers Motivation: Determining the methylation state of regions with high copy numbers is challenging for second-generation sequencing, because the read length is insufficient to map reads uniquely, especially when repetitive regions are long and nearly identical to each other. Single-molecule real-time (SMRT) sequencing is a promising method for observing such regions, because it is not vulnerable to GC bias, it produces long read lengths, and its kinetic information is sensitive to DNA modifications. Results: We propose a novel linear-time algorithm that combines the kinetic information for neighboring CpG sites and increases the confidence in identifying the methylation states of those sites. Using a practical read coverage of ∼30-fold from an inbred strain medaka (Oryzias latipes), we observed that both the sensitivity and precision of our method on individual CpG sites were ∼93.7%. We also observed a high correlation coefficient (R = 0.884) between our method and bisulfite sequencing, and for 92.0% of CpG sites, methylation levels ranging over [0,1] were in concordance within an acceptable difference 0.25. Using this method, we characterized the landscape of the methylation status of repetitive elements, such as LINEs, in the human genome, thereby revealing the strong correlation between CpG density and hypomethylation and detecting hypomethylation hot spots of LTRs and LINEs. We uncovered the methylation states for nearly identical active transposons, two novel LINE insertions of identity ∼99% and length 6050 base pairs (bp) in the human genome, and 16 Tol2 elements of identity >99.8% and length 4682 bp in the medaka genome. Availability and Implementation: AgIn (Aggregate on Intervals) is available at: https://github.com/hacone/AgIn Contact: ysuzuki@cb.k.u-tokyo.ac.jp or moris@cb.k.u-tokyo.ac.jp Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-10-01 2016-06-17 /pmc/articles/PMC5039925/ /pubmed/27318202 http://dx.doi.org/10.1093/bioinformatics/btw360 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Suzuki, Yuta Korlach, Jonas Turner, Stephen W. Tsukahara, Tatsuya Taniguchi, Junko Qu, Wei Ichikawa, Kazuki Yoshimura, Jun Yurino, Hideaki Takahashi, Yuji Mitsui, Jun Ishiura, Hiroyuki Tsuji, Shoji Takeda, Hiroyuki Morishita, Shinichi AgIn: measuring the landscape of CpG methylation of individual repetitive elements |
title | AgIn: measuring the landscape of CpG methylation of individual repetitive elements |
title_full | AgIn: measuring the landscape of CpG methylation of individual repetitive elements |
title_fullStr | AgIn: measuring the landscape of CpG methylation of individual repetitive elements |
title_full_unstemmed | AgIn: measuring the landscape of CpG methylation of individual repetitive elements |
title_short | AgIn: measuring the landscape of CpG methylation of individual repetitive elements |
title_sort | agin: measuring the landscape of cpg methylation of individual repetitive elements |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5039925/ https://www.ncbi.nlm.nih.gov/pubmed/27318202 http://dx.doi.org/10.1093/bioinformatics/btw360 |
work_keys_str_mv | AT suzukiyuta aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements AT korlachjonas aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements AT turnerstephenw aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements AT tsukaharatatsuya aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements AT taniguchijunko aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements AT quwei aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements AT ichikawakazuki aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements AT yoshimurajun aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements AT yurinohideaki aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements AT takahashiyuji aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements AT mitsuijun aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements AT ishiurahiroyuki aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements AT tsujishoji aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements AT takedahiroyuki aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements AT morishitashinichi aginmeasuringthelandscapeofcpgmethylationofindividualrepetitiveelements |