Cargando…
Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods
Illumina Infinium DNA Methylation (5mC) arrays are a popular technology for low-cost, high-throughput, genome-scale measurement of 5mC distribution, especially in cancer and other complex diseases. After the success of its HumanMethylation450 array (450k), Illumina released the MethylationEPIC array...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Taylor & Francis
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9665128/ https://www.ncbi.nlm.nih.gov/pubmed/36354000 http://dx.doi.org/10.1080/15592294.2022.2135201 |
_version_ | 1784831225455706112 |
---|---|
author | Bizet, Martin Defrance, Matthieu Calonne, Emilie Bontempi, Gianluca Sotiriou, Christos Fuks, François Jeschke, Jana |
author_facet | Bizet, Martin Defrance, Matthieu Calonne, Emilie Bontempi, Gianluca Sotiriou, Christos Fuks, François Jeschke, Jana |
author_sort | Bizet, Martin |
collection | PubMed |
description | Illumina Infinium DNA Methylation (5mC) arrays are a popular technology for low-cost, high-throughput, genome-scale measurement of 5mC distribution, especially in cancer and other complex diseases. After the success of its HumanMethylation450 array (450k), Illumina released the MethylationEPIC array (850k) featuring increased coverage of enhancers. Despite the widespread use of 850k, analysis of the corresponding data remains suboptimal: it still relies mostly on Illumina’s default annotation, which underestimates enhancerss and long noncoding RNAs. Results: We have thus developed an approach, based on the ENCODE and LNCipedia databases, which greatly improves upon Illumina’s default annotation of enhancers and long noncoding transcripts. We compared the re-annotated 850k with both 450k and reduced-representation bisulphite sequencing (RRBS), another high-throughput 5mC profiling technology. We found 850k to cover at least three times as many enhancers and long noncoding RNAs as either 450k or RRBS. We further investigated the reproducibility of the three technologies, applying various normalization methods to the 850k data. Most of these methods reduced variability to a level below that of RRBS data. We then used 850k with our new annotation and normalization to profile 5mC changes in breast cancer biopsies. 850k highlighted aberrant enhancer methylation as the predominant feature, in agreement with previous reports. Our study provides an updated processing approach for 850k data, based on refined probe annotation and normalization, allowing for improved analysis of methylation at enhancers and long noncoding RNA genes. Our findings will help to further advance understanding of the DNA methylome in health and disease. |
format | Online Article Text |
id | pubmed-9665128 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Taylor & Francis |
record_format | MEDLINE/PubMed |
spelling | pubmed-96651282022-11-15 Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods Bizet, Martin Defrance, Matthieu Calonne, Emilie Bontempi, Gianluca Sotiriou, Christos Fuks, François Jeschke, Jana Epigenetics Research Paper Illumina Infinium DNA Methylation (5mC) arrays are a popular technology for low-cost, high-throughput, genome-scale measurement of 5mC distribution, especially in cancer and other complex diseases. After the success of its HumanMethylation450 array (450k), Illumina released the MethylationEPIC array (850k) featuring increased coverage of enhancers. Despite the widespread use of 850k, analysis of the corresponding data remains suboptimal: it still relies mostly on Illumina’s default annotation, which underestimates enhancerss and long noncoding RNAs. Results: We have thus developed an approach, based on the ENCODE and LNCipedia databases, which greatly improves upon Illumina’s default annotation of enhancers and long noncoding transcripts. We compared the re-annotated 850k with both 450k and reduced-representation bisulphite sequencing (RRBS), another high-throughput 5mC profiling technology. We found 850k to cover at least three times as many enhancers and long noncoding RNAs as either 450k or RRBS. We further investigated the reproducibility of the three technologies, applying various normalization methods to the 850k data. Most of these methods reduced variability to a level below that of RRBS data. We then used 850k with our new annotation and normalization to profile 5mC changes in breast cancer biopsies. 850k highlighted aberrant enhancer methylation as the predominant feature, in agreement with previous reports. Our study provides an updated processing approach for 850k data, based on refined probe annotation and normalization, allowing for improved analysis of methylation at enhancers and long noncoding RNA genes. Our findings will help to further advance understanding of the DNA methylome in health and disease. Taylor & Francis 2022-11-10 /pmc/articles/PMC9665128/ /pubmed/36354000 http://dx.doi.org/10.1080/15592294.2022.2135201 Text en © 2022 Université Libre de Bruxelles. Published by Informa UK Limited, trading as Taylor & Francis Group. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Paper Bizet, Martin Defrance, Matthieu Calonne, Emilie Bontempi, Gianluca Sotiriou, Christos Fuks, François Jeschke, Jana Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods |
title | Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods |
title_full | Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods |
title_fullStr | Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods |
title_full_unstemmed | Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods |
title_short | Improving Infinium MethylationEPIC data processing: re-annotation of enhancers and long noncoding RNA genes and benchmarking of normalization methods |
title_sort | improving infinium methylationepic data processing: re-annotation of enhancers and long noncoding rna genes and benchmarking of normalization methods |
topic | Research Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9665128/ https://www.ncbi.nlm.nih.gov/pubmed/36354000 http://dx.doi.org/10.1080/15592294.2022.2135201 |
work_keys_str_mv | AT bizetmartin improvinginfiniummethylationepicdataprocessingreannotationofenhancersandlongnoncodingrnagenesandbenchmarkingofnormalizationmethods AT defrancematthieu improvinginfiniummethylationepicdataprocessingreannotationofenhancersandlongnoncodingrnagenesandbenchmarkingofnormalizationmethods AT calonneemilie improvinginfiniummethylationepicdataprocessingreannotationofenhancersandlongnoncodingrnagenesandbenchmarkingofnormalizationmethods AT bontempigianluca improvinginfiniummethylationepicdataprocessingreannotationofenhancersandlongnoncodingrnagenesandbenchmarkingofnormalizationmethods AT sotiriouchristos improvinginfiniummethylationepicdataprocessingreannotationofenhancersandlongnoncodingrnagenesandbenchmarkingofnormalizationmethods AT fuksfrancois improvinginfiniummethylationepicdataprocessingreannotationofenhancersandlongnoncodingrnagenesandbenchmarkingofnormalizationmethods AT jeschkejana improvinginfiniummethylationepicdataprocessingreannotationofenhancersandlongnoncodingrnagenesandbenchmarkingofnormalizationmethods |