Cargando…
Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern
BACKGROUND: Genome-wide methylation arrays are increasingly used tools in studies of complex medical disorders. Because of their expense and potential utility to the scientific community, current federal policy dictates that data from these arrays, like those from genome-wide genotyping arrays, be d...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4391334/ https://www.ncbi.nlm.nih.gov/pubmed/25859287 http://dx.doi.org/10.1186/1868-7083-6-28 |
_version_ | 1782365810092670976 |
---|---|
author | Philibert, Robert A Terry, Nicolas Erwin, Cheryl Philibert, Winter J Beach, Steven RH Brody, Gene H |
author_facet | Philibert, Robert A Terry, Nicolas Erwin, Cheryl Philibert, Winter J Beach, Steven RH Brody, Gene H |
author_sort | Philibert, Robert A |
collection | PubMed |
description | BACKGROUND: Genome-wide methylation arrays are increasingly used tools in studies of complex medical disorders. Because of their expense and potential utility to the scientific community, current federal policy dictates that data from these arrays, like those from genome-wide genotyping arrays, be deposited in publicly available databases. Unlike the genotyping information, access to the expression data is not restricted. An underlying supposition in the current nonrestricted access to methylation data is the belief that protected health and personal identifying information cannot be simultaneously extracted from these arrays. RESULTS: In this communication, we analyze methylation data from the Illumina HumanMethylation450 array and show that genotype at 1,069 highly informative loci, and both alcohol and smoking consumption information, can be derived from the array data. CONCLUSIONS: We conclude that both potentially personally identifying information and substance-use histories can be simultaneously derived from methylation array data. Because access to genetic information about a database subject or one of their relatives is critical to the de-identification process, this risk of de-identification is limited at the current time. We propose that access to genome-wide methylation data be restricted to institutionally approved investigators who accede to data use agreements prohibiting re-identification. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1868-7083-6-28) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4391334 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43913342015-04-10 Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern Philibert, Robert A Terry, Nicolas Erwin, Cheryl Philibert, Winter J Beach, Steven RH Brody, Gene H Clin Epigenetics Research BACKGROUND: Genome-wide methylation arrays are increasingly used tools in studies of complex medical disorders. Because of their expense and potential utility to the scientific community, current federal policy dictates that data from these arrays, like those from genome-wide genotyping arrays, be deposited in publicly available databases. Unlike the genotyping information, access to the expression data is not restricted. An underlying supposition in the current nonrestricted access to methylation data is the belief that protected health and personal identifying information cannot be simultaneously extracted from these arrays. RESULTS: In this communication, we analyze methylation data from the Illumina HumanMethylation450 array and show that genotype at 1,069 highly informative loci, and both alcohol and smoking consumption information, can be derived from the array data. CONCLUSIONS: We conclude that both potentially personally identifying information and substance-use histories can be simultaneously derived from methylation array data. Because access to genetic information about a database subject or one of their relatives is critical to the de-identification process, this risk of de-identification is limited at the current time. We propose that access to genome-wide methylation data be restricted to institutionally approved investigators who accede to data use agreements prohibiting re-identification. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1868-7083-6-28) contains supplementary material, which is available to authorized users. BioMed Central 2014-11-19 /pmc/articles/PMC4391334/ /pubmed/25859287 http://dx.doi.org/10.1186/1868-7083-6-28 Text en © Philibert et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Philibert, Robert A Terry, Nicolas Erwin, Cheryl Philibert, Winter J Beach, Steven RH Brody, Gene H Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern |
title | Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern |
title_full | Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern |
title_fullStr | Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern |
title_full_unstemmed | Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern |
title_short | Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern |
title_sort | methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4391334/ https://www.ncbi.nlm.nih.gov/pubmed/25859287 http://dx.doi.org/10.1186/1868-7083-6-28 |
work_keys_str_mv | AT philibertroberta methylationarraydatacansimultaneouslyidentifyindividualsandconveyprotectedhealthinformationanunrecognizedethicalconcern AT terrynicolas methylationarraydatacansimultaneouslyidentifyindividualsandconveyprotectedhealthinformationanunrecognizedethicalconcern AT erwincheryl methylationarraydatacansimultaneouslyidentifyindividualsandconveyprotectedhealthinformationanunrecognizedethicalconcern AT philibertwinterj methylationarraydatacansimultaneouslyidentifyindividualsandconveyprotectedhealthinformationanunrecognizedethicalconcern AT beachstevenrh methylationarraydatacansimultaneouslyidentifyindividualsandconveyprotectedhealthinformationanunrecognizedethicalconcern AT brodygeneh methylationarraydatacansimultaneouslyidentifyindividualsandconveyprotectedhealthinformationanunrecognizedethicalconcern |