Cargando…

Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern

BACKGROUND: Genome-wide methylation arrays are increasingly used tools in studies of complex medical disorders. Because of their expense and potential utility to the scientific community, current federal policy dictates that data from these arrays, like those from genome-wide genotyping arrays, be d...

Descripción completa

Detalles Bibliográficos
Autores principales: Philibert, Robert A, Terry, Nicolas, Erwin, Cheryl, Philibert, Winter J, Beach, Steven RH, Brody, Gene H
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4391334/
https://www.ncbi.nlm.nih.gov/pubmed/25859287
http://dx.doi.org/10.1186/1868-7083-6-28
_version_ 1782365810092670976
author Philibert, Robert A
Terry, Nicolas
Erwin, Cheryl
Philibert, Winter J
Beach, Steven RH
Brody, Gene H
author_facet Philibert, Robert A
Terry, Nicolas
Erwin, Cheryl
Philibert, Winter J
Beach, Steven RH
Brody, Gene H
author_sort Philibert, Robert A
collection PubMed
description BACKGROUND: Genome-wide methylation arrays are increasingly used tools in studies of complex medical disorders. Because of their expense and potential utility to the scientific community, current federal policy dictates that data from these arrays, like those from genome-wide genotyping arrays, be deposited in publicly available databases. Unlike the genotyping information, access to the expression data is not restricted. An underlying supposition in the current nonrestricted access to methylation data is the belief that protected health and personal identifying information cannot be simultaneously extracted from these arrays. RESULTS: In this communication, we analyze methylation data from the Illumina HumanMethylation450 array and show that genotype at 1,069 highly informative loci, and both alcohol and smoking consumption information, can be derived from the array data. CONCLUSIONS: We conclude that both potentially personally identifying information and substance-use histories can be simultaneously derived from methylation array data. Because access to genetic information about a database subject or one of their relatives is critical to the de-identification process, this risk of de-identification is limited at the current time. We propose that access to genome-wide methylation data be restricted to institutionally approved investigators who accede to data use agreements prohibiting re-identification. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1868-7083-6-28) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4391334
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43913342015-04-10 Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern Philibert, Robert A Terry, Nicolas Erwin, Cheryl Philibert, Winter J Beach, Steven RH Brody, Gene H Clin Epigenetics Research BACKGROUND: Genome-wide methylation arrays are increasingly used tools in studies of complex medical disorders. Because of their expense and potential utility to the scientific community, current federal policy dictates that data from these arrays, like those from genome-wide genotyping arrays, be deposited in publicly available databases. Unlike the genotyping information, access to the expression data is not restricted. An underlying supposition in the current nonrestricted access to methylation data is the belief that protected health and personal identifying information cannot be simultaneously extracted from these arrays. RESULTS: In this communication, we analyze methylation data from the Illumina HumanMethylation450 array and show that genotype at 1,069 highly informative loci, and both alcohol and smoking consumption information, can be derived from the array data. CONCLUSIONS: We conclude that both potentially personally identifying information and substance-use histories can be simultaneously derived from methylation array data. Because access to genetic information about a database subject or one of their relatives is critical to the de-identification process, this risk of de-identification is limited at the current time. We propose that access to genome-wide methylation data be restricted to institutionally approved investigators who accede to data use agreements prohibiting re-identification. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1868-7083-6-28) contains supplementary material, which is available to authorized users. BioMed Central 2014-11-19 /pmc/articles/PMC4391334/ /pubmed/25859287 http://dx.doi.org/10.1186/1868-7083-6-28 Text en © Philibert et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Philibert, Robert A
Terry, Nicolas
Erwin, Cheryl
Philibert, Winter J
Beach, Steven RH
Brody, Gene H
Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern
title Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern
title_full Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern
title_fullStr Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern
title_full_unstemmed Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern
title_short Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern
title_sort methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4391334/
https://www.ncbi.nlm.nih.gov/pubmed/25859287
http://dx.doi.org/10.1186/1868-7083-6-28
work_keys_str_mv AT philibertroberta methylationarraydatacansimultaneouslyidentifyindividualsandconveyprotectedhealthinformationanunrecognizedethicalconcern
AT terrynicolas methylationarraydatacansimultaneouslyidentifyindividualsandconveyprotectedhealthinformationanunrecognizedethicalconcern
AT erwincheryl methylationarraydatacansimultaneouslyidentifyindividualsandconveyprotectedhealthinformationanunrecognizedethicalconcern
AT philibertwinterj methylationarraydatacansimultaneouslyidentifyindividualsandconveyprotectedhealthinformationanunrecognizedethicalconcern
AT beachstevenrh methylationarraydatacansimultaneouslyidentifyindividualsandconveyprotectedhealthinformationanunrecognizedethicalconcern
AT brodygeneh methylationarraydatacansimultaneouslyidentifyindividualsandconveyprotectedhealthinformationanunrecognizedethicalconcern