Cargando…

Annotating pathogenic non-coding variants in genic regions

Identifying the underlying causes of disease requires accurate interpretation of genetic variants. Current methods ineffectively capture pathogenic non-coding variants in genic regions, resulting in overlooking synonymous and intronic variants when searching for disease risk. Here we present the Tra...

Descripción completa

Detalles Bibliográficos
Autores principales: Gelfman, Sahar, Wang, Quanli, McSweeney, K. Melodi, Ren, Zhong, La Carpia, Francesca, Halvorsen, Matt, Schoch, Kelly, Ratzon, Fanni, Heinzen, Erin L., Boland, Michael J., Petrovski, Slavé, Goldstein, David B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5550444/
https://www.ncbi.nlm.nih.gov/pubmed/28794409
http://dx.doi.org/10.1038/s41467-017-00141-2
Descripción
Sumario:Identifying the underlying causes of disease requires accurate interpretation of genetic variants. Current methods ineffectively capture pathogenic non-coding variants in genic regions, resulting in overlooking synonymous and intronic variants when searching for disease risk. Here we present the Transcript-inferred Pathogenicity (TraP) score, which uses sequence context alterations to reliably identify non-coding variation that causes disease. High TraP scores single out extremely rare variants with lower minor allele frequencies than missense variants. TraP accurately distinguishes known pathogenic and benign variants in synonymous (AUC = 0.88) and intronic (AUC = 0.83) public datasets, dismissing benign variants with exceptionally high specificity. TraP analysis of 843 exomes from epilepsy family trios identifies synonymous variants in known epilepsy genes, thus pinpointing risk factors of disease from non-coding sequence data. TraP outperforms leading methods in identifying non-coding variants that are pathogenic and is therefore a valuable tool for use in gene discovery and the interpretation of personal genomes.