Cargando…

Filtering de novo indels in parent-offspring trios

BACKGROUND: Identification of de novo indels from whole genome or exome sequencing data of parent-offspring trios is a challenging task in human disease studies and clinical practices. Existing computational approaches usually yield high false positive rate. RESULTS: In this study, we developed a gr...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yongzhuang, Liu, Jian, Wang, Yadong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7739476/
https://www.ncbi.nlm.nih.gov/pubmed/33323105
http://dx.doi.org/10.1186/s12859-020-03900-z
Descripción
Sumario:BACKGROUND: Identification of de novo indels from whole genome or exome sequencing data of parent-offspring trios is a challenging task in human disease studies and clinical practices. Existing computational approaches usually yield high false positive rate. RESULTS: In this study, we developed a gradient boosting approach for filtering de novo indels obtained by any computational approaches. Through application on the real genome sequencing data, our approach showed it could significantly reduce the false positive rate of de novo indels without a significant compromise on sensitivity. CONCLUSIONS: The software DNMFilter_Indel was written in a combination of Java and R and freely available from the website at https://github.com/yongzhuang/DNMFilter_Indel.