Cargando…

Synggen: fast and data-driven generation of synthetic heterogeneous NGS cancer data

SUMMARY: Whole-exome and targeted sequencing are widely utilized both in translational cancer genomics and in the setting of precision medicine. The benchmarking of computational methods and tools that are in continuous development is fundamental for the correct interpretation of somatic genomic pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Scandino, Riccardo, Calabrese, Federico, Romanel, Alessandro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825741/
https://www.ncbi.nlm.nih.gov/pubmed/36484701
http://dx.doi.org/10.1093/bioinformatics/btac792
Descripción
Sumario:SUMMARY: Whole-exome and targeted sequencing are widely utilized both in translational cancer genomics and in the setting of precision medicine. The benchmarking of computational methods and tools that are in continuous development is fundamental for the correct interpretation of somatic genomic profiling results. To this aim we developed synggen, a tool for the fast generation of large-scale realistic and heterogeneous cancer whole-exome and targeted sequencing synthetic datasets, which enables the incorporation of phased germline single nucleotide polymorphisms and complex allele-specific somatic genomic events. Synggen performances and effectiveness in generating synthetic cancer data are shown across different scenarios and considering different platforms with distinct characteristics. AVAILABILITY AND IMPLEMENTATION: synggen is freely available at https://bitbucket.org/CibioBCG/synggen/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.