Cargando…

An integrative probabilistic model for identification of structural variation in sequencing data

Paired-end sequencing is a common approach for identifying structural variation (SV) in genomes. Discrepancies between the observed and expected alignments indicate potential SVs. Most SV detection algorithms use only one of the possible signals and ignore reads with multiple alignments. This result...

Descripción completa

Detalles Bibliográficos
Autores principales: Sindi, Suzanne S, Önal, Selim, Peng, Luke C, Wu, Hsin-Ta, Raphael, Benjamin J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3439973/
https://www.ncbi.nlm.nih.gov/pubmed/22452995
http://dx.doi.org/10.1186/gb-2012-13-3-r22
Descripción
Sumario:Paired-end sequencing is a common approach for identifying structural variation (SV) in genomes. Discrepancies between the observed and expected alignments indicate potential SVs. Most SV detection algorithms use only one of the possible signals and ignore reads with multiple alignments. This results in reduced sensitivity to detect SVs, especially in repetitive regions. We introduce GASVPro, an algorithm combining both paired read and read depth signals into a probabilistic model that can analyze multiple alignments of reads. GASVPro outperforms existing methods with a 50 to 90% improvement in specificity on deletions and a 50% improvement on inversions. GASVPro is available at http://compbio.cs.brown.edu/software.