Cargando…

VariantSpark: population scale clustering of genotype information

BACKGROUND: Genomic information is increasingly used in medical practice giving rise to the need for efficient analysis methodology able to cope with thousands of individuals and millions of variants. The widely used Hadoop MapReduce architecture and associated machine learning library, Mahout, prov...

Descripción completa

Detalles Bibliográficos
Autores principales:	O’Brien, Aidan R., Saunders, Neil F. W., Guo, Yi, Buske, Fabian A., Scott, Rodney J., Bauer, Denis C.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2015
Materias:	Software
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4676146/ https://www.ncbi.nlm.nih.gov/pubmed/26651996 http://dx.doi.org/10.1186/s12864-015-2269-7

Ejemplares similares

VariantSpark: Cloud-based machine learning for association study of complex phenotype and large-scale genomic data
por: Bayat, Arash, et al.
Publicado: (2020)

DECA: scalable XHMM exome copy-number variant calling with ADAM and Apache Spark
por: Linderman, Michael D., et al.
Publicado: (2019)

BiSpark: a Spark-based highly scalable aligner for bisulfite sequencing data
por: Soe, Seokjun, et al.
Publicado: (2018)

ADS-HCSpark: A scalable HaplotypeCaller leveraging adaptive data segmentation to accelerate variant calling on Spark
por: Xiao, Anghong, et al.
Publicado: (2019)

IMOS: improved Meta-aligner and Minimap2 On Spark
por: Hadadian Nejad Yousefi, Mostafa, et al.
Publicado: (2019)

SparkBLAST: scalable BLAST processing using in-memory operations
por: de Castro, Marcelo Rodrigo, et al.
Publicado: (2017)

Laboratory Information Management Software for genotyping workflows: applications in high throughput crop genotyping
por: Jayashree, B, et al.
Publicado: (2006)

SparkEC: speeding up alignment-based DNA error correction tools
por: Expósito, Roberto R., et al.
Publicado: (2022)

Minos: variant adjudication and joint genotyping of cohorts of bacterial genomes
por: Hunt, Martin, et al.
Publicado: (2022)

PRS-on-Spark (PRSoS): a novel, efficient and flexible approach for generating polygenic risk scores
por: Chen, Lawrence M., et al.
Publicado: (2018)

pmTM-align: scalable pairwise and multiple structure alignment with Apache Spark and OpenMP
por: Chen, Weiya, et al.
Publicado: (2020)

SLiMScape: a protein short linear motif analysis plugin for Cytoscape
por: O’Brien, Kevin T, et al.
Publicado: (2013)

The theory on and software simulating large-scale genomic data for genotype-by-environment interactions
por: Li, Xiujin, et al.
Publicado: (2021)

Initial development of Supportive care Assessment, Prioritization and Recommendations for Kids (SPARK), a symptom screening and management application
por: Cook, Sadie, et al.
Publicado: (2019)

MAVE-NN: learning genotype-phenotype maps from multiplex assays of variant effect
por: Tareen, Ammar, et al.
Publicado: (2022)

SC3s: efficient scaling of single cell consensus clustering to millions of cells
por: Quah, Fu Xiang, et al.
Publicado: (2022)

VARSCOT: variant-aware detection and scoring enables sensitive and personalized off-target detection for CRISPR-Cas9
por: Wilson, Laurence O. W., et al.
Publicado: (2019)

Gene, Environment and Methylation (GEM): a tool suite to efficiently navigate large scale epigenome wide association studies and integrate genotype and interaction between genotype and environment
por: Pan, Hong, et al.
Publicado: (2016)

PINES: phenotype-informed tissue weighting improves prediction of pathogenic noncoding variants
por: Bodea, Corneliu A., et al.
Publicado: (2018)

XCAVATOR: accurate detection and genotyping of copy number variants from second and third generation whole-genome sequencing experiments
por: Magi, Alberto, et al.
Publicado: (2017)

Bystro: rapid online variant annotation and natural-language filtering at whole-genome scale
por: Kotlar, Alex V., et al.
Publicado: (2018)

T.I.M.S: TaqMan Information Management System, tools to organize data flow in a genotyping laboratory
por: Monnier, Stéphanie, et al.
Publicado: (2005)

Gepoclu: a software tool for identifying and analyzing gene positional clusters in large-scale gene expression analysis
por: Dottorini, Tania, et al.
Publicado: (2011)

PMFFRC: a large-scale genomic short reads compression optimizer via memory modeling and redundant clustering
por: Sun, Hui, et al.
Publicado: (2023)

Cpipe: a shared variant detection pipeline designed for diagnostic settings
por: Sadedin, Simon P., et al.
Publicado: (2015)

SCANPY: large-scale single-cell gene expression data analysis
por: Wolf, F. Alexander, et al.
Publicado: (2018)

Exploratory analysis of genomic segmentations with Segtools
por: Buske, Orion J, et al.
Publicado: (2011)

WikiHyperGlossary (WHG): an information literacy technology for chemistry documents
por: Bauer, Michael A, et al.
Publicado: (2015)

Pangenomic genotyping with the marker array
por: Mun, Taher, et al.
Publicado: (2023)

clusterMaker: a multi-algorithm clustering plugin for Cytoscape
por: Morris, John H, et al.
Publicado: (2011)

optCluster: An R Package for Determining the Optimal Clustering Algorithm
por: Sekula, Michael, et al.
Publicado: (2017)

clusterMaker2: a major update to clusterMaker, a multi-algorithm clustering app for Cytoscape
por: Utriainen, Maija, et al.
Publicado: (2023)

miRAFinder and GeneAFinder scripts: large-scale searching for miRNA and related information in indexed literature abstracts
por: Berillo, Olga, et al.
Publicado: (2014)

CCRaVAT and QuTie - enabling analysis of rare variants in large-scale case control and quantitative trait association studies
por: Lawrence, Robert, et al.
Publicado: (2010)

MTG-Link: leveraging barcode information from linked-reads to assemble specific loci
por: Guichard, Anne, et al.
Publicado: (2023)

Partitioning of copy-number genotypes in pedigrees
por: Perreault, Louis-Philippe Lemieux, et al.
Publicado: (2010)

RiboA: a web application to identify ribosome A-site locations in ribosome profiling data
por: Shao, Danying, et al.
Publicado: (2021)

rstoolbox - a Python library for large-scale analysis of computational protein design data and structural bioinformatics
por: Bonet, Jaume, et al.
Publicado: (2019)

An heuristic filtering tool to identify phenotype-associated genetic variants applied to human intellectual disability and canine coat colors
por: Broeckx, Bart J. G., et al.
Publicado: (2015)

MotifCluster: an interactive online tool for clustering and visualizing sequences using shared motifs
por: Hamady, Micah, et al.
Publicado: (2008)

Cannot write session to /tmp/vufind_sessions/sess_gok9gjqms2o4cc3jce8ncbc95d