Cargando…
VariantSpark: population scale clustering of genotype information
BACKGROUND: Genomic information is increasingly used in medical practice giving rise to the need for efficient analysis methodology able to cope with thousands of individuals and millions of variants. The widely used Hadoop MapReduce architecture and associated machine learning library, Mahout, prov...
Autores principales: | O’Brien, Aidan R., Saunders, Neil F. W., Guo, Yi, Buske, Fabian A., Scott, Rodney J., Bauer, Denis C. |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4676146/ https://www.ncbi.nlm.nih.gov/pubmed/26651996 http://dx.doi.org/10.1186/s12864-015-2269-7 |
Ejemplares similares
-
VariantSpark: Cloud-based machine learning for association study of complex phenotype and large-scale genomic data
por: Bayat, Arash, et al.
Publicado: (2020) -
DECA: scalable XHMM exome copy-number variant calling with ADAM and Apache Spark
por: Linderman, Michael D., et al.
Publicado: (2019) -
BiSpark: a Spark-based highly scalable aligner for bisulfite sequencing data
por: Soe, Seokjun, et al.
Publicado: (2018) -
ADS-HCSpark: A scalable HaplotypeCaller leveraging adaptive data segmentation to accelerate variant calling on Spark
por: Xiao, Anghong, et al.
Publicado: (2019) -
IMOS: improved Meta-aligner and Minimap2 On Spark
por: Hadadian Nejad Yousefi, Mostafa, et al.
Publicado: (2019)