Cargando…

SparkGA2: Production-quality memory-efficient Apache Spark based genome analysis framework

Due to the rapid decrease in the cost of NGS (Next Generation Sequencing), interest has increased in using data generated from NGS to diagnose genetic diseases. However, the data generated by NGS technology is usually in the order of hundreds of gigabytes per experiment, thus requiring efficient and...

Descripción completa

Detalles Bibliográficos
Autores principales:	Mushtaq, Hamid, Ahmed, Nauman, Al-Ars, Zaid
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2019
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6894754/ https://www.ncbi.nlm.nih.gov/pubmed/31805063 http://dx.doi.org/10.1371/journal.pone.0224784

Ejemplares similares

SparkRA: Enabling Big Data Scalability for the GATK RNA-seq Pipeline with Apache Spark
por: Al-Ars, Zaid, et al.
Publicado: (2020)

Optimizing performance of GATK workflows using Apache Arrow In-Memory data framework
por: Ahmad, Tanveer, et al.
Publicado: (2020)

Apache Spark quick start guide: quickly learn the art of writing efficient big data applications with Apache Spark
por: Mehrotra, Shrey, et al.
Publicado: (2019)

Bioinformatics applications on Apache Spark
por: Guo, Runxin, et al.
Publicado: (2018)

Stream processing with Apache Spark: mastering structured streaming and Spark streaming
por: Maas, Gerard, et al.
Publicado: (2019)

High performance Spark: best practices for scaling and optimizing Apache Spark
por: Karau, Holden, et al.
Publicado: (2017)

Mastering Apache Spark: gain expertise in processing and storing data by using advanced techniques with Apache Spark
por: Frampton, Mike, et al.
Publicado: (2015)

Hands-on deep learning with Apache Spark: build and deploy distributed deep learning applications on Apache Spark
por: Iozzia, Guglielmo
Publicado: (2019)

Efficient iterative virtual screening with Apache Spark and conformal prediction
por: Ahmed, Laeeq, et al.
Publicado: (2018)

Apache Spark for Scalable Physics Analysis
por: Dimakopoulos, Vasileios
Publicado: (2018)

Random Decision Forests on Apache Spark
por: White, Tom
Publicado: (2016)

Framing Apache Spark in life sciences
por: Manconi, Andrea, et al.
Publicado: (2023)

Apache Spark deep learning cookbook: over 80 recipes that streamline deep learning in a distributed environment with Apache Spark
por: Sherif, Ahmed, et al.
Publicado: (2018)

Apache Spark 2: master complex big data processing, stream analytics, and machine learning with Apache Spark
por: Kienzler, Romeo, et al.
Publicado: (2018)

Apache Spark 2.x cookbook: Cloud-ready recipes to do analytics and data science on Apache Spark
por: Yadav, Rishi
Publicado: (2017)

Apache Spark for data science cookbook: overinsightful 90 recipes to get lightning-fast analytics with Apache Spark
por: Chitturi, Padma Priya
Publicado: (2016)

Pro Spark streaming: the zen of real-time analytics using Apache Spark
por: Nabi, Zubair
Publicado: (2016)

Big data processing with Apache Spark: efficiently tackle large datasets and big data analysis with Spark and Python
por: Franco Galeano, Manuel Ignacio
Publicado: (2018)

SPARK-MSNA: Efficient algorithm on Apache Spark for aligning multiple similar DNA/RNA sequences with supervised learning
por: Vineetha, V., et al.
Publicado: (2019)

Beginning Apache Spark 2: with resilient distributed datasets, Spark SQL, structured streaming and Spark machine learning library
por: Luu, Hien
Publicado: (2018)

Scala and Spark for big data analytics: tame big data with Scala and Apache Spark!
por: Karim, Md Rezaul
Publicado: (2017)

Apache Spark graph processing: build, process, and analyze large-scale graphs with Spark
por: Ramamonjison, Rindra, et al.
Publicado: (2015)

Practical Apache Spark: using the Scala API
por: Chellappan, Subhashini, et al.
Publicado: (2018)

CMS Analysis and Data Reduction with Apache Spark
por: Gutsche, Oliver, et al.
Publicado: (2017)

Apache Spark 2.x for Java developers: explore data at scale using the Java APIs of Apache Spark 2.x
por: Gulati, Sourav, et al.
Publicado: (2017)

From Collision to Discovery: Physics Analysis with Apache Spark
por: Motesnitsalis, Vaggelis
Publicado: (2018)

Exploiting Apache Spark platform for CMS computing analytics
por: Meoni, Marco, et al.
Publicado: (2017)

Apache Spark 2.x machine learning cookbook
por: Amirghodsi, Siamak, et al.
Publicado: (2016)

Sams teach yourself Apache Spark in 24 hours
por: Aven, Jeffrey
Publicado: (2017)

Apache Spark usage and deployment models for scientific computing
por: Castro, Diogo, et al.
Publicado: (2019)

Big Data in metagenomics: Apache Spark vs MPI
por: Abuín, José M., et al.
Publicado: (2020)

Halvade somatic: Somatic variant calling with Apache Spark
por: Decap, Dries, et al.
Publicado: (2022)

Frank Kane's Taming big data with Apache Spark and Python: real-world examples to help you analyze large datasets with Apache Spark
por: Kane, Frank
Publicado: (2017)

Apache Spark machine learning blueprints: develop a range of cutting-edge machine learning projects with Apache Spark using this actionable guide
por: Liu, Alex
Publicado: (2016)

Large-scale virtual screening on public cloud resources with Apache Spark
por: Capuccini, Marco, et al.
Publicado: (2017)

Laurelin: Java-native ROOT I/O for Apache Spark
por: Melo, Andrew Malone
Publicado: (2021)

Laurelin: Java-native ROOT I/O for Apache Spark
por: Melo, Andrew, et al.
Publicado: (2021)

PySpark cookbook: over 60 recipes for implementing big data processing and analytics using Apache Spark and Python
por: Lee, Denny, et al.
Publicado: (2018)

A new Apache Spark-based framework for big data streaming forecasting in IoT networks
por: Fernández-Gómez, Antonio M., et al.
Publicado: (2023)

Big Data Technologies and Physics Analysis with Apache Spark (lecture 2)
por: Motesnitsalis, Evangelos
Publicado: (2019)

Cannot write session to /tmp/vufind_sessions/sess_dq3c0cim450a31r462oe1lbo1g