Massive Predictive Modeling using Oracle R Enterprise

R is fast becoming the lingua franca for analyzing data via statistics, visualization, and predictive analytics. For enterprise-scale data, R users have three main concerns: scalability, performance, and production deployment. Oracle's R-...

Bibliographic Details
Main author: Hornick, Mark F.
Language: eng
Published: 2014
Subjects:
Online access: http://cds.cern.ch/record/1694485
_version_ 1780935965695541248
author Hornick, Mark F.
author_facet Hornick, Mark F.
author_sort Hornick, Mark F.
collection CERN
description R is fast becoming the lingua franca for analyzing data via statistics, visualization, and predictive analytics. For enterprise-scale data, R users have three main concerns: scalability, performance, and production deployment. Oracle's R-based technologies (Oracle R Distribution, Oracle R Enterprise, Oracle R Connector for Hadoop, and the R package ROracle) address these concerns.
In this talk, we introduce Oracle's R technologies, highlighting how each enables R users to achieve scalability and performance while making production deployment of R results a natural outcome of the data analyst's or data scientist's work. The focus then turns to Oracle R Enterprise, with code examples that use the transparency layer and embedded R execution for massive predictive modeling. One goal of massive predictive modeling is to build a model per entity, such as a customer, a zip code, or a simulation, in order to understand behavior and tailor predictions at the entity level. Predictions can then be aggregated, for example, to assess future demand. Massive predictive modeling comes with challenges: partitioning the data effectively, deciding where to store and manage the resulting models, associating models with customers, and handling backup, recovery, and security.
While R has parallel capabilities that help take advantage of clusters of computers, significant coding is usually required to meet the challenges noted above. In this talk, we present the business problem and illustrate how Oracle R Enterprise, one of Oracle's R technologies, facilitates massive predictive modeling in a pair of succinct R scripts. With Oracle R Enterprise, the data, R scripts, and models all reside in Oracle Database.
About the speaker
Mark Hornick, Director, Oracle Advanced Analytics, focuses on Oracle's R technologies. He works with internal and external customers on applying R to scalable advanced analytics applications in Oracle Database, Exadata, and the Big Data Appliance. Mark is coauthor of the books Using R to Unlock the Value of Big Data and Oracle Big Data Handbook, published by Oracle Press. He joined Oracle's Data Mining Technologies group in 1999 through the acquisition of Thinking Machines Corp. Mark also evangelizes and conducts training sessions on Oracle's R technologies internationally, and has presented at conferences including Oracle OpenWorld, Collaborate, BIWA Summit, and useR!. Mark holds a Bachelor's degree from Rutgers University and a Master's degree from Brown University, both in Computer Science.
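The per-entity workflow the abstract describes (partition the data by entity, fit one model per entity, keep the models keyed by entity, then aggregate predictions) can be sketched roughly as follows. This is a minimal illustration in plain Python, not Oracle R Enterprise code and not the speaker's scripts: every function name, the toy data, and the choice of a simple least-squares line as the model are assumptions made for the example.

```python
from collections import defaultdict


def partition_by_entity(rows):
    """Group (entity, x, y) observations by their entity key."""
    groups = defaultdict(list)
    for entity, x, y in rows:
        groups[entity].append((x, y))
    return groups


def fit_line(points):
    """Least-squares fit of y = intercept + slope * x (a stand-in model)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx if sxx else 0.0  # constant x: fall back to the mean
    return {"intercept": mean_y - slope * mean_x, "slope": slope}


def build_models(rows):
    """Fit one model per entity and store it under the entity key."""
    groups = partition_by_entity(rows)
    return {entity: fit_line(points) for entity, points in groups.items()}


def predict(model, x):
    """Score a single entity's model at x."""
    return model["intercept"] + model["slope"] * x


def aggregate_demand(models, x):
    """Sum the per-entity predictions at x, e.g. total future demand."""
    return sum(predict(model, x) for model in models.values())


# Hypothetical toy data: (customer id, period, demand).
rows = [
    ("cust_a", 1, 10.0), ("cust_a", 2, 12.0), ("cust_a", 3, 14.0),
    ("cust_b", 1, 5.0), ("cust_b", 2, 5.0), ("cust_b", 3, 5.0),
]
models = build_models(rows)
total = aggregate_demand(models, 4)
```

Keeping the fitted models in a mapping keyed by entity is what lets each prediction be traced back to its customer. In the setting the abstract describes, that role is played by storage inside Oracle Database rather than by an in-memory dictionary.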
id cern-1694485
institution European Organization for Nuclear Research
language eng
publishDate 2014
record_format invenio
spelling cern-1694485 2022-11-02T22:30:05Z http://cds.cern.ch/record/1694485 eng Hornick, Mark F. Massive Predictive Modeling using Oracle R Enterprise Computing Seminar oai:cds.cern.ch:1694485 2014
spellingShingle Computing Seminar
Hornick, Mark F.
Massive Predictive Modeling using Oracle R Enterprise
title Massive Predictive Modeling using Oracle R Enterprise
title_full Massive Predictive Modeling using Oracle R Enterprise
title_fullStr Massive Predictive Modeling using Oracle R Enterprise
title_full_unstemmed Massive Predictive Modeling using Oracle R Enterprise
title_short Massive Predictive Modeling using Oracle R Enterprise
title_sort massive predictive modeling using oracle r enterprise
topic Computing Seminar
url http://cds.cern.ch/record/1694485
work_keys_str_mv AT hornickmarkf massivepredictivemodelingusingoraclerenterprise