Cargando…

Data-Intensive Text Processing with MapReduce

Our world is being revolutionized by data-driven methods: access to large amounts of data has generated new insights and opened exciting new opportunities in commerce, science, and computing applications. Processing the enormous quantities of data necessary for these advances requires large clusters...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Jimmy, Dyer, Chris
Lenguaje:eng
Publicado: Morgan & Claypool Publishers 2010
Materias:
Acceso en línea:http://cds.cern.ch/record/1486557
Descripción
Sumario:Our world is being revolutionized by data-driven methods: access to large amounts of data has generated new insights and opened exciting new opportunities in commerce, science, and computing applications. Processing the enormous quantities of data necessary for these advances requires large clusters, making distributed computing paradigms more crucial than ever. MapReduce is a programming model for expressing distributed computations on massive datasets and an execution framework for large-scale data processing on clusters of commodity servers. The programming model provides an easy-to-underst