The book is currently out of stock

Guide to High Performance Distributed Computing

Case Studies with Hadoop, Scalding and Spark

Authors

M. Srinivasa Sarma

Book rating

Add rating

Parameters

321 pages
12 hours of reading

More about the book

This timely text/reference describes the development and implementation of large-scale distributed processing systems using open source tools and technologies. Comprehensive in scope, the book presents state-of-the-art material on building high performance distributed computing systems, providing practical guidance and best practices as well as describing theoretical software frameworks. Features: describes the fundamentals of building scalable software systems for large-scale data processing in the new paradigm of high performance distributed computing; presents an overview of the Hadoop ecosystem, followed by step-by-step instruction on its installation, programming and execution; Reviews the basics of Spark, including resilient distributed datasets, and examines Hadoop streaming and working with Scalding; Provides detailed case studies on approaches to clustering, data classification and regression analysis; Explains the process of creating a working recommender system using Scalding and Spark.

Book purchase

Guide to High Performance Distributed Computing, M. Srinivasa Sarma

Language
Released: 2015
product-detail.submit-box.info.binding: (Hardcover)

We’ll email you as soon as we track it down.

Payment methods

4.0

Very Good

1 Ratings

Add rating

We’re missing your review here.

Edition detailsSuggest a correction

Title: Guide to High Performance Distributed Computing
Subtitle: Case Studies with Hadoop, Scalding and Spark
Language: English
Authors: M. Srinivasa Sarma
Publisher: Springer
Released: 2015
Format: Hardcover
Pages: 321
ISBN10: 3319134965
ISBN13: 9783319134963
Series
Tags: Non-Fiction, Technology & Engineering, Computers & Internet
Rating: 4 out of 5
Description: This timely text/reference describes the development and implementation of large-scale distributed processing systems using open source tools and technologies. Comprehensive in scope, the book presents state-of-the-art material on building high performance distributed computing systems, providing practical guidance and best practices as well as describing theoretical software frameworks. Features: describes the fundamentals of building scalable software systems for large-scale data processing in the new paradigm of high performance distributed computing; presents an overview of the Hadoop ecosystem, followed by step-by-step instruction on its installation, programming and execution; Reviews the basics of Spark, including resilient distributed datasets, and examines Hadoop streaming and working with Scalding; Provides detailed case studies on approaches to clustering, data classification and regression analysis; Explains the process of creating a working recommender system using Scalding and Spark.