Springer

Big Data Processing Using Spark in Cloud

Product Code: 9789811305498
ISBN13: 9789811305498
Condition: New
$117.02
The book describes the emergence of big data technologies and the role of Spark in the big data stack. It compares Spark with Hadoop and identifies the shortcomings of Hadoop that Spark overcomes. The book focuses mainly on the architecture of Spark and on Spark RDDs: how RDDs complement the immutable nature of big data through lazy evaluation, caching, and type inference. It also addresses advanced topics in Spark, starting with the basics of Scala and the core Spark framework, and exploring Spark DataFrames, machine learning with MLlib, graph analytics with GraphX, and real-time processing with Apache Kafka, Amazon Kinesis, and Azure Event Hubs. It then goes on to investigate Spark using PySpark and R. Focusing on the current big data stack, the book examines how Spark interacts with current big data tools, with Spark as the core processing layer for all types of data. The book is intended for data engineers and scientists working on massive datasets and big data technologies in the cloud. In addition to industry professionals, it is helpful for aspiring data processing professionals and students working in big data processing and cloud computing environments.


Author: Mamta Mittal, Valentina E. Balas, Lalit Mohan Goyal, Raghvendra Kumar
Publisher: Springer
Publication Date: Jun 26, 2018
Number of Pages: N/A
Language: English
Binding: Hardcover
ISBN-10: 9811305498
ISBN-13: 9789811305498
