Skip to main content

O'Reilly Media

Parallel R: Data Analysis in the Distributed World

No reviews yet
Product Code: 9781449309923
ISBN13: 9781449309923
Condition: New
$26.07

Parallel R: Data Analysis in the Distributed World

$26.07
 

It's tough to argue with R as a high-quality, cross-platform, open source statistical software product--unless you're in the business of crunching Big Data. This concise book introduces you to several strategies for using R to analyze large datasets, including three chapters on using R and Hadoop together. You'll learn the basics of Snow, Multicore, Parallel, Segue, RHIPE, and Hadoop Streaming, including how to find them, how to use them, when they work well, and when they don't.

With these packages, you can overcome R's single-threaded nature by spreading work across multiple CPUs, or offloading work to multiple machines to address R's memory barrier.

  • Snow: works well in a traditional cluster environment
  • Multicore: popular for multiprocessor and multicore computers
  • Parallel: part of the upcoming R 2.14.0 release
  • R+Hadoop: provides low-level access to a popular form of cluster computing
  • RHIPE: uses Hadoop's power with R's language and interactive shell
  • Segue: lets you use Elastic MapReduce as a backend for lapply-style operations



Author: Q. McCallum
Publisher: O'Reilly Media
Publication Date: Nov 29, 2011
Number of Pages: 120 pages
Binding: Paperback or Softback
ISBN-10: 1449309925
ISBN-13: 9781449309923
 

Customer Reviews

This product hasn't received any reviews yet. Be the first to review this product!

Faster Shipping

Delivery in 3-8 days

Easy Returns

14 days returns

Discount upto 30%

Monthly discount on books

Outstanding Customer Service

Support 24 hours a day