Big Data Analytics with R and Hadoop

If you’re an R developer looking to harness the power of big data analytics with Hadoop, then this book tells you everything you need to integrate the two. You’ll end up capable of building a data analytics engine with huge potential.
Preview in Mapt

Big Data Analytics with R and Hadoop

Vignesh Prajapati

1 customer reviews
If you’re an R developer looking to harness the power of big data analytics with Hadoop, then this book tells you everything you need to integrate the two. You’ll end up capable of building a data analytics engine with huge potential.
Mapt Subscription
FREE
$29.99/m after trial
eBook
$21.00
RRP $29.99
Save 29%
Print + eBook
$49.99
RRP $49.99
What do I get with a Mapt Pro subscription?
  • Unlimited access to all Packt’s 5,000+ eBooks and Videos
  • Early Access content, Progress Tracking, and Assessments
  • 1 Free eBook or Video to download and keep every month after trial
What do I get with an eBook?
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
What do I get with Print & eBook?
  • Get a paperback copy of the book delivered to you
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
What do I get with a Video?
  • Download this Video course in MP4 format
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
$0.00
$21.00
$49.99
$29.99 p/m after trial
RRP $29.99
RRP $49.99
Subscription
eBook
Print + eBook
Start 30 Day Trial

Frequently bought together


Big Data Analytics with R and Hadoop Book Cover
Big Data Analytics with R and Hadoop
$ 29.99
$ 21.00
Practical Big Data Analytics Book Cover
Practical Big Data Analytics
$ 35.99
$ 25.20
Buy 2 for $35.00
Save $30.98
Add to Cart

Book Details

ISBN 139781782163282
Paperback238 pages

Book Description

Big data analytics is the process of examining large amounts of data of a variety of types to uncover hidden patterns, unknown correlations, and other useful information. Such information can provide competitive advantages over rival organizations and result in business benefits, such as more effective marketing and increased revenue. New methods of working with big data, such as Hadoop and MapReduce, offer alternatives to traditional data warehousing.

Big Data Analytics with R and Hadoop is focused on the techniques of integrating R and Hadoop by various tools such as RHIPE and RHadoop. A powerful data analytics engine can be built, which can process analytics algorithms over a large scale dataset in a scalable manner. This can be implemented through data analytics operations of R, MapReduce, and HDFS of Hadoop.

You will start with the installation and configuration of R and Hadoop. Next, you will discover information on various practical data analytics examples with R and Hadoop. Finally, you will learn how to import/export from various data sources to R. Big Data Analytics with R and Hadoop will also give you an easy understanding of the R and Hadoop connectors RHIPE, RHadoop, and Hadoop streaming.

Table of Contents

Chapter 1: Getting Ready to Use R and Hadoop
Installing R
Installing RStudio
Understanding the features of R language
Installing Hadoop
Understanding Hadoop features
Learning the HDFS and MapReduce architecture
Understanding Hadoop subprojects
Summary
Chapter 2: Writing Hadoop MapReduce Programs
Understanding the basics of MapReduce
Introducing Hadoop MapReduce
Understanding the Hadoop MapReduce fundamentals
Writing a Hadoop MapReduce example
Learning the different ways to write Hadoop MapReduce in R
Summary
Chapter 3: Integrating R and Hadoop
Introducing RHIPE
Introducing RHadoop
Summary
Chapter 4: Using Hadoop Streaming with R
Understanding the basics of Hadoop streaming
Understanding how to run Hadoop streaming with R
Exploring the HadoopStreaming R package
Summary
Chapter 5: Learning Data Analytics with R and Hadoop
Understanding the data analytics project life cycle
Understanding data analytics problems
Summary
Chapter 6: Understanding Big Data Analysis with Machine Learning
Introduction to machine learning
Supervised machine-learning algorithms
Unsupervised machine learning algorithm
Recommendation algorithms
Summary
Chapter 7: Importing and Exporting Data from Various DBs
Learning about data files as database
Understanding MySQL
Understanding Excel
Understanding MongoDB
Understanding SQLite
Understanding PostgreSQL
Understanding Hive
Understanding HBase
Summary

What You Will Learn

  • Integrate R and Hadoop via RHIPE, RHadoop, and Hadoop streaming
  • Develop and run a MapReduce application that runs with R and Hadoop
  • Handle HDFS data from within R using RHIPE and RHadoop
  • Run Hadoop streaming and MapReduce with R
  • Import and export from various data sources to R

Authors

Table of Contents

Chapter 1: Getting Ready to Use R and Hadoop
Installing R
Installing RStudio
Understanding the features of R language
Installing Hadoop
Understanding Hadoop features
Learning the HDFS and MapReduce architecture
Understanding Hadoop subprojects
Summary
Chapter 2: Writing Hadoop MapReduce Programs
Understanding the basics of MapReduce
Introducing Hadoop MapReduce
Understanding the Hadoop MapReduce fundamentals
Writing a Hadoop MapReduce example
Learning the different ways to write Hadoop MapReduce in R
Summary
Chapter 3: Integrating R and Hadoop
Introducing RHIPE
Introducing RHadoop
Summary
Chapter 4: Using Hadoop Streaming with R
Understanding the basics of Hadoop streaming
Understanding how to run Hadoop streaming with R
Exploring the HadoopStreaming R package
Summary
Chapter 5: Learning Data Analytics with R and Hadoop
Understanding the data analytics project life cycle
Understanding data analytics problems
Summary
Chapter 6: Understanding Big Data Analysis with Machine Learning
Introduction to machine learning
Supervised machine-learning algorithms
Unsupervised machine learning algorithm
Recommendation algorithms
Summary
Chapter 7: Importing and Exporting Data from Various DBs
Learning about data files as database
Understanding MySQL
Understanding Excel
Understanding MongoDB
Understanding SQLite
Understanding PostgreSQL
Understanding Hive
Understanding HBase
Summary

Book Details

ISBN 139781782163282
Paperback238 pages
Read More
From 1 reviews

Read More Reviews

Recommended for You

Big Data Analytics with R Book Cover
Big Data Analytics with R
$ 43.99
$ 30.80
R for Data Science Cookbook Book Cover
R for Data Science Cookbook
$ 35.99
$ 25.20
Learning Data Mining with R Book Cover
Learning Data Mining with R
$ 29.99
$ 21.00
R for Data Science Book Cover
R for Data Science
$ 29.99
$ 21.00
Practical Machine Learning Book Cover
Practical Machine Learning
$ 37.99
$ 26.60
Mastering pandas for Finance Book Cover
Mastering pandas for Finance
$ 35.99
$ 25.20