VIVI Electric Bike Ride Show

Introduction to Sparklyr for Data Science [Online Code]

Number of Videos: 1.5 hours – 21 lessonsAuthor: Kelly O’BriantUser Level: Intermediate Join data scientist Kelly O’Briant for an exploration of sparklyr, the package from RStudio which provides an interface to Apache Spark from R. For many data scientists who rely on R for their work, the paradigm shift from local in-memory computations to scalable

Number of Videos: 1.5 hours – 21 lessons
Author: Kelly O’Briant
User Level: Intermediate

Join data scientist Kelly O’Briant for an exploration of sparklyr, the package from RStudio which provides an interface to Apache Spark from R. For many data scientists who rely on R for their work, the paradigm shift from local in-memory computations to scalable distributed data processing can be complicated to navigate. This course provides an easy-to-follow R based method for working with big data. You’ll connect to Spark, run some sparklyr code, and explore some practical applications of Spark SQL and sparklyr functionality. You’ll wrap up by performing some exploratory analysis and feature generation using a Kaggle competition data set. Learners should have a moderate level of experience with doing data science tasks or workflows in R.

Explore the benefits and limitations of choosing sparklyr for distributed computing in R Discover how to interact with data in Apache Spark through sparklyr and Spark SQL Understand how to connect to Spark locally or to a remote Spark cluster Learn to perform exploratory data analysis in Spark using sparklyr, dplyr, and DBI Master the differences between working with data frames in R versus Spark Understand how to build data products in R that don’t rely on storing big data locally

Kelly O’Briant is a data scientist and lead R developer with Washington DC based B23 LLC. She holds degrees in Computational Science and Informatics from George Mason University, and Bioinformatics from Virginia Commonwealth University. Kelly is a founder and co-organizer of the Washington DC chapter of R-Ladies Global. She gives talks on R cloud computing, R data products, and sparklyr at R-Ladies meetups and R conferences.

PC Minimum System Requirements:PC Recommended System Requirements:Processor:   AnyRAM:   AnyHard Disk:   1GBVideo Card:   AnySupported OS:   Windows 10, Windows 8, Windows 8.1, Windows 7, Windows Vista, Windows XP, Windows 2000, Windows

Product Features

  • Learn Introduction to Sparklyr for Data Science from a professional trainer on your own time at your own desk.
  • This visual training method offers users increased retention and accelerated learning.
  • Breaks even the most complex applications down into simplistic steps
  • Comes with Extensive Working Files

Automation templates