Big Data Training

This is what differentiates our Big Data training program:
  • Our program always evolves as technology does
  • You can repeat the training program as many times as you would like (at no additional cost)
  • Lifetime access to latest slides and virtual machines
  • More relevant content at half the price of competitors
  • Both online and classroom options available
  • Complimentary signed copy of Spark Cookbook by Packt Publishing written by Rishi Yadav

Schedule

Next Session: May 6th 2017 8:00 – 8:30 AM – Free Breakfast + Coffee + Q/A session
9:50 AM – 10:00 AM – Break
12:00 – 1:00 AM – Free Lunch
2:50 PM – 3:00 PM – Break

Big Data Meal

7 course meal-menu (Two Consecutive Weekends)

Course 1: Introduction to Big Data and HDFSDay 1: 8:30 AM – 12:00 PM

  • Data Driven Thinking
  • Big Data 101
  • Big Data Storage: open-source
  • Hadoop Distributed File System (HDFS)
  • HDFS: Blocks
  • HDFS: NameNode
  • HDFS: DataNode
  • HDFS: Checkpointing
  • HDFS: HA, Federation and Snapshots
  • HDFS: Write & Read Path
  • HDFS: Commands
  • Big Data Storage: Public Cloud
  • AWS: S3
  • Azure Blog Storage

Course 2: Hadoop ComputeDay 1: 1:00 PM – 3:00 PM

  • Classic MapReduce
  • YARN
  • Input Formats

Course 3: Hadoop ToolsDay 1: 3 PM -5 PM
Day 2: 8:30 AM – 11 AM

  • Hive
  • Kafka

Course 4: NoSQLDay 2: 11:00 AM – 12:00 PM
Day 2: 1:00 PM – 3:00 PM

  • HBase
  • Cassandra

Course 5: Spark Basics and SparkSQLDay 2: 3:00 – 5:00 PM
Day 3: 8:30 AM – 12:00 PM

  • Spark Basic
  • DeTour: Introduction to Scala
  • Spark Usage
  • SparkSQL: DataFrames & Datasets
  • Inferring schema using case classes
  • Loading and saving data using Parquet
  • Loading and saving data using JSON

Course 6: Spark Streaming & Real-Time AnalyticsDay 3: 1:00 PM – 5:00 PM

  • Streaming Basics
  • Discretized Stream (DStream)
  • Word Count using Streaming
  • Streaming Twitter Data

Course 7: Spark MLlib & Machine LearningDay 4: 8:30 AM – 12:00 PM
Day 4: 1:00 PM – 4:00 PM

  • Machine Learning Basics
  • Supervised Learning
  • Unsupervised Learning
  • Recommendation Engines

Bonus : Public Cloud #PaaSDay 4: 4:00 PM – 5:00 PM

  • Serverless Architecture
  • Security in the Cloud
  • The Road Ahead