Big Data With Spark Scala
Hadoop Overview, Scala, Apache Spark & Practicals on Real-Time Data
ABOUT TRAINING: This Hadoop training is designed to make aware about
Course Highlights:
HADOOP OVERVIEW
Need of Hadoop technology
Overview of Data centers and Cluster
Hadoop Cluster and Racks in detail
Learning Ubuntu for Hadoop
Overview of Hadoop tools
Overview of MapReduce
Big Data Concepts
Distributed network and computation
Challenges in data management and control
Introduction to Big Data
Types of data in detail
Sources of Big Data
Concept of Streaming data
HDFS and Basic Hadoop Commands
APACHE SPARK
Download & Install Apache Spark 2.x
Introduction to Spark Shell
Spark RDDs Part 1
Spark RDDs Part 2
RDD Transformations
Introduction to IDEs
Creating Spark Project with Eclipse Maven
Creating Spark Context and RDD in Spark 1.x Style
Creating Spark Context and RDD in Spark 2.x Style
RDD Basics: Working with CSV Files
RDD Basics: Saving RDD to a Text File
Introduction to Spark DataFrame
Creating DataFrame using Spark 1.x Style
Creating DataFrame using Spark 2.x Style
Creating DataFrame with CSV File in Spark 1.x Style
Creating DataFrame with CSV File in Spark 2.x Style
Creating Multiple Spark Contexts in Spark 1.x
Creating Multiple Spark Sessions in Spark 2.x
Working with Different File Formats – ORC, JSON, Parquet
Working with Avro Files
Applying Own Schema to the DataFrame
Basic Operations on DataFrame
Temporary Tables Spark 1.x Style
Temporary Tables Spark 2.x Style
Introduction to Datasets
Creating a Dataset
Basic Dataset Operations
Dataset vs DataFrame Performance
SCALA
Scala Overview
Environment Setup
Basic Syntax
Data Types
Variables
Classes & Objects
Access Modifiers
Operators
IF ELSE
Loop Statements
Functions
Closures
Strings
Arrays
Collections
Traits
Pattern Matching