21 hours (usually 3 days including breaks)
Python is a high-level programming language famous for its clear syntax and code readibility. Spark is a data processing engine used in querying, analyzing, and transforming big data. PySpark allows users to interface Spark with Python.
In this instructor-led, live training, participants will learn how to use Python and Spark together to analyze big data as they work on hands-on exercises.
By the end of this training, participants will be able to:
Format of the course
Understanding Big Data
Overview of Spark
Overview of Python
Overview of PySpark
Setting Up Python with Spark
Setting Up PySpark
Using Amazon Web Services (AWS) EC2 Instances for Spark
Setting Up Databricks
Setting Up the AWS EMR Cluster
Learning the Basics of Python Programming
Learning the Basics of Spark DataFrame
Working on a Spark DataFrame Project Exercise
Understanding Machine Learning with MLlib
Working with MLlib, Spark, and Python for Machine Learning
Understanding Random Forests and Decision Trees
Working with K-means Clustering
Working with Recommender Systems
Implementing Natural Language Processing
Streaming with Spark on Python
Pawel Kozikowski - GE Medical Systems Polska Sp. Zoo
We are looking to expand our presence in Norway!
If you are interested in running a high-tech, high-quality training and consulting business.Apply now!