• No products in the cart.

ratings 

Machine Learning Essentials with Python and Spark is a foundation-level, three-day hands-on course that teaches you core skills and concepts in modern machine learning at scale practices, leveraging Python and Spark.

PRIVATE
Course Access

Unlimited Duration

Last Updated

March 3, 2021

Students Enrolled

Total Reviews

Posted by
Certification

This “skills-centric” course is about 50% hands-on lab and 50% lecture, with extensive practical exercises designed to reinforce fundamental skills, concepts and best practices taught throughout the course. In this course you will learn about:

• Machine Learning (ML) Overview

• Machine Learning in Python and Spark

• Machine Learning Concepts

• Feature Engineering (FE)

• Linear regression

• Logistic Regression

• Classification: SVM (Supervised Vector Machines)

• Classification: Decision Trees & Random Forests

• Classification: Naive Bayes

• Clustering (K-Means)

• Principal Component Analysis (PCA)

• Recommendations (Collaborative filtering)

• Performance 

• Time Permitting: Capstone Project


Course Curriculum

    • Machine Learning landscape 00:00:00
    • Machine Learning applications 00:00:00
    • Understanding ML algorithms & models 00:00:00
    • Spark ML Overview 00:00:00
    • Introduction to Jupyter notebooks 00:00:00
    • Working with Jupyter + Python + Spark 00:00:00
    • Statistics Primer 00:00:00
    • Covariance, Correlation, Covariance Matrix 00:00:00
    • Errors, Residuals 00:00:00
    • Overfitting / Underfitting 00:00:00
    • Cross-validation, bootstrapping 00:00:00
    • Confusion Matrix 00:00:00
    • ROC curve, Area Under Curve (AUC) 00:00:00
    • Preparing data for ML 00:00:00
    • Extracting features, enhancing data 00:00:00
    • Data cleanup 00:00:00
    • Visualizing Data 00:00:00
    • Lab: data cleanup 00:00:00
    • Lab: visualizing data 00:00:00
    • Simple Linear Regression 00:00:00
    • Multiple Linear Regression 00:00:00
    • Running LR 00:00:00
    • Evaluating LR model performance 00:00:00
    • Use case: House price estimates 00:00:00
    • Understanding Logistic Regression 00:00:00
    • Calculating Logistic Regression 00:00:00
    • Evaluating model performance 00:00:00
    • Use case: credit card application, college admissions 00:00:00
    • SVM concepts and theory 00:00:00
    • SVM with kernel 00:00:00
    • Use case: Customer churn data 00:00:00
    • Theory behind trees 00:00:00
    • Classification and Regression Trees (CART) 00:00:00
    • Random Forest concepts 00:00:00
    • Use case: predicting loan defaults, estimating election contributions 00:00:00
    • Theory 00:00:00
    • Use case: spam filtering 00:00:00
    • Theory behind K-Means 00:00:00
    • Running K-Means algorithm 00:00:00
    • Estimating the performance 00:00:00
    • Use case: grouping cars data, grouping shopping data 00:00:00
    • Understanding PCA concepts 00:00:00
    • PCA applications 00:00:00
    • Running a PCA algorithm 00:00:00
    • Evaluating results 00:00:00
    • Use case: analyzing retail shopping data 00:00:00
    • Recommender systems overview 00:00:00
    • Collaborative Filtering concepts 00:00:00
    • Use case: movie recommendations, music recommendations 00:00:00
    • Best practices for scaling and optimizing Apache Spark 00:00:00
    • Memory caching 00:00:00
    • Testing and validation 00:00:00
    • Hands-on guided workshop utilizing skills learned throughout the course 00:00:00

    Course Reviews

    Profile Photo
    ashar hafeez
    0
    61

    Students

    About Instructor

    Pak

    Course Events

    [wplms_eventon_events]

    More Courses by Insturctor

    © 2021 Ernesto.  All rights reserved.  
    X