Working in a hands-on learning environment led by our ML expert instructor, students will learn to apply R to simplify predictive modeling with short, simple code; use machine learning to solve problems ranging from small to big data; and build training and testing datasets while applying different classification methods.
Unlimited Duration
January 6, 2021
EXPIRED
Course Description
Big data has become a popular buzzword across many industries. A growing number of people have been exposed to the term and are looking at how to leverage big data in their own businesses to improve sales and profitability. However, collecting, aggregating, and visualizing data is only one part of the equation. Extracting useful information from data is a separate and much more challenging task.
Machine Learning with R Cookbook, Second Edition takes a practical approach to teaching you how to perform machine learning with R. Each chapter is divided into several simple recipes. By following the step-by-step instructions in each recipe, you will be able to construct a predictive model using a variety of machine learning packages. In this book, you will first learn to set up the R environment and use simple R commands to explore data. Next, you will learn how to combine statistical analysis with machine learning and how to assess the models you create, topics that are covered in detail later in the book. You'll also learn how to integrate R and Hadoop to create a big data analysis platform. The detailed illustrations provide all the information required to start applying machine learning to individual projects. With Machine Learning with R Cookbook, machine learning has never been easier.
Course Curriculum
- Practical Machine Learning with R
  - Introduction
  - Downloading and installing R
  - Downloading and installing RStudio
  - Installing and loading packages
  - Understanding basic data structures
  - Basic commands for subsetting
  - Reading and writing data
  - Manipulating data
  - Applying basic statistics
  - Visualizing data
  - Getting a dataset for machine learning
- Data Exploration with Air Quality Datasets
  - Introduction
  - Using air quality dataset
  - Converting attributes to factor
  - Detecting missing values
  - Imputing missing values
  - Exploring and visualizing data
  - Predicting values from datasets
- Analyzing Time Series Data
  - Introduction
  - Looking at time series data
  - Plotting and forecasting time series data
  - Extracting, subsetting, merging, filling, and padding
  - Successive differences and moving averages
  - Exponential smoothing
  - Plotting the autocorrelation function
- Understanding Regression Analysis
  - Introduction
  - Different types of regression
  - Fitting a linear regression model with lm
  - Summarizing linear model fits
  - Using linear regression to predict unknown values
  - Generating a diagnostic plot of a fitted model
  - Fitting multiple regression
  - Summarizing multiple regression
  - Using multiple regression to predict unknown values
  - Fitting a polynomial regression model with lm
  - Fitting a robust linear regression model with rlm
  - Studying a case of linear regression on SLID data
  - Applying the Gaussian model for generalized linear regression
  - Applying the Poisson model for generalized linear regression
  - Applying the Binomial model for generalized linear regression
  - Fitting a generalized additive model to data
  - Visualizing a generalized additive model
  - Diagnosing a generalized additive model
- Classification 1 – Tree, Lazy, and Probabilistic
  - Introduction
  - Preparing the training and testing datasets
  - Building a classification model with recursive partitioning trees
  - Visualizing a recursive partitioning tree
  - Measuring the prediction performance of a recursive partitioning tree
  - Pruning a recursive partitioning tree
  - Handling missing data and split and surrogate variables
  - Building a classification model with a conditional inference tree
  - Control parameters in conditional inference trees
  - Visualizing a conditional inference tree
  - Measuring the prediction performance of a conditional inference tree
  - Classifying data with the k-nearest neighbor classifier
  - Classifying data with logistic regression
  - Classifying data with the Naïve Bayes classifier
- Model Evaluation
  - Introduction
  - Estimating model performance with k-fold cross-validation
  - Estimating model performance with Leave One Out Cross Validation
  - Performing cross-validation with the e1071 package
  - Performing cross-validation with the caret package
  - Ranking the variable importance with the caret package
  - Ranking the variable importance with the rminer package
  - Finding highly correlated features with the caret package
  - Selecting features using the caret package
  - Measuring the performance of the regression model
  - Measuring prediction performance with a confusion matrix
  - Measuring prediction performance using ROCR
  - Comparing an ROC curve using the caret package
  - Measuring performance differences between models with the caret package
- Clustering
  - Introduction
  - Clustering data with hierarchical clustering
  - Cutting trees into clusters
  - Clustering data with the k-means method
  - Drawing a bivariate cluster plot
  - Comparing clustering methods
  - Extracting silhouette information from clustering
  - Obtaining the optimum number of clusters for k-means
  - Clustering data with the density-based method
  - Clustering data with the model-based method
  - Visualizing a dissimilarity matrix
  - Validating clusters externally
- Dimension Reduction
  - Introduction
  - Why reduce dimensions?
  - Performing feature selection with FSelector
  - Performing dimension reduction with PCA
  - Determining the number of principal components using the scree test
  - Determining the number of principal components using the Kaiser method
  - Visualizing multivariate data using biplot
  - Performing dimension reduction with MDS
  - Reducing dimensions with SVD
  - Compressing images with SVD
  - Performing nonlinear dimension reduction with ISOMAP
  - Performing nonlinear dimension reduction with Local Linear Embedding