[FREE] Robust data analysis in R and Matlab

Short Description

Learn robust data analysis with R and Matlab, the key in Machine Learning, Data Science and Statistics.

What you’ll learn

  • Concepts related to Robust Statistics.
  • Performance of outlier detection methods.
  • Learn to differentiate one method from another.
  • Identify the most robust and efficient methods that you should use in practice.
  • Application of the methods with handmade examples.
  • Application of the methods with R and Matlab.

Requirement before enroll

  • Basic statistical knowledge.

Description

Robust data analysis and outlier detection are crucial in Statistics, Data Analysis, Data Mining, Machine Learning, Pattern Recognition, Artificial Intelligence, Classification, Principal Components, Regression, Big Data, and any field related with data. Researchers, students, data analyst, and mostly anyone who is dealing with real data have to be aware of the problem with outliers and they have to know how to deal with this issue.

This course is intended to study the characteristics of the problem, its consequences and learn how to recognise it through the existing approaches. We will deeply study the performance and the properties of the methods to detect outliers in case we have a single random variable (univariate data) or in case we have more than one (multivariate data) . We will see the theoretical properties of the methods and we will apply them to examples. In addition, we are going to see the practical performance with the software R and Matlab, and we will learn the different existing packages in both software for the problem of outlier detection. The implementation and example codes are available in the open Google Drive repository.

You will learn about both classical and recent algorithms for outliers detection:

Univariate space:

  1. Method SD
  2. Z score
  3. Tukey Boxplot
  4. MADe
  5. Modified Z score
  6. Adjusted boxplot

Multivariate space:

  1. Classical Mahalanobis distance 
  2. Robust Mahalanobis distance
  3. MCD
  4. Adjusted MCD
  5. Stahel-Donoho
  6. Kurtosis

Linear regression:

  1. Ordinary least squares (classic method)
  2. Robust regression: LAD, LMS, LTS

In addition, we have two sections of basic concepts that will help you to remember some notions necessary to understand the methods for outlier detection.

Basics I

  1. Sample, population, random variable
  2. Distribution of a random variable
  3. Normal distribution
  4. Fisher chi-square, t-student and F distributions
  5. Estimators

Basics II

  1. Linear algebra
  2. Multivariate variable
  3. Joint and marginal distribution
  4. Independence, covariance and correlation
  5. Multivariate Normal

With this course you will master one of the most important issues today both academically, as in industry and in data analysis. The examples will help you to visualize this importance and as a guide to carry out these analyzes by yourself.

Who this course is for:

  • Data scientist.
  • Data analyst.
  • Students.
  • Researchers.

This course includes

  • 9 hours on-demand video
  • 13 articles
  • 14 downloadable resources

Checkout: https://coursetime.net/exam-concepts-aws-certified-sysops-administrator-associate

Get 3 course worth $129 for FREE

RECENT COURSE

COURSERA COURSE