Learn the basics of

Probability & Statistical Analysis

The professional diploma in probability and statistical analysis module 1, will aim to give you a thorough understanding of all the fundamental concepts you need as an analyst. This module will include crucial theories, such as the Central Limit Theorem and Bayes Theorem, needed for those wanted to understand data in more depth. SAS Studio will cater to the practical understanding of the theoretical concepts studied throughout the module. This module is the building block for anyone who wishes to gain a more thorough understanding of the the behavior behind the trend. 



All levels



Course details

Bootstrap Accordion with Plus Minus Icon


Diploma in Probability and Statistical Analysis

1.Introduction to Statistical Analysis and SAS

In this lesson, you will be introduced to statistical analysis. We will discuss what the difference between this course and the data analysis course is and who this course is aimed at. Thereafter, we will introduce the tool we will utilize throughout this module, called SAS. The lesson ends with a fun practical demonstration in SAS.

2.Getting to Know Your Data

This lesson will be all about study design, data types and a sneak peak into summarizing data. Understanding the study design and the type of scales that are used to measure data, is a crucial step in analysing the data accurately. only after we know the pros and cons of the way the data was gathered, can we start describing the data.

3.Summarising Data

This lesson will mainly focus on the different methods of summarising data. The previous lesson introduced measures of central tendency to the student. Lesson 3 will elaborate on that concept with measures of spread as well as various ways of visualising data through plots in SAS Studio. Each topic will be consolidated through a practical demonstration in SAS Studio.

4.Probability Theory

This lesson will introduce the student to the concepts of probability theory. This lesson includes concepts like samples and populations. The lesson will define the basic definitions and rules of probability. This lesson will end by touching on more advanced concepts like mutually exclusive events, independent events, non independent events, and non mutually exclusive events.


Lesson 5 aims to initiate the student's understanding of random variables and a various number probability distributions known and identified. Together with these concepts, this lesson will showcase the famous Central Limit Theorem (a fundamental concept to the understanding of sample sizes to be discussed in the next lesson).

6.Sample Sizes and Sampling

This lesson will answer the well known question of "how many observations does the study need in order to be statistically significant?". Many studies skip this fundamental step and end up not being able to prove statistical significance as a result. Lesson 6 will understand what it means for a result to be statistically significant and why it is so important for the sample to be large enough.

7.Hypothesis Testing

Lesson 7 will focus on questions about a single group. This lesson starts to uncover the concepts of inferential statistics; statistical methods used to draw conclusions from the sample in order to make conclusion about the population. Prior lessons focused on descriptive statistics, because they helped the student to describe and summarise the data through various methods like plots and summary statistics.

8.Practical Module Recap

The last lesson in module 1 will recap on all the principles covered in module 1 through a practical example.

1.Cleaning and Merging Data

The first lesson of this module is focused on enhancing your knowledge on cleaning and merging data. Lesson 1 will take it back to more basic principles, but nonetheless some of the most important principles of an analysts' work cycle, cleaning and merging data. It is estimated that up to 60% of an analysts' time is spent on cleaning data, therefore this step in the journey is crucial to manage efficiently.

2.Managing Data

Lesson 2 will teach you how to effectively manage your data. Creating new variable and transforming variables all form apart of this step of the process.

3.Working with Dates and Times

Working with dates and times is a vital skill in your Probability and Statistical Analysis journey, this lesson is geared to help you master this skill effectively. Dates and times are concepts integrated into most data sets and working with them, might be considered an art form for some. In lesson 3, the student will learn tips and tricks to deal with dates and times in SAS Studio in order to make their handling as painless as possible.

4.Introducing Linear Regression

Halfway through module 2, our focus is shifts to modelling data, by introducing linear regression, the most well-known modelling procedure.

5.Linear Regression Continued

Understanding linear regression requires a little bit more time, in part 2 of this lesson, we delve a little deeper into this complex topic.

6.Multiple Linear Regression

Multiple regression is an extension of linear regression using multiple explanatory variables, this lesson is focused on understanding this in more detail.

7.Introducing Logistic Regression

Logistic regression usually uses a logistic function to model a binary dependent variable, we explore this concept in this lesson and identify where more complex extensions exist.

8.Logistic Regression Continued

We end off module 2 by delving into the finer details of logistic regression before moving to our more advanced concepts in module 3.