Introduction to Statistical Data Science with Python

Course Information

In an increasingly data-driven world, proficiency in data science and statistics is crucial for success in many sectors. By completing this course, you will develop a strong foundation in statistical data science, enabling you to effectively explore, summarise, and analyse data to derive valuable insights and make informed decisions in various business contexts.

You will be introduced to core concepts within data science, including data visualisation, hypothesis testing and linear regression. We will provide you with the underlying theories behind these methods, allowing you to understand their relevance and applicability. You will also learn how to implement the methods using Python (a widely used programming language in data science).

No prior knowledge of data science or statistics is required.

No previous programming experience is required.

Approximate learning time: 30 hours

We also offer this course taught with the R programming language.

Book this course

To register

Sign in

For more information

Course Agenda

You will work through online material (video presentations, notes, interactive quizzes and practical exercises) covering the following topics:

Data Types

You will learn about the different types of data (e.g. numerical/categorical, quantitative/qualitative, discrete/continuous/ordinal/nominal).

Descriptive Statistics

You will learn how to summarise the location, spread and shape of data using simple metrics (e.g. mean, median, variance, etc.).

Graphical Summaries

Charts and graphs can be an excellent tool to summarise and present data. In this topic you will be introduced to different types of graphs (e.g. bar charts, histograms, boxplots, scatter plots), how to interpret them, and when to use them.

Probability Distributions

You will learn how a probability distribution can be used to describe the probability of the outcomes of an experiment or event.

Statistical Testing

Statistical testing is a method of statistical inference that can be used to test a theory/hypothesis. Applications include:

  • Testing for a deviation from a hypothesised value (e.g. a lightbulb manufacturer claims its lightbulbs last an average of 1000 hours, you want to test this claim)
  • Testing for a difference between two or more groups (e.g. is there a difference in the average salaries for men and women?)
  • Testing for a change following an event or intervention (e.g. does a new drug/treatment affect blood pressure?)

    This topic introduces the statistical testing procedure, including important concepts such as p-values. You will learn how to draw conclusions from statistical tests. You will also learn about different types of tests and how to identify the correct test to use.

    Simple Linear Regression

    Simple linear regression is a statistical modelling technique. It can be used to understand the relationship between two variables and make predictions (e.g. how much increase in sales can we expect if we increase advertising expenditure). In this topic you will learn how to interpret the regression model, how to use the model to make predictions, and how to evaluate how well the model reflects reality.

    Python for Data Analysis

    This topic will provide you with the coding skills needed to analyse data in Python. You will learn: basic programming, data manipulation, data visualisation, statistical testing and linear regression.

    Course Dates

    Course Start Date:  Monday, September 21, 2026 8:00 AM

    Course End Date :  Friday, October 30, 2026 11:30 PM

    Enrolment Dates

    Enrolment Start Date:  Monday, April 13, 2026 8:00 AM

    Enrolment End Date :  Sunday, September 13, 2026 11:30 PM

    Course Location

    Newcastle University

    Course Tutor/s

    Rosabeth White

    Course Delivery Type

    Online - Pre-recorded Virtual course icon

    Book This Course

    To register

    Sign in

    Course Fees

    Price per attendee is £200.00 (zero VAT)