Introduction to Statistical Data Science
In an increasingly data-driven world, proficiency in data science and statistics is crucial for success in many sectors. By completing this course, you will develop a strong foundation in statistical data science, enabling you to effectively explore, summarise, and analyse data to derive valuable insights and make informed decisions in various business contexts.
You will be introduced to core concepts within data science, including data visualisation, hypothesis testing and linear regression. We will provide you with the underlying theories behind these methods, allowing you to understand their relevance and applicability. You will also learn how to implement the methods using Python (a widely used programming language in data science).
No prior knowledge of data science or statistics is required.
No previous programming experience is required.
If you are interested in this course, please enquire here and state if you are a student, learner or employer and which delivery option you would be interested in.
Course Agenda
You will work through online material (video presentations, notes, interactive quizzes and practical exercises) covering the following topics:
Data Types:
You will learn about the different types of data (e.g. numerical/categorical, quantitative/qualitative, discrete/continuous/ordinal/nominal).
Descriptive Statistics:
You will learn how to summarise the location, spread and shape of data using simple metrics (e.g. mean, median, variance, etc.).
Graphical Summaries:
Charts and graphs can be an excellent tool to summarise and present data. In this topic you will be introduced to different types of graphs (e.g. bar charts, histograms, boxplots, scatter plots), how to interpret them, and when to use them.Probability Distributions:
You will learn how a probability distribution can be used to describe the probability of the outcomes of an experiment or event.
Statistical Testing:
Statistical testing is a method of statistical inference that can be used to test a theory/hypothesis. Applications include:
- Testing for a deviation from a hypothesised value (e.g. a lightbulb manufacturer claims its lightbulbs last an average of 1000 hours, you want to test this claim)
- Testing for a difference between two or more groups (e.g. is there a difference in the average salaries for men and women?)
- Testing for a change following an event or intervention (e.g. does a new drug/treatment affect blood pressure?)
This topic introduces the statistical testing procedure, including important concepts such as p-values. You will learn how to draw conclusions from statistical tests. You will also learn about different types of tests and how to identify the correct test to use.
Simple Linear Regression:
Simple linear regression is a statistical modelling technique. It can be used to understand the relationship between two variables and make predictions (e.g. how much increase in sales can we expect if we increase advertising expenditure). In this topic you will learn how to interpret the regression model, how to use the model to make predictions, and how to evaluate how well the model reflects reality.
Python for Data Analysis:
This topic will provide you with the coding skills needed to analyse data in Python. You will learn: basic programming, data manipulation, data visualisation, statistical testing and linear regression.
Approximate learning time for the online material: 20-28 hours.
You will also have the option to attend an in-person workshop where you will have the opportunity to ask questions, explore the material further, work through hands-on exercises, and network with other participants on the course.
Options for course delivery are given below
Option 1 (for individuals)
Format: Online only
You will work through the online material in your own time.
Option 2 (for individuals)
Format: Online + in-person workshop
Following completion of the online material, you will be invited to participate in an in-person workshop at Newcastle University.
Option 3 (for companies)
Format: Online + in-person workshop(s)
If you are a company/organisation looking to up-skill your staff in data science, we can develop a bespoke training course to suit your requirements. We will work with you to enhance the practicality of the training using examples, case studies and datasets that are relevant to your company/sector.
Your staff will be provided with access to the online material, and we will deliver in-person workshop(s) (location to be agreed).
If you are interested in this course, please enquire here and state if you are a student, learner or employer and which delivery option you would be interested in.
For more information on this course, please contact llahub@newcastle.ac.uk or enquire here
Register your interest
To Register interest
Sign in