Part I: Data: ethics, collection, representation, wrangling, exploration, visualisation and descriptive statistics |
S1 W1 15-19 Sep | Introduction and Logistics (KG) | Data (KG) | Introduction to Jupyter notebooks and Pandas | | Lecture Notes (LN) 1, 2 |
S1 W2 22-26 Sep | Descriptive statistics (KG) | Introduction to data ethics (KG) | Pandas - Data wrangling I | Task: Preparation for Week 3 Workshop on Ethics. | LN 3, 4 An Introduction to Data Ethics, Parts 1 and 2 |
S1 W3 29 Sep-3 Oct | Exploratory data analysis, data communication visualisation (DS) | No lecture | Data representation I - Matplotlib | Workshop: Data ethics discussion | LN 5, The Big Book of Dashboards, Chapter 1 |
S1 W4 6-10 Oct | Visualisation (DS) | Data collection and statistical relationships (KG) | Data representation II: Distributions | Task: Marking visualisations and interpretation in previous CW | LN 6 |
Part II: Linear Models |
S1 W5 13-17 Oct | Linear regression I (BI) | Linear regression II (BI) | Data wrangling II | Workshop: Discussing visualisations and interpretation | LN 7 |
S1 W6 20-24 Oct | Formative exercise 1 feedback Q&A | No lecture | | | |
S1 W7 27 Oct-31 Oct | Multiple regression I (BI) | Multiple regression II (BI) | Linear models | Workshop: Linear regression | LN 8, 9 |
S1 W8 3-7 Nov | Principal Components Analysis I (BI) | Principal Components Analysis II (BI) | PCA | Task: Preparation for Week 9 workshop | LN 10 |
Part III: Introduction to Machine Learning |
S1 W9 10-14 Nov | Intro to supervised learning: Nearest neighbours (DS) | k-Nearest Neighbours and Evaluation (DS) | k-Nearest Neighbours | Workshop: Interpretation of data science study using multiple regression | LN 11, 12 |
S1 W10 17-21 Nov | Intro to unsupervised learning: K-means (DS) | Formative exercise 2 feedback (BI) | K-means | | LN 13 |
S1 W11 24-28 Nov | No lecture | No lecture | | | |
Part IV: Statistical inference |
S2 W1 13-17 Jan | Intro to inferential statistics (DS) | Randomness, sampling and simulation (DS) | Randomness, sampling and simulations | Task: Critical evaluation preparation | LN 14 |
S2 W2 20-24 Jan | Estimation (DS) | Confidence intervals (DS) | Estimation of confidence intervals with the bootstrap | Workshop: Critical evaluation | LN 15, 16 |
S2 W3 27-31 Feb | Hypothesis testing and p-values (DS)
| A/B testing (DS) | No lab | Task: Problem sheet for S2 Week 4 Workshop | LN 17, 18, XKCD comic strip on multiple testing, A hypothesis is a liability |
Part V: The maximum likelihood principle and regression |
S2 W4 3-7 Feb | Inference of regression coefficients and logistic regression (DS) | Logistic regression (DS) | Logistic regression | Workshop: Statistical problems 1 | LN 19, 20 |
S2 W5 10-14 Feb | Linear regression and inference (DS) | Generalised linear models (DS) | Web-scraping | Task: Problem sheet for S2 Week 6 Workshop | LN 21 |
S2, FLW 17-21 Feb | | | | | |
S2 W6 24-28 Feb | Project Q&A and Ethical and legal issues in supervised learning (DS) | No lecture | | Workshop: Statistical problems 2 | LN 22, Equality law can disadvantage women in algorithmic credit decisions |
Part VI: Project and project skills |
S2 W7 3-7 Mar | Software engineering for data science (DS) | No lecture | | | Good enough practices in scientific computing, Reproducible Analysis Through Automated Jupyter Notebook Pipelines, The scientific paper is obsolete |
S2 W8 10-14 Mar | Project writing workshop in lecture slot (DS) | No lecture | | Workshop: Project presentations | |
S2 W9 17-21 Mar | Guest lecture: "Data Science in Action: Exploring Edinburgh’s Student Housing Crisis" Toby Wyckoff Smith (FDS alumnus) of Slurp: Students for Action on Homelessness | No lecture | | Workshop: Project presentations | |
S2 W10 24-28 Mar | No lecture | No lecture | | Workshop: Project presentations | |
S2 W11 31 Mar-4 Apr | | | | | |