
Welcome to Informatics 2 - Foundations of Data Science
We, the FDS course team are looking forward to helping you learn about the Foundations of Data Science. This OpenCourse site is the main source for most course information, including:
- an overview of what expect you to learn and how you'll be assessed
- the schedule
- lab and workshop logistics
- course materials, including the lecture notes and slides.
Our course Learn page will still be used for communicating important messages (via the Announcements feature), for coursework information and submitting coursework.
About the course
The amount of data is growing at an exponential rate, and it has the power to transform our world for better or worse. It's crucial that we learn how to collect, process and present data so that the conclusions we draw from the data are well-founded, and that we can communicate the results clearly both graphically and in writing. It's vital that we understand the legal frameworks that govern issues around data use, in particular privacy and licensing. We also need to go beyond the law, and consider ethical issues: for example, do we have the consent of the people providing the data in a survey and are the conclusions we draw valid? It's also fundamental that we understand how algorithms we use work, and how much confidence we should have in our results - which takes us into the realm of statistics.
In this course, we'll introduce these foundations of data science, using real world examples of data sets and ethical scenarios. You'll lean how to use Python libraries such as Pandas, Matplotlib and Seaborn to undertake data wrangling and visualisation. We'll introduce linear models both for explanation and prediction, and we'll give a taster of some machine learning methods, which are covered in more depth in later courses in Informatics degree programmes.
License
All rights reserved The University of Edinburgh