Data Wrangling in Python
Lecture Slides Lecture Slides (pdf) Lecture Slides (ipynb)
Tutorial Exercise Tutorial Exercise (pdf) Tutorial Exercise (ipynb)
This week we will turn from broader programming and software engineering concepts to practical approaches of working with data in Python. In particular, we will focus on pandas, a versatile library for data analysis, which often serves as the first building block in many data-science pipelines.
Required Readings
Additional Readings
- Guttag Ch 23: Exploring Data with Pandas;
- Charles R. Harris et al. 2020. Array programming with NumPy. Nature 585 (7825): 357-362. https://doi.org/10.1038/s41586-020-2649-2
Tutorial
- Data input and output;
- Working with pandas library.
Assignment 4 (Blackboard)
- Data Wrangling and Classes in Python.
- Due at 12:00 on Monday, 1st December (submission on Blackboard);
- Rename the file from
04_assignment.ipynbto04_assignment_lastname_firstname_studentnumber.ipynbbefore submission.