Why Pandas?
- The name Pandas comes from “panel data”, an econometrics term, and is also a play on “Python data analysis”.
- It is a game changer when it comes to analyzing tabular data with Python.
- Pandas is open source, free to use (under a BSD license).
- Takes data (CSV, TSV, SQL, etc.) and creates a Python object with rows and columns, called a DataFrame, that looks much like a spreadsheet in Excel.
- A DataFrame is much easier to work with than plain lists and/or dictionaries.
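To make the last two points concrete, here is a minimal sketch that answers the same question twice: once with plain lists and dictionaries, and once with Pandas. The sample data, the column names (`name`, `city`, `sales`) and the variable names are made up for illustration; in the workshop you would load a real file instead of the inline text used here.

```python
import csv
import io

import pandas as pd

# Hypothetical sample data, standing in for a CSV file on disk.
csv_text = """name,city,sales
Ana,Lima,120
Ben,Oslo,80
Cy,Lima,200
"""

# Plain-Python approach: parse into a list of dictionaries.
# Every value arrives as a string, so numbers must be converted by hand.
rows = list(csv.DictReader(io.StringIO(csv_text)))
lima_total_plain = sum(int(r["sales"]) for r in rows if r["city"] == "Lima")

# Pandas approach: read_csv builds a DataFrame with rows and columns
# and infers that the "sales" column holds integers.
df = pd.read_csv(io.StringIO(csv_text))
lima_total_pandas = df.loc[df["city"] == "Lima", "sales"].sum()

print(df)
print(lima_total_plain, lima_total_pandas)
```

Both approaches give the same total, but the Pandas version needs no manual type conversion, and the filter reads close to the question being asked. To load a file from disk, pass its path to `pd.read_csv` in place of the `io.StringIO` object.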
Hands on with Pandas!
Follow the workshop notes with me, then complete the exercises after this course, on your own time and at your own pace. To edit a notebook document, you will first have to make a copy of the assignment in your personal Google Drive account. Remember, if you get stuck on the exercises, try asking your colleagues and me in the Slack channel. Just click on the Join the Discussion section.
Loading Data
CLICK HERE FOR THE WORKSHOP NOTES
CLICK HERE FOR THE EXERCISE
Indexing and Subsetting Dataframes
CLICK HERE FOR THE WORKSHOP NOTES
CLICK HERE FOR THE EXERCISE
Cleaning Data
CLICK HERE FOR THE WORKSHOP NOTES
CLICK HERE FOR THE EXERCISE
Basic Visualizations
CLICK HERE FOR THE WORKSHOP NOTES
CLICK HERE FOR THE EXERCISE