Why Pandas?


  • The name Pandas comes from “panel data” and is also a play on “Python data analysis”.
  • Pandas is a game changer for analyzing data with Python.
  • Pandas is open source and free to use (under a BSD license).
  • It reads data (CSV, TSV, SQL, etc.) into a Python object with rows and columns, called a DataFrame, that looks much like an Excel spreadsheet.
  • A DataFrame is much easier to work with than lists and/or dictionaries.
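
To make the last two points concrete, here is a minimal sketch of reading CSV data into a DataFrame and comparing it with plain lists and dictionaries. The column names and values are made up for illustration, and the CSV is held in an in-memory string rather than a real file so the snippet runs anywhere.

```python
import io

import pandas as pd

# Hypothetical CSV content; in practice this would be a file on disk.
csv_text = """name,city,age
Ana,Lisbon,34
Ben,Toronto,28
Chen,Taipei,41
"""

# read_csv accepts a file path or a file-like object such as this buffer.
df = pd.read_csv(io.StringIO(csv_text))

# The same data as a plain list of dictionaries, for comparison.
records = df.to_dict(orient="records")

# Average age with plain Python: loop over the records by hand.
mean_age_plain = sum(r["age"] for r in records) / len(records)

# Average age with Pandas: one method call on the column.
mean_age_pandas = df["age"].mean()

# Keep only the rows where age is above 30.
over_30 = df[df["age"] > 30]

print(df)
print(over_30)
```

Both approaches give the same average, but the DataFrame version stays a one-liner as the questions get more involved, such as the row filter on the last step.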

Hands on with Pandas!

Follow the workshop notes with me, then complete the exercises after this course, on your own time and at your own pace. To edit an exercise notebook, you will first have to make a copy of it in your personal Google Drive account. If you get stuck on the exercises, ask your colleagues or me in the Slack channel, which you can reach through the Join the Discussion section.

Loading Data

CLICK HERE FOR THE WORKSHOP NOTES

CLICK HERE FOR THE EXERCISE

Indexing and Subsetting Dataframes

CLICK HERE FOR THE WORKSHOP NOTES

CLICK HERE FOR THE EXERCISE

Cleaning Data

CLICK HERE FOR THE WORKSHOP NOTES

CLICK HERE FOR THE EXERCISE

Basic Visualizations

CLICK HERE FOR THE WORKSHOP NOTES

CLICK HERE FOR THE EXERCISE

