AI Instructor Live Labs Included

Python for Data Analysis

Analyze and visualize real-world data with pandas, NumPy, and matplotlib. Load, clean, transform, merge, and chart datasets.

Intermediate
10h 50m
12 Lessons
PY-400
Python for Data Analysis Badge

View badge details

About This Course

Learn to analyze real-world data using Python's most powerful tools. This course covers NumPy arrays and vectorized operations, pandas DataFrames for data manipulation, cleaning messy datasets (missing values, duplicates, type issues), grouping and merging data from multiple sources, and creating professional visualizations with matplotlib. Every exercise uses realistic messy data not pre-cleaned textbook examples. The capstone project has you clean, merge, and visualize a multi-file client dataset into a business report. Requires Python Data Structures and OOP or equivalent. Course 4 of 6 in the Python Learning Path.

Course Curriculum

12 Lessons
01
AI Lesson
AI Lesson

NumPy Arrays and Vectorized Operations

30m
02
Lab Exercise
Lab Exercise

NumPy - Lab Exercises

1h 15m 1 Exercises

ndarray basics, shape, dtype, array creation (zeros/ones/arange/linspace/random), vectorized operations vs loops, aggregations (sum/mean/std/min/max with axis), boolean indexing and np.where

NumPy Arrays and Vectorized Operations NumPy Arrays and Vectorized Operations ~30 min
03
AI Lesson
AI Lesson

pandas Fundamentals Series and DataFrames

35m
04
Lab Exercise
Lab Exercise

pandas Fundamentals - Lab Exercises

1h 15m 1 Exercises

Series creation and operations, DataFrame from dicts/lists/CSV, column access, .head()/.info()/.describe(), reading data with pd.read_csv/json/excel, selecting with []/loc/iloc/query, adding and modifying columns with .apply()/.map()/np.where

pandas Fundamentals: Series and DataFrames pandas Fundamentals: Series and DataFrames ~30 min
05
AI Lesson
AI Lesson

Data Cleaning and Transformation

35m
06
Lab Exercise
Lab Exercise

Data Cleaning - Lab Exercises

1h 15m 1 Exercises

Missing data (isna/fillna/dropna strategies), duplicates (duplicated/drop_duplicates), type conversion (astype/to_datetime/to_numeric), string operations (.str accessor), reshaping (melt/pivot_table/stack/unstack)

Data Cleaning and Transformation with Pandas Data Cleaning and Transformation with Pandas ~30 min
07
AI Lesson
AI Lesson

Grouping Aggregation and Merging

35m
08
Lab Exercise
Lab Exercise

Grouping and Merging - Lab Exercises

1h 15m 1 Exercises

groupby single and multi-column, .agg() with named aggregations, .transform(), pd.merge (inner/left/outer joins), pd.concat row and column-wise, time-based grouping with resample and Grouper

Mastering GroupBy, Aggregation, and DataFrame Merging Mastering GroupBy, Aggregation, and DataFrame Merging ~30 min
09
AI Lesson
AI Lesson

Data Visualization with matplotlib

35m
10
Lab Exercise
Lab Exercise

Visualization - Lab Exercises

1h 15m 1 Exercises

plt.plot/show, figure and axes objects, chart types (line/bar/scatter/histogram/pie), customization (titles/labels/legends/colors/grid), multi-panel with subplots, saving figures with savefig

Advanced Data Visualization with matplotlib Advanced Data Visualization with matplotlib ~30 min
11
AI Lesson
AI Lesson

Capstone Briefing Client Data Report

30m
12
Lab Exercise
Lab Exercise

Capstone Client Data Report

1h 15m 1 Exercises

Capstone combining all data analysis skills: load 3 messy CSVs, clean all datasets, merge into analysis DataFrame, compute business metrics, generate multi-panel matplotlib visualization, export summary CSV and PNG

Capstone Client Data Report Capstone Client Data Report ~30 min

This course includes:

  • 24/7 AI Instructor Support
  • Live Lab Environments
  • 6 Hands-on Lessons
  • 6 Months Access
  • Completion Badge
  • Certificate of Completion
Python for Data Analysis Badge

Earn Your Badge

Complete all lessons to unlock the Python for Data Analysis achievement badge.

Category
Skill Level Intermediate
Total Duration 10h 50m
Python for Data Analysis Badge
Achievement Badge

Python for Data Analysis

Demonstrates proficiency in data analysis with pandas, NumPy, and matplotlib.

Course Python for Data Analysis

Skills You'll Earn

Python pandas NumPy matplotlib Data Cleaning Data Visualization

Complete all lessons in this course to earn this badge