Course Details
Course Duration: 2 days; Instructor-led
Audience
Anyone who wants to use the Python programming language for data analysis.
Prerequisites
Prior Python programming knowledge is preferred.
Methodology
This program will be conducted through interactive lectures, PowerPoint presentations, discussions, and practical exercises.
Course Objectives
This course introduces data manipulation and cleaning techniques using pandas, the popular Python data science library. It presents the Series and DataFrame as the central data structures for data analysis, with tutorials on using functions such as groupby, merge, and pivot tables effectively. By the end of the course, participants will be able to take tabular data, clean it, manipulate it, and run basic inferential statistical analyses.
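As a rough illustration of the operations named above, the sketch below applies groupby, merge, and a pivot table to two small tables. The `sales` and `stores` data, and all column names, are hypothetical and invented for this example; they are not course material.

```python
import pandas as pd

# Hypothetical sales records, invented for illustration only.
sales = pd.DataFrame({
    "store": ["A", "A", "B", "B"],
    "quarter": ["Q1", "Q2", "Q1", "Q2"],
    "revenue": [100, 150, 80, 120],
})
# Hypothetical lookup table mapping each store to a region.
stores = pd.DataFrame({"store": ["A", "B"], "region": ["North", "South"]})

# A Series is a single labelled column taken from a DataFrame.
revenue = sales["revenue"]

# groupby: total revenue per store.
totals = sales.groupby("store")["revenue"].sum()

# merge: attach each store's region to its sales rows.
merged = sales.merge(stores, on="store", how="left")

# pivot_table: stores as rows, quarters as columns, summed revenue as values.
pivot = sales.pivot_table(
    index="store", columns="quarter", values="revenue", aggfunc="sum"
)

print(totals)
print(merged)
print(pivot)
```

The left merge keeps every sales row even if a store had no entry in the lookup table, which is usually the safer default when enriching a fact table.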
Course Outline
Module 7: Data Manipulation, Simple Statistics and Data Plotting Using Pandas
- Finding Maximums & Minimums
- Calculating and identifying outliers
- Indexing and Selecting Data
- Reshaping and Pivot Tables
- Time Series / Date functionality
- Categorical Data
- Simple Statistical Analysis
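A minimal sketch of the first few topics in this module, assuming a small made-up table of temperature readings. The data and column names are hypothetical, and the 1.5 x IQR rule shown here is one common convention for flagging outliers, not necessarily the method taught in the course.

```python
import pandas as pd

# Hypothetical daily temperature readings, invented for illustration only.
readings = pd.DataFrame({
    "date": pd.to_datetime([
        "2024-01-01", "2024-01-02", "2024-01-03",
        "2024-01-04", "2024-01-05", "2024-01-06",
    ]),
    "temp_c": [20.0, 21.0, 19.0, 22.0, 20.0, 45.0],
})

# Maximum and minimum values, plus the date on which the maximum occurred.
highest = readings["temp_c"].max()
lowest = readings["temp_c"].min()
day_of_highest = readings.loc[readings["temp_c"].idxmax(), "date"]

# Outliers by the 1.5 x IQR rule: values far outside the middle 50% of the data.
q1 = readings["temp_c"].quantile(0.25)
q3 = readings["temp_c"].quantile(0.75)
iqr = q3 - q1
is_outlier = (readings["temp_c"] < q1 - 1.5 * iqr) | (readings["temp_c"] > q3 + 1.5 * iqr)
outliers = readings[is_outlier]

# Simple summary statistics (count, mean, std, quartiles) in one call.
summary = readings["temp_c"].describe()

print(highest, lowest, day_of_highest)
print(outliers)
print(summary)
```

Boolean indexing with `readings[is_outlier]` is also a small example of selecting data by condition.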
Module 8: Visualization
- Basic Plotting
- Plot Customization
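The two visualization topics can be sketched as follows, assuming matplotlib is installed alongside pandas. The `monthly` visitor counts are hypothetical, and the chosen title, labels, and marker are arbitrary customizations for illustration.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the script runs without a display
import pandas as pd

# Hypothetical monthly visitor counts, invented for illustration only.
monthly = pd.DataFrame({
    "month": ["Jan", "Feb", "Mar", "Apr"],
    "visitors": [120, 135, 160, 150],
})

# Basic plotting: DataFrame.plot returns a matplotlib Axes object.
ax = monthly.plot(x="month", y="visitors", kind="line", marker="o", legend=False)

# Plot customization: title, axis labels, and grid lines are set on the Axes.
ax.set_title("Monthly visitors")
ax.set_xlabel("Month")
ax.set_ylabel("Visitors")
ax.grid(True)
```

To write the chart to disk, `ax.figure.savefig("monthly_visitors.png")` could be called afterwards; in an interactive session the backend line can be dropped so the figure displays directly.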