Python Data Associate Certification
Study Guide
Please use this study guide to create your certification self-study plan. We’ve included the
objectives you should meet for each assessed competency, with links to relevant practice
assessments.
Associate
Exam PY101: Data Management and Exploratory Analysis
1. Perform standard data import, joining and aggregation tasks
1.1. Import data from flat files
1.2. Combine multiple tables by rows or columns
1.3. Filter data based on different criteria
1.4. Aggregate numeric, categorical variables and dates by groups
2. Perform cleaning tasks to prepare data for analysis
2.1. Match strings in a dataset with specific patterns
2.2. Convert values between data types
2.3. Clean categorical and text data by manipulating strings.
2.4. Clean date and time data.
3. Assess data quality and perform validation tasks
3.1. Identify and replace missing values
3.2. Perform different types of data validation tasks (e.g. consistency, constraints,
range validation, uniqueness)
3.3. Identify and validate data types in a data set
Related assessment
Importing & Cleaning Data with Python
4. Calculate metrics to effectively report characteristics of data and relationships
between features
Python Data Associate Certification
Study Guide
4.1. Calculate measures of center (e.g. mean, median, mode) for variables
4.2. Calculate measures of spread (e.g. range, standard deviation, variance) for
variables
4.3. Calculate skewness for variables
4.4. Calculate missingness for variables and explain its influence on reporting
characteristics of data and relationships
4.5. Calculate the correlation between variables
5. Create data visualizations to demonstrate the characteristics of data
5.1. Create and customize bar charts
5.2. Create and customize box plots
5.3. Create and customize line graphs
5.4. Create and customize histograms
6. Identify and reduce the impact of characteristics of data
6.1. Identify when imputation methods should be used and implement them to
reduce the impact of missing data on analysis or modeling
6.2. Describe when a transformation to a variable is required and implement
corresponding transformations
6.3. Describe the differences between types of missingness and identify relevant
approaches to handling types of missingness
Related assessment
Data Manipulation with Python