Data aggregations and descriptive statistics
Summarizing data
Please do not copy without permission. © ALX 2024.
Data aggregations and descriptive statistics
Data overview
| You are an analyst hired by an NPO to work on initiatives that focus on closing the gender
achievement gap in education.
You need… The dataset
Summary statistics to help you The PhD graduates in public chartered
understand the gender differences universities in Kenya, 2015 dataset from
among PhD graduates. openAFRICA.
2
Data aggregations and descriptive statistics
Data overview
| The PhD graduates dataset contains 22
rows and the following columns:
The dataset
A. Qualification
The title of the qualification being considered.
B. Male
The number of male graduates for each
qualification in the year 2015.
C. Female
The number of female graduates for each
qualification in the year 2015.
3
Data aggregations and descriptive statistics
Consider the questions we want to investigate
PhD graduates
PhD graduates
in public chartered
in public universities
chartered in
universities in
KenyaKenya
Which qualifications have
How many males and
How many qualifications the least or most male and
females received PhD
are taken into account? female graduates,
qualifications, respectively?
respectively?
4
Data aggregations and descriptive statistics
The SUM function
| How many males and females received PhD qualifications, respectively?
The answer to this question will give more information on the
gender disparities among PhD holders at Kenya’s public
chartered universities in 2015.
01. Add all the values in the Male column;
02. Add all the values in the Female column;
5
Data aggregations and descriptive statistics
The SUM function
The SUM function is used to add the cells that
=SUM(value1, [value2, …]) are specified in the function argument.
The SUM of a range. The SUM of specific cells.
6
Data aggregations and descriptive statistics
The SUM function
01. Ignores empty cells, cells with text, and 01.
TRUE/FALSE values.
02. Returns an error if any of the cells contain
errors.
02. 02.
7
Data aggregations and descriptive statistics
The COUNT function
| How many qualifications are taken into account?
This question will provide insight into the number of
qualifications that are taken into account in this study and
give us a better understanding of the scope of the values we
are working with.
Count the number of entries in the
01.
Qualification column;
8
Data aggregations and descriptive statistics
The COUNT function
The COUNT function counts the number of cells
=COUNT(value1, [value2, …]) that have numerical values within the specified
range.
The COUNT of a range.
9
Data aggregations and descriptive statistics
The COUNT function
01. Ignores empty cells, cells with text, and 01.
TRUE/FALSE values.
02. Use COUNTA to include text and True/False
values.
03. Ignores cells that contain errors.
02. 03.
10
Data aggregations and descriptive statistics
The MIN and MAX functions
| Which qualifications have the least or most male and female graduates, respectively?
This inquiry will shed light on the degrees that have
graduated the most or least number of males and females.
This can also help us determine the qualifications that men
and women are more likely to pursue.
Find the minimum and maximum values in the Male
01.
column.
Find the minimum and maximum values in the
02.
Female column.
11
Data aggregations and descriptive statistics
The MIN and MAX functions
The MIN and MAX functions
=MIN(value1, [value2, …]) find the minimum and =MAX(value1, [value2, …])
maximum number within the
specified range.
The MIN of a range. The MAX of a range.
12
Data aggregations and descriptive statistics
The MIN and MAX functions
01. Ignores empty cells, cells with text, and
01.
TRUE/FALSE values.
02. Returns an error if any of the cells contain
errors.
02. 02.
13