Statistics and
Probability
Introduction
Course Overview
What is Statistics and
Probability?
Probability deals with
predicting the likelihood of future
events, while statistics involves the
analysis of the frequency of past
events.
What is Statistics and
Probability?
Probability is the chance
and possibilities.
Statistics refers to data.
Statistics
Statistics is a science that
involves collecting, analyzing,
summarizing and interpreting the
data.
Used to aid in decision making
process.
Statistics
Statistics are as a
process in solving a
problem.
Do these questions require a
statistical tool?
1. How many Filipinos are
playing mobile legends?
2. How much is Chris’
allowance daily?
STATISTICS
It is a technique or method by
which data are collected, organized,
presented, analyzed, and
interpreted.
1)Descriptive Statistics
2)Inferential Statistics
DESCRIPTIVE STATISTICS
It is a form of statistical methods
and procedure which is concerned
with collecting and describing the
characteristics and properties of a
set of data so as to obtain
meaningful information.
INFERENTIAL STATISTICS
It is a form of statistical
method and procedures which is
concerned with the analysis of a
subset of data leading to
predictions or inferences about
the entire set of data.
STATISTICAL TERMINOLOGIES
Population – consists of the totality of the
subjects under consideration.
For instance, if there are 3,000 students in
private school and a researcher wants to know
the place where a majority of them lives, then
all the students compose the population, which
can be represented by N.
STATISTICAL TERMINOLOGIES
Sample – is a subset or a small part of a
population from which inferences about the
total population are made.
For instance, a sample may consist of
150 students from a private high school
selected to lead a rally. These 150 students
comprise the sample denoted by n.
STATISTICAL TERMINOLOGIES
Parameter – any statistical information
or attribute taken from a population
Statistic – any estimate of statistical
attributes taken from a sample.
STATISTICAL TERMINOLOGIES
Data- are statistical facts, or a set of
information or observations under study.
Example/s: scores, ages, gender and
nationality.
1) Quantitative Data
2) Qualitative Data
STATISTICAL TERMINOLOGIES
Variable – a specific characteristic or
property of unit sample of a population
which may be different for each unit sample.
- It is called a variable because the value may
vary between data units in a population, and
may change in value over time.
(a)Quantitative Variables
(b)Qualitative Variables
Collecting Data
1.Interview Method
2.Questionnaire Method
3.Registration Method
4.Experimental Method
Statistical Question
A statistical question is a
question that anticipates many
different responses.
Variability – describes how spread
out or closely clustered a set of data
is.
Do these questions require a
statistical tool?
1. How many Filipinos are
playing mobile legends?
2. How much is Chris’
allowance daily?
Consider these 2 Questions:
1. How old am I?
Not a statistical question
2. How old are the students at my
school?
It is a statistical question
Examples:
1. What is your phone number? No
2. How many televisions does each
family on your street own? YES
3. How much did you pay for your
tuition? No
4. How many brothers and sisters does
each student in your school have? YES
High or Low Variability
How old are the people shopping
at the mall?
How old are the students in the
11th grade?
Interview Method
The researcher has a personal
contact with the individual
interviewed. The researcher gathers
the significant information and facts
by questioning and inquiring from
the individual interviewed.
Questionnaire Method
This method is done by sending,
giving or distributing
questionnaires to the persons from
whom a researcher would like to get
significant information through
personal delivery or by mail.
Registration Method
This method of collecting
data is one which utilizes existing
records provided by legitimate
agencies and institutions
regulated and govern by laws.
Experimental Method
This method of collecting
data is done by performing or
conducting scientific inquiry to
know the effect of particular
variable over the other variable.
Methods of Presenting
Data
1.Textual Method
2.Tabular Method
3.Graphical Method
Data Presentation
Presenting the data includes the
pictorial representation of the data
by using graphs, charts, maps, and
other methods.
This visual representation of
data is called as data visualization.
Textual Method
It is the simplest method of
presenting data. Data can be
presented or expressed in narrative
form or in paragraph form. The
emphasis of this method is citing
significant characteristics, figures
and features of data.
Consider the unit test scores of 40 senior students.
These test scores are now organized and arranged from
highest to lowest.
A way of describing the data using the textual form is
as follows:
97 93 90 84 79 78 77 76 74 69
96 92 88 84 79 78 77 76 73 68
96 94 87 82 79 78 77 75 73 64
94 91 87 82 79 78 76 75 71 63
93 90 86 81 79 78 76 74 70 61
The highest score obtained is 97 and the lowest is 61.
All of the students got a score of 61 and above. Generally
speaking, the students performed very well in the unit test.
Tabular Method
The tabular method of presentation is
applicable for large data sets.
The frequency distribution table is the
usual tabular form of presenting the
distribution of the data.
Tables are usually used in written
technical reports and in oral presentation.
Frequency Distribution Table
(FDT)
A large data set can be easily
analyzed and interpreted by grouping
the data into different classes in order to
determine the number of observations
that fall within a given interval or class.
Frequency Distribution Table
(FDT)
A FDT showing categorical
variables. An FDT containing non-
overlapping categories or classes of a
variable and the frequencies or counts of
the observations falling into the
categories or classes.
• Class limit – The lower and upper limit,
respectively, that can fall in a given class.
• Class boundaries – These are the numbers
midway between the upper limit of one class
and the lower limit of the next class.
• Class width or size – It is the distance or
the difference between the lower limit of
one class and the lower limit of the next
class.
• Class frequency – it is the number of
observations that fall in a given class.
• Class mark – it is the average of the
lower and upper limit of given class.
• Relative frequency – The ratio of the
class frequency to the total frequency.
• Less than cumulative frequency – It is
the number of observations that are less
than or equal to the upper limit of the
class interval.
• Greater than cumulative frequency –
It is the number of observations that are
greater than or equal to the upper limit
of the class interval.
Class F Class Class Relative <cumulative > Cumulative
frequency frequency
Intervals Boundaries Mark frequency
60 – 65 7 59.5 – 65.5 62.5 19.4 7 36
66 – 71 3 65.5 – 71.5 68.5 8.3 10 29
72 – 77 7 71.5 – 77.5 74.5 19.4 17 26
78 – 83 5 77.5 – 83.5 80.5 13.8 22 19
84 – 89 8 83.5 – 89.5 86.5 20.0 30 14
90 – 95 5 89.5 – 95.5 92.5 13.8 35 6
96 – 100 1 95.5 – 100.5 98.5 2.7 36 1
N = 36 𝑓
R= × 100
𝑁
Graphical Method
Graphs are commonly used in oral
presentation. There are several forms
of graphs to use like the pie chart,
pictograph, bar graph, line graph,
histogram and box-plot.
Histogram
A histogram is used to summarize
discrete or continuous data.
It is similar to vertical bar graph.
However, a histogram, unlike a vertical
bar graph, shows no gaps between the
bars and uses purely numerical data.
*spread/dispersion
Class F Class Class Relative <cumulative > Cumulative
frequency frequency
Intervals Boundaries Mark frequency
91 – 100 3
81 – 90 5
71 – 80 6
61 – 70 0
51 – 60 9
41 – 50 7
31 – 40 4
N=? 𝑓
R= × 100
𝑁
Thank You