Faculty Name: Dr. M.
Massarrat Ali Khan
Course Name: Introduction to Statistics
Email: mokhan@iba.edu.pk
Week 1
What is Statistics?
• Meaning of the word Statistics:
There are three meaning of the word statistics:
1. Statistics( plural): Facts and figures itself is called statistics
e.g. Import statistics, production statistics, results statistics
2. Statistics( plural): It is a subject of social science which deals with the collection,
organization or presentation analysis of data and interpretation the results about the
population from the data based on samples for decision making.
3. Statistic(Singular): It is characteristic of sample in numerical values
Key Terms In Statistics
• Population:
The totality of the data with which we are concerned is called
population.
• Sample: It is the subset or portion of the population.
• Parameter: The characteristics of population in numerical terms is called
parameter.
e.g. Population A.M, the notation of which is µ
• Statistic: The characteristics of sample in numerical terms is called statistic e.g.
A.M of sample, the notation of which is
Variable: It is a characteristic or measurement that can be determined for each member of a population .
Variable is denoted by X or Y
Variable may be Numerical and Categorical
1. Numerical: Numerical variable take one value with equal unit of measurement such a weight in pounds,
time in hours etc.
Numerical Variable are of two types:
Discrete: A variable whose value is obtained by counting e.g. no of calls in a day
Continuous: A continuous variable take all values in a given interval. The value of continuous variable is
obtained by measuring e.g. temperature 20o to 30o .
2. Categorical: It is a variable that can take values on the basis of some qualitative property e.g. person
affiliation with a political party.
X PTI, PPP, NML.
Type of Data
Data: are the actual values of the variable. They may be numbers or
words.
So the actual data can be divided into:
• Quantitative Data discrete and continuous variable
• Qualitative Data Categorical or subjective variable
Organizing Qualitative Data
• Tabular Presentation
• Frequency Distributions: A frequency distribution of qualitative data is a listing of the distinct
values and their frequencies.
• To Construct a Frequency Distribution of Qualitative Data
Step 1 List the distinct values of the observations in the data set in the first column of a table.
Step 2 For each observation, place a tally mark in the second column of the table in the row of
the appropriate distinct value.
Step 3 Count the tallies for each distinct value and record the totals in the third column of the
table.
Example
Political party affiliations of the students in introductory statistics
Table for constructing a frequency distribution for the political party affiliation
Relative-frequency distribution
• A relative-frequency distribution of qualitative data is a listing of the
distinct values and their relative frequencies.
• To Construct a Relative-Frequency Distribution of Qualitative Data
Step 1 Obtain a frequency distribution of the data.
Step 2 Divide each frequency by the total number of observations.
Example
• Figure out your first relative frequency by dividing the count by the
total.
For the category of dogs we have 16 out of 56, so 16/56=0.29
Graphical Presentation of Qualitative Data
Pie Charts
Pie Charts Another method for organizing and summarizing data is to draw a picture of
some kind. The old saying “a picture is worth a thousand words” has particular
relevance in statistics—a graph or chart of a data set often provides the simplest and
most efficient display. Two common methods for graphically displaying qualitative data
are pie charts and bar charts
-A pie chart is a disk divided into wedge-shaped pieces proportional to the relative
frequencies of the qualitative data.
• To Construct a Pie Chart
Step 1 Obtain a relative-frequency distribution of the data.
Step 2 Divide a disk into wedge-shaped pieces proportional to the relative frequencies.
Step 3 Label the slices with the distinct values and their relative frequencies.
Example of Pie Chart:
Political Party Affiliations of the students
Bar Chart
• Bar Charts : Another graphical display for qualitative data is the bar
chart.
- A bar chart displays the distinct values of the qualitative data on a
horizontal axis and the relative frequencies (or frequencies or percent)
of those values on a vertical axis. The relative frequency of each distinct
value is represented by a vertical bar whose height is equal to the
relative frequency of that value. The bars should be positioned so that
they do not touch each other.
Example of Bar Chart:
Organizing Quantitative Data
Quantitative Data
Tabular Presentation
• Frequency Distribution Table
-To organize quantitative data, we first group the observations into classes (also
known as categories or bins) and then treat the classes as the distinct values of
qualitative data. Consequently, once we group the quantitative data into classes, we
can construct frequency and relative-frequency distributions of the data in exactly
the same way as we did for qualitative data.
Three important guidelines for grouping quantitative data into classes are:
1. The number of classes should be small enough to provide an effective summary but
large enough to display the relevant characteristics of the data. A rule of thumb is that
the number of classes should be between 5 and 20.
Guide Line for number of classes:
Number of Classes = 1 + 3.3 log N
If N=100 No. of classes = 1+ 3.3 log 100 = 1+3.3x2.0 = 8
2. Each observation must belong to only one class.
3. Whenever feasible, all classes should have the same width. Roughly speaking if
possible, all classes should cover the same number of possible values.
• Class Width = Range/No. of Classes