Stat Exercises
Stat Exercises
EXERCISE No. 4
1. The directors of a small college find it necessary to increase the size of classes. A special
film, utilizing the most advanced propaganda techniques presents the advantages of larger
size of classes. It is anticipated that more favorable attitudes (i.e. higher scores) will result
from exposure to the film. Use 1% level of significance to test the hypothesis.
Before 25 23 30 7 3 22 12 30 5 14
After 28 19 34 10 6 26 13 47 16 9
Solution:
Step 1. H0: µ1 = µ2, That the mean average of favorable attitudes (higher score) before and after watching the film is the same or do not differ
significantly.
Ha: µ1 > µ2, That the mean average of favorable attitude (higher score) before exposure to the film is higher than after.
Table 1.
1 25 28 -3 9
2 23 19 4 16
3 30 34 -4 16
4 7 10 -3 9
5 3 6 -3 9
6 22 26 -4 16
7 12 13 -1 1
8 30 47 -17 289
9 5 16 -11 121
10 14 9 5 25
Total 171 208 -37 511
Step 6. Decision: Since tc = -1.8148 < t0.05 (10-1) = 1.833, we do not reject the H0
Step 7. Conclusion: Based on the result, it can be concluded that the mean average of
favorable attitudes (higher score) before and after watching the film is the same or do not
differ significantly at 5% level.
2. Two samples of 8 high school seniors each has been matched on IQ before beginning an
experiment in learning words, after which they tested for the number of words learned. Group
A studied in pairs, one student reading the words to other; the students in Group B studied
alone. The following table shows the results:
Is there a significant difference at 5% level in the number of the words learned between
Group A and Group B?
Solution:
Step 1. H0: µ1 = µ2, that there is no significant difference in the number of words learned between the two groups (studied in pairs and studied
alone).
Ha: µ1 ≠ µ2, that there is a significant difference in the number of words learned between the two groups (studied in pairs and studied
alone).
Table 2.
GROUP A GROUP B Di Di
2
20 25 -5 25
13 15 -2 4
15 20 -5 25
10 15 -5 25
18 16 2 4
10 11 -1 1
20 15 5 25
12 20 -8 64
SUM 118 137 -19 173
MEAN 14.75 17.125
Step 6. Decision: Since tc = -1.5717 < t0.05 (10-1) = 1.895, we do not reject the H0
Step 7. Conclusion: Based on the result, it can be concluded that there is no significant
difference in the number of words learned between the two groups (studied in pairs and
studied alone).
3. A course is designed to increase readers’ speed and comprehension. To evaluate
the effectiveness of this course, a test is given both before and after course, and
sample results follow. At the 0.05 significance level, test the claim that the scores are
higher after the course.
Solution:
Step 1. H0: µ1 = µ2, that the course has no effect on the reader’s speed and comprehension before and after the course or they do not differ
significantly.
Ha: µ1 > µ2, that the reader’s speed and comprehension after the course is greater than or increases than before.
Table 3.
Step 6. Decision: Since tc = -2.3637 < t0.05 (10-1) = 1.833, we do not reject the H0
Step 7. Conclusion: Based on the result, it can be concluded that the course has no effect
on the reader’s speed and comprehension before and after the course or they do not differ
significantly.
4. A pill designed to lower systolic blood pressure is administered to 10 randomly
selected volunteers. The results are as follows:
Volunteer 1 2 3 4 5 6 7 8 9 10
Before 120 120 160 98 115 110 180 190 138 128
Pill
After Pill 90 115 120 105 90 90 180 100 100 110
At 0.05 level of significance, test the claim that systolic blood pressure is not affected
by the pill. That is, test the claim that the before and after values are equal.
Solution:
Step 1. H0: µ1 = µ2, that the pill has no effect on systolic blood pressure, and the mean blood pressure before and after taking the pill is the
same.
Ha: µ1 ≠ µ2, that the pill does have an effect on systolic blood pressure, meaning the before and after values are not equal.
Table 4.
1 120 90 30 900
2 120 115 5 25
3 160 120 40 1600
4 98 105 -7 49
5 115 90 25 625
6 110 90 20 400
7 180 180 0 0
8 190 100 90 8100
9 138 100 38 1444
10 128 110 18 324
SUM 1359 1100
MEAN 135.9 110
Step 6. Decision: Since tc = 2.9887 > t0.05 (10-1) = 1.833, reject the H0
Step 7. Conclusion: Based on the result, it can be concluded that the pill does have an
effect on systolic blood pressure, meaning the before and after values are not equal.
5. Trace metals in drinking water affect the flavor and an unusually high concentration
can pose a health hazard. Ten pairs of data were taken measuring zinc concentration
in bottom water and surface water. Does the data suggest that the true average
concentration in the bottom water exceeds that of surface water?
Solution:
Step 1. H0: µ1 = µ2, that the average concentration of zinc in bottom water is equal in surface water.
Ha: µ1 ≥ µ2, that the average concentration of zinc in bottom water greater than in surface water.
Table 5.
Step 7. Conclusion: Based on the result, it can be concluded that the that the average
concentration of zinc in bottom water greater than in surface water.
NAME: ANNALYN M. AKMAD Program: MAEd – EM
EXERCISE 5
T-TEST FOR INDEPENDENT SAMPLES
1. Do left-handed persons type slower than right- handed persons? The
mean time to type an article for a random sample of 8 left- handed
persons is 133.8 min with standard deviation of 14.7 min. While for a
random sample of 11 right- handed persons, the mean time is 136.2
min with standard deviation equal to 15.0 min. Test the hypothesis
at 5% level of significance.
Left-Handed Persons Right-Handed
Persons
SAMPLE SIZE 8 11
MEAN 133.8 136.2
STANDARD 14.7 15.0
DEVIATION
VARIANCE
2. To find out whether a new serum will arrest leukemia, 9 mice which
have all reached an advanced stage of the disease, are selected.
Four mice received the treatment and five do not. The survival times
in months, from the time the experiment commenced are as follows:
TREATED NOT TREATED
5.9 3.1
4.5 4.8
5.8 1.4
4.8 0.5
Test the hypothesis at 1% level of significance.
Solution:
2 2
Ha : σ 1 ≠ σ 2
larger variance
Fc =
smaller variance
3.193
Fc = = 6. 4289
0.496667
From the result of the comparison of the variances of the new serum treatment for the four
mice (with treatment) and five (without treatment), the following is undertaken to the solve the
main problem:
Step 1.
H0: µ1=µ2 That the new serum does not significantly arrest leukemia.
Ha: µ1 ≠ µ2 That the new serum does significantly arrest leukemia.
Step 5. Computation:
Not
Treated Treated
Mean 5.25 2.14
Variance 0.496667 3.193
Observations 4 5
Hypothesized Mean Difference 0
df 5
t Stat 3.560939
P(T<=t) one-tail 0.008099
t Critical one-tail 3.36493
P(T<=t) two-tail 0.016198
t Critical two-tail 4.032143
Note: Solution/Computation on the next page.
Step 7. Conclusion: The data indicate that the new serum does significantly arrest
leukemia.
3. In a study designed to monitor the movement of ammonia ions in the
soil, radioactive nitrogen ¿ ¿) was introduced into soil cores. Uptake of
¿ ¿) into microbial biomass in the soil was recorded at the end of 7
days. The soil examined were either bare or planted to onions. The
results of the uptake experiment are given below:
SOIL ¿ ¿) (picograms)
With Plant 18 15 12 8 15
Cover
Without 22 24 20 28 28 25
Plant Cover
Determine if plant cover significantly reduced uptake of labelled
nitrogen by microorganisms in the soil.
Solution:
2 2
Ha : σ 1 ≠ σ 2
larger variance
Fc =
smaller variance
SOIL
14.3
Fc = = 1.3883
10.3
f. Conclusion: H0: σ 21 ≠ σ 22
From the result of the comparison of the variances of the plant with cover and without cover,
the following is undertaken to the solve the main problem:
Step 1.
H0: µ1=µ2 That the movement of ammonia ions in the soil with plant
cover is equal to the ammonia ions in the soil of without plant cover.
Ha: µ1 ≠ µ2 That the movement of ammonia ions in the soil with plant
cover is lower to the ammonia ions in the soil of without plant cover.
Step 7. Conclusion: That the movement of ammonia ions in the soil with plant
cover is lower to the ammonia ions in the soil of without plant cover.
GROUP YEAR
1 2 3 4 5 6 7 8
Active 80 85 85 88 84 85 83 83
Not Active 90 90 92 91 88 80 79 80
4. To determine whether active participation in extra- curricular
activities is detrimental to one’s grades, the following grade-point
averages were collected over a period of 8 years.
Assuming the population to be normally distributed, test at 5% level of
significance whether to participate actively in school extra- curricular
activities is detrimental to one’s grades.
Solution:
2 2
Ha : σ 1 ≠ σ 2
larger variance
Fc =
smaller variance
YEAR
ACTIVE NOT ACTIVE
80 90
85 90
85 92
88 91
84 88
85 80
83 79
83 80
Mean 84.125 86.25
Variance 5.267857 31.07143
31.07143
Fc = = 5.8983
5.267857
f. Conclusion: H0: σ 21 ≠ σ 22
Step 1.
Solution:
larger variance
Fc =
smaller variance
1 42.1 42.7
2 41.3 43.8
3 42.4 42.5
4 43.2 43.1
5 41.8 44.0
6 41.0 43.3
7 41.8 43.6
8 42.8 43.5
9 42.3 41.7
10 42.7 44.1
Mean 42.14 43.23
Variance 0.467111 0.5061
0.5061
Fc = = 1.0835
0.4671
Step 1.
H0: µ1=µ2 that the mean average that the new machine packs is equal
to the mean average of the machine currently used.
Ha: µ1 ≠ µ2 that the mean average that the new machine packs is greater
to the mean average of the machine currently used.
Step 7. Conclusion: The mean average that the new machine packs is equal to
the mean average of the machine currently used.
EXERCISE
ONE WAY AND TWO-WAY CLASSIFICATION ANOVA
y
1. To study the effectiveness of three kinds of packaging, a processor of
a breakfast food puts each kind into 5 different supermarkets. Use
the following data representing the number of sales of the breakfast
food between 9:00 AM and noon on a given day, to test the null
hypothesis that the packaging has no effect on sales at 5% level of
significance.
PACKAGING SUPERMARKET TOTAL MEAN
1 2 3 4 5
A 45 32 36 32 40
B 37 34 46 44 34
C 35 37 48 46 35
SOLUTION:
Grand Mean=
∑ AllObservations
15
where:
k is the number of groups (packaging types)
ni is the number of observation in each group (5 in this case)
Mean i is the mean of each packaging type
Grand Mean is the overall mean
where:
k is the number of groups (3)
N is the total number of observations (15)
If F is greater than the critical value, or if the p-value is less than α , reject the null
hypothesis, indicating that there is a significant effect of packaging on sales.
B 5 195 39 32
C 5 196 39.2 25.2
ANOVA
Source of Variation SS df MS F P-value F crit
Rows 215.4 4 53.85 16.07463 0.009888 6.388233
Columns 0.1 1 0.1 0.029851 0.871219 7.708647
Error 13.4 4 3.35
Total 228.9 9
Conclusion
Since the p-value (0.691) is greater than the significance level α =0.05 , we fail to
reject the null hypothesis. This indicates that there is no statistically significant effect
of packaging type on sales at the 5% level of significance.
3. Suppose the National Transportation Safety Board (NTSB) wants to examine the
safety of compact cars, midsize cars, and full- size cars. It collects a sample of three
for each of the treatments (cars types). Using the data provided below, test whether
the mean pressure applied to the driver’s head during a crash test is equal for each
type of cars. Use 5% level of significance for testing the hypothesis.
CARS
COMPACT MIDSIZE FULL SIZE
640 465 480
650 422 450 TOTAL
700 520 400
SOLUTION:
Steps to solve:
1. State the hypotheses:
Null Hypotheses ( H 0): The mean pressure is the same for all car types ¿
Alternative Hypotheses ( H 1): At least one car type has a mean pressure
different from the others.
SSB
Mean Square Between (MSB):
df between
SSW
Mean Square Within (MSW):
df within
SUMMARY
Groups Count Sum Average Variance
COMPACT 3 1990 663.3333 1033.333
MIDSIZE 3 1407 469 2413
FULL SIZE 3 1330 443.3333 1633.333
ANOVA
Source of Variation SS df MS F P-value F crit
Between Groups 86824.22 2 43412.11 25.63876 0.001149 5.143253
Within Groups 10159.33 6 1693.222
Total 96983.56 8
F-Statistic: 25.64
Step 4: Conclusion
At a 5% significance level, we need to compare the calculated F-statistic to the critical
value for df between=2 and df within=6 . The critical F-value for α =0.05 is approximately
5.14.
Since the calculated F-statistic (25.64) is much greater than the critical value (5.14),
we reject the null hypothesis.
4. The following data represent the final grades obtained by five students in
Mathematics, English, Values Education, and Biology:
STUDENT SUBJECT
Math English Values Biology
1 80 78 85 79
2 86 90 93 91
3 75 78 83 79
4 78 80 84 76
5 83 87 85 87
Use 0.05 level of significance to test the hypotheses that
a. Students have equal ability.
b. Courses are of equal difficulty level.
To solve this problem, we need to perform a two-way ANOVA test. We will test two
hypotheses:
1. Students have equal ability (row effect).
2. Courses are of equal difficulty level (column effect).
Step-by-Step Solution
Step 1: Set up the Hypotheses
For each hypothesis:
For students (rows):
o H 0: All students have the same average ability.
o H 1: At least one student has a different average ability.
For courses (columns):
o H 0: All courses are of equal difficulty level.
o H 1: At least one course has a different difficulty level.
Mean Squares:
MSR (Rows): 91.70
MSC (Columns): 26.98
MSE (Error): 4.40
F-Values:
F-value for Rows (Students): 20.84
F-value for Columns (Subjects): 6.13
Critical F-Values at α =0.05 :
F-critical for Rows: 3.26
F-critical for Columns: 3.49
Interpretation
1. Students’ ability (Rows):
F row=20.84> Fcritical , row=3.26
Conclusion: Reject H 0 for courses. This means that there is a significant
difference in difficulty levels among the courses.
Su Varianc
SUMMARY Count m Average e
80.6666 14.3333
80 3 242 7 3
91.3333 2.33333
86 3 274 3 3
75 3 240 80 7
78 3 240 80 16
86.3333 1.33333
83 3 259 3 3
ANOVA
Source of
Variation SS df MS F P-value F crit
305.333 76.3333 14.8701 0.00089 3.83785
Rows 3 4 3 3 4 3
40.9333 20.4666 3.98701 0.06290
Columns 3 2 7 3 7 4.45897
41.0666 5.13333
Error 7 8 3
387.333
Total 3 14
Conclusion: Both hypotheses are rejected. Therefore, students do not have equal
ability.
5. A researcher examined the effect of maintaining the water table at three
different heights on seminal root length of three cereals. Evaluate at 5% level
whether a) water table and b) type of cereal significantly affect root length.
SOLUTION:
To evaluate whether the water table and type of cereal significantly affect root length
at the 5% significance level, we need to perform a two-way ANOVA. Here, the factors
are:
Step-by-Step Solution
Step 1: Step up the Hypotheses
For each factor:
For water table levels:
o H 0: Water table levels has no effect on root length.
o H 1: At least one water table level has a different effect on root length.
SSE
MSE=
df error
where df error=(number of rows−1)( number of columns−1)
The critical F-value at α =0.05 for both water table levels and cereal type is
approximately 6.94.
Decision
1. For Water Table Levels: F column=3.86 < F critical=6.94 We fail to reject H 0. This
means that water table levels do not significantly affect root length at the 5%
significance level.
2. For Cereal Types: F row=0.33< F critical=6.94 We fail to reject the H 0. This means
that cereal type does not significantly affect root length at the 5% significant
level.
Anova: Two-Factor Without
Replication
Su Varianc
SUMMARY Count m Average e
10.3 2 44 22 3.92
9.6 2 19 9.5 0.5
13.
10.8 2 1 6.55 0.245
35. 11.9333 58.2933
WHEAT 3 8 3 3
40. 13.4333 76.9033
OAT 3 3 3 3
ANOVA
Source of Variation SS df MS F P-value F crit
269.103 134.551 208.607 0.00477
Rows 3 2 7 2 1 19
5.23255 0.14942 18.5128
Columns 3.375 1 3.375 8 8 2
Error 1.29 2 0.645
273.768
Total 3 5
Conclusion
At the 5% significance level, neither the water table level nor the type of cereal
significantly affects root length.
4. The following are the data for 12 individuals’ daily sodium intake and their
systolic blood pressure readings.
SOLUTION:
r=
∑ (X −X )(Y −Y )
√∑ ¿¿¿¿
where X represents sodium intake, and Y represents BP.
2. Hypothesis test for H 0 : ρ=0 (at the 5% significance level): After calculating r ,
we can test if there is a significant linear relationship between sodium intake
and BP. The null hypothesis H 0 states that there is no linear relationship ( ρ=0
). If the p-value is less than 0.05, we reject H 0.
4. Calculate the regression equation: We need to find the regression line in the
form:
Y^ =a+bX
where a is the intercept and b is the slope. The slope b can be
calculated by:
b=
∑ (X −X )(Y −Y )
∑ ¿¿¿
and the intercept a is:
a=Y −b X
5. Test H 0 : β=0 (at the 1% level of significance): This tests whether the slope b
is significantly different from 0, indicating that sodium intake significantly
predicts BP. If the p-value is less than 0.01, we reject H 0.
6. Predict BP for a sodium intake of 6.3: Using the regression equation, substitute
X =6.3 to estimate the likely BP.
4. Regression equation:
The regression equation is:
Y^ =−53.75+32.10 · X
where Y^ is the predicted blood pressure, and X is the sodium intake.
These results suggest a significant positive linear relationship between sodium intake
and blood pressure, and we can use sodium intake to predict blood pressure with
reasonable accuracy.
Temperature (C 0 ¿ MilkShake Sales (Peso)
(X) (Y)
14.2 215
16.4 325
11.9 180
15.2 300
18.5 400
22.1 500
19.4 400
25.1 610
23.4 520
18.1 400
22.6 430
17.2 410
5. A local shop of milk shakes keeps a track of the amount of milkshakes they
sell in accordance to the temperature on that day. Below are the figures of their
sale and temperature for the last 12 days. Comment on the relationship.
3. Interpret the results: We can use the correlation coefficient to assess the
strength of the relationship, and the regression equation will allow us to predict
milkshake sales given a specific temperature.
Here are the results for the relationship between temperature and milkshake sales.
1. Pearson correlation coefficient (r):
r =0.957, which indicates a very strong positive correlation between
temperature and milkshake sales. As temperature increases, milkshake sales
also tend to increase.
3. Regression equation:
The regression equation is:
Y^ =−157.68+29.37 · X