ANOVA and T-Test Analysis Guide

This document discusses and provides examples of performing one-way ANOVA and t-tests in Python using libraries like scipy, statsmodels, and pingouin. It shows how to conduct one-way ANOVA on different groups of performance data and explore differences between groups. It also demonstrates three methods of performing two-sample t-tests to compare two groups of data and determine if their means are statistically different.

Uploaded by

Garuma Abdisa

One-way ANOVA:

# Import the library
from scipy.stats import f_oneway

# Performance measured when each engine oil is applied
performance1 = [89, 89, 88, 78, 79]
performance2 = [93, 92, 94, 89, 88]
performance3 = [89, 88, 89, 93, 90]
performance4 = [81, 78, 81, 92, 82]

# Conduct the one-way ANOVA
print(f_oneway(performance1, performance2, performance3, performance4))

Output: F_onewayResult(statistic=4.625000000000002, pvalue=0.016336459839780215)
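The F statistic reported above can be reproduced by hand, which may help show what f_oneway computes. The following is a minimal sketch using the same four performance lists; the variable names (ss_between, ss_within and so on) are illustrative and do not appear in the original.

```python
import numpy as np
from scipy.stats import f, f_oneway

# Same four groups as in the example above
groups = [
    np.array([89, 89, 88, 78, 79]),
    np.array([93, 92, 94, 89, 88]),
    np.array([89, 88, 89, 93, 90]),
    np.array([81, 78, 81, 92, 82]),
]

k = len(groups)                          # number of groups
n_total = sum(len(g) for g in groups)    # total number of observations
grand_mean = np.concatenate(groups).mean()

# Between-group and within-group sums of squares
ss_between = sum(len(g) * (g.mean() - grand_mean) ** 2 for g in groups)
ss_within = sum(((g - g.mean()) ** 2).sum() for g in groups)

df_between = k - 1
df_within = n_total - k

# F is the ratio of the two mean squares
f_manual = (ss_between / df_between) / (ss_within / df_within)
# Upper-tail probability of the F distribution gives the p-value
p_manual = f.sf(f_manual, df_between, df_within)

# Cross-check against scipy
f_scipy, p_scipy = f_oneway(*groups)
print(f_manual, p_manual)
print(f_scipy, p_scipy)
```

Since the p-value (about 0.016) is below 0.05, the null hypothesis of equal mean performance across the four engine oils would be rejected at the 5% level.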

###############################################

import pandas as pd

# Load the data file
df = pd.read_excel("C:/Users/user/Documents/sampanova.xlsx")

# Reshape the dataframe into long format, as expected by the statsmodels package
df_melt = pd.melt(df.reset_index(), id_vars=['index'], value_vars=['A', 'B', 'C', 'D'])

# Rename the columns
df_melt.columns = ['index', 'treatments', 'value']

# Generate a boxplot to see the data distribution by treatment. A boxplot makes
# differences between treatments easy to spot.
import matplotlib.pyplot as plt
import seaborn as sns

ax = sns.boxplot(x='treatments', y='value', data=df_melt, color='#99c2a2')
ax = sns.swarmplot(x="treatments", y="value", data=df_melt, color='#7d0013')
plt.show()

import scipy.stats as stats

# stats.f_oneway takes the groups as input and returns the ANOVA F statistic and p-value
fvalue, pvalue = stats.f_oneway(df['A'], df['B'], df['C'], df['D'])
print(fvalue, pvalue)
# 17.492810457516338 2.639241146210922e-05

# Get the ANOVA table as R-like output
import statsmodels.api as sm
from statsmodels.formula.api import ols

# Ordinary Least Squares (OLS) model
model = ols('value ~ C(treatments)', data=df_melt).fit()
anova_table = sm.stats.anova_lm(model, typ=2)
print(anova_table)

#######################

# Install (run these commands in a shell, not in the Python interpreter)
pip install bioinfokit

# Upgrade to the latest version
pip install bioinfokit --upgrade

# Uninstall
pip uninstall bioinfokit

################################

T-test:

import scipy.stats as stats
import numpy as np

# Create the data groups
data_group1 = np.array([14, 15, 15, 16, 13, 8, 14,
                        17, 16, 14, 19, 20, 21, 15,
                        15, 16, 16, 13, 14, 12])
data_group2 = np.array([15, 17, 14, 17, 14, 8, 12,
                        19, 19, 14, 17, 22, 24, 16,
                        13, 16, 13, 18, 15, 13])

# Print the variance of both data groups
# (np.var computes the population variance, ddof=0, by default)
print(np.var(data_group1), np.var(data_group2))

Output: 7.727500000000001 12.260000000000002
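The variances are presumably printed to judge whether the equal-variance form of the t-test is reasonable. The sketch below shows two ways this check might be made explicit. The 4:1 ratio threshold is a common rule of thumb and an assumption here, not something stated in the original; scipy.stats.levene is a more formal alternative.

```python
import numpy as np
from scipy import stats

data_group1 = np.array([14, 15, 15, 16, 13, 8, 14,
                        17, 16, 14, 19, 20, 21, 15,
                        15, 16, 16, 13, 14, 12])
data_group2 = np.array([15, 17, 14, 17, 14, 8, 12,
                        19, 19, 14, 17, 22, 24, 16,
                        13, 16, 13, 18, 15, 13])

# Sample variances (ddof=1), the quantities the t-test itself uses
var1 = np.var(data_group1, ddof=1)
var2 = np.var(data_group2, ddof=1)

# Rule of thumb (assumption): treat variances as similar if the ratio is below 4
ratio = max(var1, var2) / min(var1, var2)
print(var1, var2, ratio)

# Levene's test: the null hypothesis is that the groups have equal variances
levene_stat, levene_p = stats.levene(data_group1, data_group2)
print(levene_stat, levene_p)
```

Because both groups have the same size, the ratio is the same whether population or sample variances are used.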

1. Performing Two-Sample T-Test

Method 1

# Python program to demonstrate how to
# perform a two-sample t-test with scipy

# Import the libraries
import scipy.stats as stats
import numpy as np

# Create the data groups
data_group1 = np.array([14, 15, 15, 16, 13, 8, 14,
                        17, 16, 14, 19, 20, 21, 15,
                        15, 16, 16, 13, 14, 12])
data_group2 = np.array([15, 17, 14, 17, 14, 8, 12,
                        19, 19, 14, 17, 22, 24, 16,
                        13, 16, 13, 18, 15, 13])

# Perform the two-sample t-test assuming equal variances
print(stats.ttest_ind(a=data_group1, b=data_group2, equal_var=True))

Output: Ttest_indResult(statistic=-0.6337397070250238, pvalue=0.5300471010405257)
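To see where the statistic above comes from, the pooled-variance t-test can be computed step by step. This is a minimal sketch on the same two groups; names such as pooled_var and t_manual are illustrative only.

```python
import numpy as np
from scipy import stats

data_group1 = np.array([14, 15, 15, 16, 13, 8, 14,
                        17, 16, 14, 19, 20, 21, 15,
                        15, 16, 16, 13, 14, 12])
data_group2 = np.array([15, 17, 14, 17, 14, 8, 12,
                        19, 19, 14, 17, 22, 24, 16,
                        13, 16, 13, 18, 15, 13])

n1, n2 = len(data_group1), len(data_group2)
s1_sq = np.var(data_group1, ddof=1)   # sample variance of group 1
s2_sq = np.var(data_group2, ddof=1)   # sample variance of group 2

# Pooled variance: weighted average of the two sample variances
pooled_var = ((n1 - 1) * s1_sq + (n2 - 1) * s2_sq) / (n1 + n2 - 2)

# Standard error of the difference in means
se = np.sqrt(pooled_var * (1 / n1 + 1 / n2))

t_manual = (data_group1.mean() - data_group2.mean()) / se
dof = n1 + n2 - 2

# Two-sided p-value from the t distribution
p_manual = 2 * stats.t.sf(abs(t_manual), dof)
print(t_manual, p_manual, dof)
```

The p-value of about 0.53 is well above 0.05, so this test gives no evidence that the two group means differ.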

Method 2

# Python program to conduct a two-sample
# t-test using the pingouin library

# Import the libraries
import numpy as np
import pingouin as pg

# Create the data groups
data_group1 = np.array([160, 150, 160, 156.12, 163.24,
                        160.56, 168.56, 174.12,
                        167.123, 165.12])
data_group2 = np.array([157.97, 146, 140.2, 170.15,
                        167.34, 176.123, 162.35, 159.123,
                        169.43, 148.123])

# Conduct the two-sample t-test (correction=True applies Welch's correction)
result = pg.ttest(data_group1,
                  data_group2,
                  correction=True)

# Print the result
print(result)

Output:
               T        dof alternative  ...   cohen-d   BF10     power
T-test  0.653148  14.389477   two-sided  ...  0.292097  0.462  0.094912
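The fractional degrees of freedom in the pingouin output (about 14.39) come from Welch's correction for unequal variances. As a rough cross-check, the same test can be sketched with scipy by passing equal_var=False, and the Welch-Satterthwaite degrees of freedom can be computed directly. The variable names below are illustrative.

```python
import numpy as np
from scipy import stats

data_group1 = np.array([160, 150, 160, 156.12, 163.24,
                        160.56, 168.56, 174.12,
                        167.123, 165.12])
data_group2 = np.array([157.97, 146, 140.2, 170.15,
                        167.34, 176.123, 162.35, 159.123,
                        169.43, 148.123])

# Welch's t-test: does not assume equal variances
t_welch, p_welch = stats.ttest_ind(data_group1, data_group2, equal_var=False)

# Welch-Satterthwaite approximation for the degrees of freedom
n1, n2 = len(data_group1), len(data_group2)
v1 = np.var(data_group1, ddof=1) / n1   # squared standard error, group 1
v2 = np.var(data_group2, ddof=1) / n2   # squared standard error, group 2
dof_welch = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))

print(t_welch, p_welch, dof_welch)
```

Because both groups contain ten observations, the Welch statistic equals the pooled-variance statistic; only the degrees of freedom, and therefore the p-value, differ.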

Method 3

# Python program to conduct a two-sample
# t-test using the statsmodels library

from statsmodels.stats.weightstats import ttest_ind
import numpy as np

# Create the data groups
data_group1 = np.array([160, 150, 160, 156.12,
                        163.24,
                        160.56, 168.56, 174.12,
                        167.123, 165.12])
data_group2 = np.array([157.97, 146, 140.2, 170.15,
                        167.34, 176.123, 162.35,
                        159.123, 169.43, 148.123])

# Conduct the two-sample t-test (pooled variance by default)
print(ttest_ind(data_group1, data_group2))

Output: (0.6531479162158739, 0.5219170107019715, 18.0), i.e. t-statistic, p-value, degrees of freedom

linear regression

pip install sklearn-pandas==1.5.0

