How To Perform T-Test in Pandas

The document discusses three types of t-tests that can be performed using pandas: independent two sample t-test, Welch's two sample t-test, and paired sample t-test. It provides examples of how to set up sample data in a pandas DataFrame and use functions from the SciPy library to conduct each t-test. The examples test whether two studying methods lead to different exam score means. The t-test output includes the test statistic and p-value, allowing a determination of whether the null hypothesis can be rejected.

Uploaded by

Garuma Abdisa

How to Perform t-Tests in Pandas (3 Examples)

The following examples show how to perform three different t-tests using a pandas DataFrame:

• Independent Two Sample t-Test
• Welch’s Two Sample t-Test
• Paired Samples t-Test

Example 1: Independent Two Sample t-Test in Pandas

An independent two sample t-test is used to determine if two population means are equal.

For example, suppose a professor wants to know if two different studying methods lead to different mean exam scores.

To test this, he recruits 10 students to use method A and 10 students to use method B.

The following code shows how to enter the scores of each student in a pandas DataFrame and then use the ttest_ind() function from the SciPy library to perform an independent two sample t-test:

import pandas as pd
from scipy.stats import ttest_ind

#create pandas DataFrame
df = pd.DataFrame({'method': ['A', 'A', 'A', 'A', 'A', 'A', 'A', 'A', 'A', 'A',
                              'B', 'B', 'B', 'B', 'B', 'B', 'B', 'B', 'B', 'B'],
                   'score': [71, 72, 72, 75, 78, 81, 82, 83, 89, 91,
                             80, 81, 81, 84, 88, 88, 89, 90, 90, 91]})

#view first five rows of DataFrame

df.head()

method score
0 A 71
1 A 72
2 A 72
3 A 75
4 A 78

#define samples
group1 = df[df['method']=='A']
group2 = df[df['method']=='B']

#perform independent two sample t-test

ttest_ind(group1['score'], group2['score'])

Ttest_indResult(statistic=-2.6034304605397938, pvalue=0.017969284594810425)

From the output we can see:

• t-test statistic: -2.6034
• p-value: 0.0180

Since the p-value is less than .05, we reject the null hypothesis of the t-test (that the two population means are equal) and conclude that there is sufficient evidence to say that the two methods lead to different mean exam scores.
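
To see where the test statistic comes from, the calculation can be reproduced by hand. The following is a minimal sketch using only the Python standard library, assuming the usual pooled-variance form of the independent two sample t-test; the variable names are illustrative and not part of the original example.

```python
import math
import statistics

# Exam scores for each method, copied from the DataFrame in Example 1
a = [71, 72, 72, 75, 78, 81, 82, 83, 89, 91]
b = [80, 81, 81, 84, 88, 88, 89, 90, 90, 91]

n_a, n_b = len(a), len(b)
mean_a, mean_b = statistics.mean(a), statistics.mean(b)

# Sample variances (n - 1 in the denominator)
var_a, var_b = statistics.variance(a), statistics.variance(b)

# Pooled variance: both groups are assumed to share one population variance
pooled_var = ((n_a - 1) * var_a + (n_b - 1) * var_b) / (n_a + n_b - 2)

# Standard error of the difference in means, then the t statistic
std_err = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
t_stat = (mean_a - mean_b) / std_err
dof = n_a + n_b - 2

print(round(t_stat, 4), dof)  # -2.6034 18
```

The statistic matches the value returned by ttest_ind(). Turning it into a p-value additionally requires the t distribution with 18 degrees of freedom, which is the part SciPy handles.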

Example 2: Welch’s t-Test in Pandas

Welch’s t-test is similar to the independent two sample t-test, except it does not assume that the two populations that the samples came from have equal variance.
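
As an informal look at the equal-variance assumption, one could compare the sample variances of the two groups. This is only a rough sketch, not a formal test of equal variances, and it rebuilds the DataFrame from Example 1 in a shorter form so that it runs on its own.

```python
import pandas as pd

# Same scores as Example 1, with the method labels built by repetition
df = pd.DataFrame({'method': ['A'] * 10 + ['B'] * 10,
                   'score': [71, 72, 72, 75, 78, 81, 82, 83, 89, 91,
                             80, 81, 81, 84, 88, 88, 89, 90, 90, 91]})

# Sample variance of the scores within each method (ddof=1 by default)
variances = df.groupby('method')['score'].var()
print(variances)
```

Here the sample variance for method A is noticeably larger than for method B, which is the kind of situation where Welch’s version may be the more cautious choice.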
To perform Welch’s t-test on the exact same dataset as the previous example, we simply need to specify equal_var=False within the ttest_ind() function as follows:

import pandas as pd
from scipy.stats import ttest_ind

#create pandas DataFrame (df is the same DataFrame defined in Example 1)

#define samples
group1 = df[df['method']=='A']
group2 = df[df['method']=='B']

#perform Welch's t-test

ttest_ind(group1['score'], group2['score'], equal_var=False)

Ttest_indResult(statistic=-2.603430460539794, pvalue=0.02014688617423973)

From the output we can see:

• t-test statistic: -2.6034
• p-value: 0.0201

Since the p-value is less than .05, we reject the null hypothesis of Welch’s t-test and conclude that there is sufficient evidence to say that the two methods lead to different mean exam scores.
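
The test statistic is the same as in Example 1 while the p-value is slightly larger. A hand calculation suggests why: with equal group sizes the unpooled standard error equals the pooled one, but Welch’s test uses fewer degrees of freedom. The sketch below assumes the standard Welch-Satterthwaite approximation and uses only the standard library; the variable names are illustrative.

```python
import math
import statistics

# Exam scores for each method, copied from Example 1
a = [71, 72, 72, 75, 78, 81, 82, 83, 89, 91]
b = [80, 81, 81, 84, 88, 88, 89, 90, 90, 91]

n_a, n_b = len(a), len(b)

# Squared standard error of each group mean, kept separate (no pooling)
se2_a = statistics.variance(a) / n_a
se2_b = statistics.variance(b) / n_b

# Welch's t statistic
t_stat = (statistics.mean(a) - statistics.mean(b)) / math.sqrt(se2_a + se2_b)

# Welch-Satterthwaite approximation to the degrees of freedom
dof = (se2_a + se2_b) ** 2 / (se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1))

print(round(t_stat, 4), round(dof, 2))
```

The approximate degrees of freedom come out below the 18 used by the pooled test, so the same statistic maps to a somewhat larger p-value.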

Example 3: Paired Samples t-Test in Pandas

A paired samples t-test is used to determine whether two population means are equal when each observation in one sample can be paired with an observation in the other sample.

For example, suppose a professor wants to know if two different studying methods lead to different mean exam scores.

To test this, he recruits 10 students to use method A and then take a test. Then, he lets the same 10 students use method B to prepare for and take another test of similar difficulty.

Since all of the students appear in both samples, we can perform a paired samples t-test in this scenario.

The following code shows how to enter the scores of each student in a pandas DataFrame and then use the ttest_rel() function from the SciPy library to perform a paired samples t-test:

import pandas as pd
from scipy.stats import ttest_rel

#create pandas DataFrame (df is the same DataFrame defined in Example 1)

#view first five rows of DataFrame

df.head()

method score
0 A 71
1 A 72
2 A 72
3 A 75
4 A 78

#define samples (row order within each group must match the student pairing)
group1 = df[df['method']=='A']
group2 = df[df['method']=='B']

#perform paired samples t-test

ttest_rel(group1['score'], group2['score'])

Ttest_relResult(statistic=-6.162045351967805, pvalue=0.0001662872100210469)

From the output we can see:

• t-test statistic: -6.1620
• p-value: 0.0002

Since the p-value is less than .05, we reject the null hypothesis of the paired samples t-test and conclude that there is sufficient evidence to say that the two methods lead to different mean exam scores.
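
Because the paired test depends on which scores belong to the same student, it can help to make the pairing explicit rather than rely on row order. The sketch below adds a hypothetical student column (not present in the original data, and assuming the i-th score of each method belongs to student i), reshapes the data to one row per student, and shows that the paired test gives the same result as a one-sample t-test on the score differences.

```python
import pandas as pd
from scipy.stats import ttest_rel, ttest_1samp

# 'student' is a hypothetical id column added for illustration only:
# it assumes the i-th score under each method belongs to student i
df = pd.DataFrame({'student': list(range(1, 11)) * 2,
                   'method': ['A'] * 10 + ['B'] * 10,
                   'score': [71, 72, 72, 75, 78, 81, 82, 83, 89, 91,
                             80, 81, 81, 84, 88, 88, 89, 90, 90, 91]})

# One row per student, one column per method, so pairs line up by index
wide = df.pivot(index='student', columns='method', values='score')

# Paired samples t-test on the aligned columns
paired = ttest_rel(wide['A'], wide['B'])

# Equivalent view: one-sample t-test of the differences against zero
diff = wide['A'] - wide['B']
one_sample = ttest_1samp(diff, popmean=0)

print(paired.statistic, paired.pvalue)
print(one_sample.statistic, one_sample.pvalue)
```

Both calls report the same statistic and p-value, which reflects that a paired samples t-test is a test on the mean of the within-pair differences.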
