Research Methodology
Research Methodology
As the term suggests, exploratory research design deals with exploring the phenomenon.
It is generally used in cases where the problem must be defined more precisely, and to
gain additional insights before an approach can be developed.
Example: Exploring the impact of social media on mental health among adolescents.
In-depth interview
Focus Groups bring together a small group of participants to engage in open and
structured discussions about a particular topic.
Literature Review
● Purpose: To identify existing theories, concepts, and gaps in the literature related
to a research topic
● Methods: Researchers review academic papers, books, articles, and other
scholarly sources. They synthesize and analyze the findings and theories
presented in these sources.
● Benefits: A literature review provides a solid foundation for understanding the
historical context and key debates surrounding a topic. It helps researchers
identify areas where further investigation is needed.
Content Analysis
Content Analysis is a method for analyzing textual, visual, or audio content to uncover
patterns, themes, or
trends.
Case Studies
● Objective:
○ Descriptive research: The primary objective of descriptive research is to describe
and summarize characteristics, behaviors, or phenomena.
○ Causal research: The main objective of causal research is to establish
cause-and-effect relationships between variables.
● Research Question:
○ Descriptive research: Descriptive research answers questions about "what" and
"how" by describing and documenting the characteristics, behaviors, or conditions
of a particular group or situation.
○ Causal research: Causal research addresses questions about "why" and "how" by
investigating the causes and effects between variables.
● Data Collection and Analysis:
○ Descriptive research: Descriptive research collects data through various methods,
such as surveys, observations, or existing records. The data are then analyzed
using descriptive statistics, such as measures of central tendency and dispersion,
to summarize and present the findings.
○ Causal research: Causal research often requires more rigorous study designs, such
as experiments or quasi-experiments, to establish cause-and-effect relationships.
Statistical analyses, such as regression analysis or hypothesis testing, are used to
determine the strength and significance of the causal relationship.
4. Time Orientation:
○ Descriptive research: Descriptive research focuses on the present or a specific
time period, providing a snapshot of the current situation or characteristics.
○ Causal research: Causal research investigates the relationship between variables
over time, seeking to establish the temporal sequence of cause and effect.
5. Generalizability:
○ Descriptive research: Descriptive research aims to provide a comprehensive
description of a particular group or situation.
○ Causal research: Causal research seeks to establish cause-and-effect relationships
that can be generalized to broader populations or situations.
In research methodology, variables are essential components that researchers manipulate and
measure to understand phenomena, test hypotheses, and draw conclusions. Here are the main
types of variables and their uses:
Use:Indicates the outcome of the experiment or study. In the previous example, the exam
scores are the dependent variable.
● Control Variable
Definition: Any variable other than the independent variable that may affect the
dependent variable.
Use: Researchers try to control or randomize these variables to minimize their impact.
For example, prior knowledge of the subject can be an extraneous variable in the study of
study time and exam scores.
● Confounding Variable
Definition: A type of extraneous variable that changes systematically along with the
independent variable and affects the dependent variable, leading to erroneous
conclusions.
Use: Researchers aim to identify and control these variables. For example, in the study
time and exam scores study, if students who study more also have better prior knowledge,
prior knowledge becomes a confounding variable.
● Moderator Variable
Definition:A variable that influences the strength or direction of the relationship between
the independent and dependent variables.
Use:Helps understand under what conditions or for whom an effect exists. For example,
the effect of study time on exam scores might be moderated by the student's level of
motivation.
● Mediator Variable
Definition: A variable that explains the process through which the independent variable
affects the dependent variable.
Use: Provides insight into the mechanism or pathway of the effect. For instance,
increased study time (IV) might improve study habits (mediator), which in turn improves
exam scores (DV).
● Latent Variable
Definition:Variables that are not directly observed but are inferred from other variables
that are measured (often found in psychological and social sciences).
Use: Helps in understanding underlying constructs that influence observed variables. For
example, intelligence is a latent variable that might be inferred from performance on
various cognitive tasks.
● Intervening Variable
Use: Helps in detailing the step-by-step process leading from the cause to the effect. For
instance, educational programs (IV) might lead to better learning strategies (intervening
variable), resulting in improved academic performance (DV).
● Demographic Variable
Use:Used to describe and control for the effects of population characteristics in a study.
For example, understanding the demographic distribution of study participants can help
generalize findings.
Understanding these types of variables and their appropriate uses is crucial for designing robust
research studies, analyzing data accurately, and drawing valid conclusions.
Here are some key reasons why literature reviews are essential in research:
● Title: The title should be concise, clear, and accurately reflect the content of the research
report. It should provide a glimpse into the research topic and capture the reader's
attention.
● Abstract: The abstract provides a brief summary of the research report, highlighting the
research objectives, methodology, key findings, and conclusions.
● Introduction: The introduction section introduces the research topic, provides
background information, and presents the research problem or objectives.
● Literature Review: The literature review critically examines the existing body of
knowledge and research relevant to the research topic.
● Methodology: The methodology section describes the research design, data collection
methods, sample selection, and data analysis techniques.
● Results: The results section presents the findings of the research in a clear and organized
manner. It may include tables, charts, graphs, or other visual aids to facilitate data
presentation.
● Discussion: The discussion section interprets the results, relates them to the research
objectives and literature review, and offers explanations for the findings.
● Conclusion: The conclusion section provides a concise summary of the main findings
and their implications.
● References: The references section lists all the sources cited in the research report.
● Appendices: Appendices include supplementary materials that support the research
report but are not essential for understanding the main findings. This may include
questionnaires, raw data, detailed descriptions of statistical analyses, or other relevant
documents.
● Identify the Research Problem: The first step is to identify and define the research
problem or topic.
● Review the Literature: Conduct a comprehensive literature review to gather existing
knowledge and research on the chosen topic.
● Formulate a Research Design: Based on the research questions and objectives,
determine the appropriate research design and methodology.
● Collect Data: Implement the chosen data collection methods, which may include
surveys, interviews, observations, experiments, or analysis of existing data
● Analyze Data: Once the data is collected, analyze it using appropriate statistical or
qualitative analysis techniques.
● Interpret and Draw Conclusions: Analyze the results of the data analysis, interpret the
findings, and relate them to the research objectives and literature review.
● Communicate the Findings: Prepare a research report that communicates the research
process, findings, and conclusions.
● Peer Review and Feedback: Submit the research report to peers, experts, or the intended
audience for feedback and review.
● Spread the Research: Share the research findings through conference presentations,
academic journals, or other appropriate channels.
● Reflect and Iterate: Reflect on the research process and outcomes, considering the
strengths, weaknesses, and limitations of the study.
In research methodology, data sources are typically classified into primary and secondary
sources. Each type has distinct characteristics and purposes within the research process. Here's
an overview of these categories and examples of each:
● Surveys and Questionnaires: Structured instruments for collecting data from a large
number of respondents.
● Focus Groups: Guided discussions with a group of people to gather opinions and
attitudes about a specific topic.
● Diaries and Journals: Personal records maintained by individuals over a period of time.
Secondary data sources consist of information that has been collected and compiled by others
and is available for use by researchers.
● Media Sources: Information from newspapers, magazines, television, and other media
outlets.
● Historical Records and Archives: Documents and records from past events, often
preserved in libraries or archives.
When selecting data sources, researchers consider factors such as the research question,
objectives, and the reliability and validity of the data. Primary data is often more specific to the
researcher's needs but can be time-consuming and expensive to collect. Secondary data is usually
more accessible and cost-effective but may not be as precisely aligned with the research
objectives.
By understanding and leveraging both primary and secondary data sources, researchers can
gather comprehensive and robust information to support their studies.
10. Significance of using the primary and secondary data and limitations.
Primary data:
● Relevance and Fit: Primary data collection enables researchers to gather data that
precisely aligns with their research objectives and specific requirements.
● Control and Quality: By collecting primary data, researchers have greater control over
the research process. This control enhances the overall quality and integrity of the
research findings.
● Uniqueness and Originality: Primary data provides researchers with the opportunity to
generate unique and original insights
● Customization and Flexibility: Primary data collection allows researchers to tailor their
approach to fit the specific research context and population of interest.
Despite the advantages, primary data collection also has limitations, which should be
considered:
● Time and Resource Intensive: Collecting primary data can be time-consuming and
require significant resources, including financial, human, and logistical support.
● Cost: Primary data collection can be expensive, especially when large sample sizes or
specialized data collection methods are required.
● Potential Bias: Researchers may introduce bias in primary data collection through the
design of the study, wording of questions, or their own presence during data collection.
● Limited Generalizability: Primary data collection often focuses on a specific sample or
population, making it challenging to generalize the findings to larger populations or
different contexts.
● Ethical Considerations: Collecting primary data requires researchers to attach to ethical
guidelines and protect the rights and privacy of participants.
Secondary data:
Secondary data refers to information that has been collected by someone else for a purpose other
than the current research project. It is data that already exists and is available in various sources
such as research studies, government reports, databases, books, articles, or online resources. The
significance of secondary data in research includes:
● Time and Cost Efficiency: Secondary data is readily available, which saves time and
resources compared to collecting primary data.
● Large and Diverse Sample: Secondary data often represents a larger and more diverse
sample compared to what a single research project can collect
● Longitudinal and Comparative Studies: Secondary data sources may contain data
collected over an extended period, making it possible to conduct longitudinal studies and
examine changes or trends over time.
● Validation and Replication: Secondary data provides an opportunity for validation and
replication of previous research findings.
● Exploration and Hypothesis Generation: Secondary data allows for exploratory
analysis and hypothesis generation. It serves as a foundation for further investigation.
Despite its advantages, secondary data also has limitations that researchers should consider:
● Data Quality: The quality and accuracy of secondary data are influenced by the
methods, instruments, and procedures used by the original data collectors.
● Data Availability and Relevance: Not all desired variables or measures may be
available in secondary data sources.
● Lack of Control: Since secondary data is collected by others for their specific purposes,
researchers have limited control over the data collection process.
● Bias and Confounding Factors: Secondary data may contain biases or confounding
factors that were not originally accounted for in the data collection process
● Data Interpretation Challenges: Interpreting secondary data requires a thorough
understanding of the context, data collection methods, and limitations of the original
study.
● Bar Chart: A bar chart represents data using rectangular bars of equal width but varying
heights. It is useful for comparing discrete categories or groups
● Line Chart: A line chart displays data points connected by lines. It is commonly used to
show trends or patterns over time.
● Pie Chart: A pie chart is a circular graph divided into sectors, representing different
proportions of a whole.
● Scatter Plot: A scatter plot uses dots or markers to represent individual data points on a
two-dimensional plane. It is used to visualize the relationship between two continuous
variables.
● Histogram: A histogram displays the distribution of continuous or interval data using
adjacent rectangles or bars. It shows the frequency or count of data falling within
specified intervals, also known as classes.
● Box Plot: A box plot, also known as a box-and-whisker plot, provides a summary of the
distribution of continuous or ordinal data. It displays the minimum, maximum, median,
quartiles, and outliers in a dataset.
● Heatmap: A heatmap uses colors to represent data values in a matrix or table format. It is
effective for visualizing patterns, correlations, or relationships between multiple
variables.
● Area Chart: An area chart is similar to a line chart but with the area below the line filled
in with color. It is commonly used to display cumulative data over time or to compare the
relative contribution of different variables to a whole data set.
Creating a questionnaire in research methodology involves several systematic steps to ensure the
tool is effective, reliable, and valid. Here’s a detailed guide to the creation of a questionnaire,
with an emphasis on best practices and practical tips:
● Question Formulation:
-Clarity: Use simple and unambiguous language.
-Relevance: Ensure each question aligns with your research objectives.
-Neutrality: Avoid leading or biased questions.
-Specificity: Make sure questions are specific and directly address the topic.
-Brevity: Keep questions concise to avoid respondent fatigue.
● Pilot Testing:
-Conduct a Pre-test: Test the questionnaire with a small group representative of your
target population.
-Feedback Collection: Gather feedback on question clarity, length, and overall
understanding.
-Revise: Make necessary changes based on the feedback to improve the questionnaire.
A good questionnaire is a critical tool in research methodology, and its design can significantly
influence the quality and reliability of the data collected. Here are the key characteristics that
make a questionnaire effective:
● Relevance:
Ensure all questions are directly related to the research objectives.
● Brevity:
Keep the questionnaire as short as possible while still collecting the necessary data.
Long questionnaires can lead to respondent fatigue and lower completion rates.
● Neutrality:
● Specificity:
Ask specific questions that can be clearly understood and answered accurately.
Avoid vague questions that may be interpreted in multiple ways.
Arrange questions in a logical sequence that flows naturally from one topic to the next.
Group similar questions together to maintain coherence and focus.
● Pilot Testing:
Pre-test the questionnaire with a small sample of the target population to identify any
issues.
Use the feedback to refine and improve the questionnaire before full deployment.
Introduction:
Demographic Questions:
Collect basic information (e.g., age, gender, education level) relevant to the research.
Main Content:
Conclusion:
By following these steps and characteristics, researchers can design effective questionnaires that
yield high-quality, reliable data for their studies.
Ans. Here's an example of a questionnaire to assess the service quality of a restaurant using
various types of measurement scales:
Section 2: Service Quality Assessment Please rate the following aspects of the restaurant's
service quality using the scales provided:
18. A teacher claims that the mean score of students in his class is greater than 82 with a
standard deviation of 20. If a sample of 81 students was selected with a mean score of 90
then check if there is enough evidence to support this claim at a 0.05 significance level.
19. An online medicine shop claims that the mean delivery time for medicines is less than
120 minutes with a standard deviation of 30 minutes. Is there enough evidence to support
this claim at a 0.05 significance level if 49 orders were examined with a mean of 100
minutes?
20. A gym trainer claimed that all the new boys in the gym are not equal to 100 kg.A
random sample of thirty boys weight have a mean score of 112.5 kg and the population
mean weight is 100 kg and the standard deviation is 15. Test is at 0.05 significance level.
21.From the data given below obtain the regression equation of X on Y:
X: 2 3 7 9 10
Y: 9 9 10 8 12
22. Properties of correlation coefficient.
● Range: The correlation coefficient, denoted by "r," ranges between -1 and +1. A
correlation coefficient of +1 indicates a perfect positive relationship, -1 indicates a
perfect negative relationship, and 0 indicates no relationship between the variables.
● Direction: The sign of the correlation coefficient indicates the direction of the
relationship between the variables. A positive correlation coefficient indicates that as one
variable increases, the other variable also tends to increase. A negative correlation
coefficient indicates an inverse relationship, where as one variable increases, the other
variable tends to decrease.
● Strength: A correlation coefficient closer to +1 or -1 indicates a stronger relationship,
while a correlation coefficient closer to 0 indicates a weaker relationship.
● Linearity: The correlation coefficient measures the linear relationship between variables.
It assumes that the relationship between the variables can be represented by a straight
line.
● Independence: The correlation coefficient measures the association between two
variables but does not imply causation.
The correlation coefficient and the coefficient of determination are both statistical measures used
to describe the relationship between variables. However, they have distinct purposes and
interpretations. Here’s a detailed comparison:
1. Definition:
- The correlation coefficient measures the strength and direction of the linear relationship
between two variables.
- It is denoted by r .
2. Range:
3. Interpretation:
4. Usage:
5. Calculation:
- It is denoted by R^2 .
2. Range:
- R^2 = 1 indicates that the regression model perfectly explains the variance in the
dependent variable.
- R^2 = 0 indicates that the model does not explain any of the variance.
3. Interpretation:
- R^2 represents the proportion of the variance in the dependent variable that is explained
by the independent variable(s).
- For example, R^2 = 0.64 means 64% of the variance in the dependent variable is
explained by the model.
4. Usage:
- Helps in understanding how well the model explains the variability of the outcome.
5. Calculation:
Key Differences
1. Nature:
- The correlation coefficient r measures the strength and direction of a linear relationship
between two variables.
- The coefficient of determination R^2 measures the proportion of the variance in the
dependent variable explained by the independent variable(s).
2. Interpretation of Values:
- r ranges from -1 to 1 and includes the direction (positive or negative).
- R^2 ranges from 0 to 1 and does not indicate direction.
3. Context of Use:
- r is used in correlation analysis.
- R^2 is used in the context of regression analysis to evaluate model fit.
4. Relationship:
Suppose you have data on the hours studied (independent variable) and exam scores (dependent
variable) for a group of students.
- If the correlation coefficient r is 0.8, this indicates a strong positive linear relationship between
hours studied and exam scores.
- If the coefficient of determination R^2 is 0.64, this means that 64% of the variance in exam
scores can be explained by the number of hours studied.
Conclusion
While both the correlation coefficient and the coefficient of determination are useful for
understanding relationships between variables, they serve different purposes and provide
different insights. The correlation coefficient is more about the strength and direction of a linear
relationship, whereas the coefficient of determination focuses on the explanatory power of the
model.
Skewness is a statistical measure that describes the asymmetry of a distribution around its mean.
It indicates the extent to which the distribution of data deviates from a symmetric bell curve
(normal distribution). Understanding skewness helps in identifying the nature of the distribution,
which can affect statistical analyses and interpretations.
● Characteristics:
- The left tail (smaller values) of the distribution is longer or fatter than the right
tail.
- Most of the data values are concentrated on the right side of the distribution.
- The mean is typically less than the median.
● Example: Age at retirement, where most people retire around a typical age but some
retire much earlier.
● Characteristics:
- The distribution is symmetric around the mean.
- The left and right tails are mirror images of each other.
- The mean, median, and mode are all equal.
● Example: Heights of adult males in a large population (approximately).
Practical Implications of Skewness
● Data Analysis:
- Skewness affects the mean and variance of the data.
- Many statistical tests assume normality; significant skewness can violate this
assumption and affect the validity of these tests.
● Decision Making:
- Understanding skewness helps in making better decisions based on the
distribution of data.
- For example, in financial risk management, right-skewed distributions might
indicate potential for higher returns but also higher risk.
● Transformation:
- Data transformation techniques (e.g., logarithmic, square root) can be applied to
reduce skewness and make the data more normally distributed, facilitating better
statistical analysis.
Conclusion
Skewness is a crucial aspect of descriptive statistics that provides insight into the shape and
nature of a data distribution. Recognizing and addressing skewness can enhance the accuracy and
reliability of data analysis and interpretation.
Stratified random sampling is a sampling technique that involves dividing the population into
homogeneous subgroups or strata and then selecting a random sample from each stratum. The
goal is to ensure that the sample represents the diversity and characteristics of the population
more accurately.
Stratified random sampling offers several advantages over simple random sampling:
● Increased Precision: Stratified random sampling can increase the precision and accuracy
of estimates compared to simple random sampling.
● Enhanced Representativeness: Stratified random sampling ensures that each subgroup or
stratum is represented in the sample proportionally to its size in the population.
● Efficient Resource Allocation: Stratified random sampling enables more efficient use of
resources by focusing data collection efforts on specific subgroups.
● Subgroup Analysis: Stratified random sampling facilitates subgroup analysis by ensuring
an adequate representation of each subgroup.
● Increased Statistical Power: Stratified random sampling can enhance the statistical power
of a study.
Cluster sampling is a sampling technique used in research and surveys where the target
population is divided into clusters or groups, and a subset of clusters is selected for data
collection. It is a practical and efficient method of sampling when the population is large and
geographically dispersed.
29. Discuss the primary scales of measurement.
● Nominal Scale: The nominal scale represents the most basic level of measurement. It
involves categorizing data into distinct groups or categories without any order or
numerical significance. Examples of nominal scale variables include gender
(male/female), marital status (single/married/divorced), or colors (red/blue/green).
● Ordinal Scale: The ordinal scale incorporates the properties of the nominal scale, but it
additionally establishes a relative order or ranking among the categories.
● Interval Scale: The interval scale possesses the characteristics of the ordinal scale, but in
addition, it has equal intervals between the values. An example of the interval scale is the
Fahrenheit or Celsius temperature scale.
● Ratio Scale: The ratio scale represents the highest level of measurement and encompasses
all the characteristics of the nominal, ordinal, and interval scales. Examples of ratio scale
variables include height, weight, time, and income. Top of Form
A research hypothesis is a guess you make about what you think will happen in your study. It's
like a prediction that you can test to see if it's true.
Types of Hypotheses
1. Identify the Problem : Figure out what question you want to answer.
2. Review Existing Research : Look at what other studies have found.
3. Define Variables : Decide what you are going to measure (e.g., study time, grades).
4. Write Your Hypothesis : Make a clear and testable guess.
5. Refine It :Make sure it's specific and can be measured.
● Research Question: Does using interactive tools in online classes help students stay
engaged?
● Literature Review: Previous studies show that interactive tools might make learning more
fun and engaging.
● Variables:
- Independent Variable: Use of interactive tools (like quizzes).
- Dependent Variable: Student engagement (measured by how much they
participate).
● Hypothesis:
- Null Hypothesis (H0): "Interactive tools do not make a difference in student
engagement."
- Alternative Hypothesis (H1): "Interactive tools make students more engaged."
- Design the Study: Plan how you'll test your hypothesis (e.g., an experiment).
- Collect Data: Gather information by observing or surveying students.
- Analyze Data: Look at the data to see if it supports your hypothesis.
- Draw Conclusions: Decide if your guess was right or wrong based on the data.
Conclusion
A research hypothesis is a clear and testable guess about what you think will happen in your
study. It's an important part of research because it guides your study, helps you focus, and
connects your work to existing research. By making and testing hypotheses, you can learn more
about the world and contribute to scientific knowledge.
31. Discuss one tail and two tail tests with respect to hypothesis testing.
When you do hypothesis testing, you can use a one-tail test or a two-tail test. The type of test you
pick depends on what you're trying to find out.
1. One-Tail Test
A one-tail test checks for a difference in just one direction. It can show if something is only
going up or only going down, but not both.
Example 1: You want to see if a new medicine lowers blood pressure more than the old one.
● Hypothesis:
- Null Hypothesis (H0): The new medicine is not more effective.
- Alternative Hypothesis (H1): The new medicine is more effective (it lowers blood
pressure more).
Example 2: You want to know if a new teaching method improves test scores.
● Hypothesis:
- Null Hypothesis (H0): The new method does not improve test scores.
- Alternative Hypothesis (H1): The new method improves test scores.
2. Two-Tail Test
A two-tail test checks for a difference in both directions. It can show if something goes up or
down.
Example 1: You want to see if a new medicine changes blood pressure (either higher or lower).
● Hypothesis:
- Null Hypothesis (H0): The new medicine does not change blood pressure.
- Alternative Hypothesis (H1): The new medicine changes blood pressure.
Example 2: You want to know if a new teaching method changes test scores (either better or
worse).
● Hypothesis:
- Null Hypothesis (H0): The new method does not change test scores.
- Alternative Hypothesis (H1): The new method changes test scores.
1. Set the Significance Level (α): Usually 0.05, but split into two (0.025 each tail).
2. Find the Critical Values: For both ends of the range.
3. Calculate the Test Statistic: Using the data you have.
4. Compare the Test Statistic to the Critical Values:
- If the test statistic is in either critical region (tails), you reject the null hypothesis.
Conclusion
Choose a one-tail test if you only care about finding a change in one direction. Choose a two-tail
test if you care about finding any change, whether it goes up or down. This choice affects how
you look at your test results and what you can conclude from your study.
When performing hypothesis testing, there are two main types of errors that can occur: Type I
errors and Type II errors. Understanding these errors is crucial for interpreting the results of
statistical tests correctly.
1. Type I Error (False Positive)
A Type I error occurs when the null hypothesis (H0) is true, but you incorrectly reject it. In other
words, you think there is an effect or difference when there actually isn't.
Example
● You test a new drug to see if it lowers blood pressure. The drug doesn't actually work, but
your test results mistakenly show that it does.
● Null Hypothesis (H0): The drug has no effect on blood pressure.
● Alternative Hypothesis (H1): The drug lowers blood pressure.
Type I Error: Concluding that the drug lowers blood pressure when it actually does not.
Probability: The probability of making a Type I error is denoted by the significance level (α),
commonly set at 0.05. This means there is a 5% chance of making a Type I error.
A Type II error occurs when the null hypothesis (H0) is false, but you fail to reject it. In other
words, you think there is no effect or difference when there actually is one.
Example
● You test the same new drug to see if it lowers blood pressure. The drug actually works,
but your test results fail to show this.
● Null Hypothesis (H0): The drug has no effect on blood pressure.
● Alternative Hypothesis (H1): The drug lowers blood pressure.
Type II Error: Concluding that the drug does not lower blood pressure when it actually does.
Probability: The probability of making a Type II error is denoted by β. The power of a test,
which is 1 - β, is the probability of correctly rejecting a false null hypothesis.
Conclusion
In hypothesis testing, understanding and balancing Type I and Type II errors is crucial.
Researchers aim to minimize both errors, but this often involves trade-offs. The chosen
significance level (α) and the power of the test (1 - β) help determine the likelihood of these
errors, guiding decisions in statistical analysis and ensuring accurate interpretations.
33. Steps of hypothesis testing.
Example: α = 0.05
Example: Collect blood pressure readings from patients before and after taking the drug.
5. Calculate the Test Statistic
Use the chosen test to calculate a number (test statistic) that tells you how far your
sample data is from what the null hypothesis predicts.
Example: Calculate the t-statistic for the difference in blood pressure before and after the drug.
Example: Find the critical value from a t-table or calculate the p-value.
7. Make a Decision
● Compare the test statistic to the critical value or compare the p-value to α.
- If the test statistic is greater than the critical value or the p-value is less
than α, reject H0.
- If not, do not reject H0.
Draw a Conclusion
Example: There is enough evidence to say the new drug has an effect.
The significance level, often denoted as alpha (α), is a probability value that researchers set
before conducting an experiment. It helps in deciding whether the observed results are due to a
real effect or just random chance. Common values for α are 0.05 (5%), 0.01 (1%), and 0.10
(10%).
Why is it Important?
1. Hypothesis Testing: Researchers start with a null hypothesis (e.g., a new drug has no
effect) and an alternative hypothesis (e.g., the drug does have an effect).
2. Calculate P-Value: After conducting the experiment, they calculate a p-value, which
indicates the probability of obtaining the observed results if the null hypothesis is true.
3. Compare with α: If the p-value is less than or equal to the significance level (α), the null
hypothesis is rejected in favor of the alternative hypothesis. If the p-value is greater than
α, the null hypothesis is not rejected.
Example
Imagine you are testing a new fertilizer on plant growth. You set your significance level at 0.05
(5%). After conducting the experiment and analyzing the data, you find a p-value of 0.03. Since
0.03 is less than 0.05, you reject the null hypothesis and conclude that the new fertilizer has a
significant effect on plant growth.
● When conducting research, you collect data and perform statistical tests to see which
hypothesis is more likely to be true.
● If the data shows strong evidence against the null hypothesis, you reject it and accept the
alternative hypothesis.
● If the data does not show strong evidence against the null hypothesis, you fail to reject it
(which means you don't have enough evidence to support the alternative hypothesis).
In simple terms, the null hypothesis is like saying "nothing is happening," and the alternative
hypothesis is like saying "something is happening." Researchers use these hypotheses to guide
their experiments and analyze their results.