Introduction to Large Language Models
Assignment- 2
Number of questions: 8    Total marks: 6 × 1 + 2 × 2 = 10
_________________________________________________________________________
QUESTION 1:
A 5-gram model is a ___________ order Markov Model.
a. Constant
b. Five
c. Six
d. Four
Correct Answer: d
Solution: An N-gram model conditions each word only on the preceding N − 1 words.
An N-gram language model ≡ an (N − 1)-th order Markov model, so a 5-gram model is a 4th-order Markov model.
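As a quick check, here is a minimal sketch (illustrative only, not from the lecture material) encoding this equivalence in Python:

    def markov_order(n: int) -> int:
        # An n-gram language model conditions each word on the previous n-1 words,
        # so it corresponds to an (n-1)-th order Markov model.
        return n - 1

    assert markov_order(5) == 4  # a 5-gram model is a 4th-order Markov model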
_________________________________________________________________________
QUESTION 2:
For a given corpus, the count of the unigram “stay” is 300. If the Maximum
Likelihood Estimate (MLE) for the bigram “stay curious” is 0.4, what is the count of
the bigram “stay curious”?
a. 123
b. 300
c. 273
d. 120
Correct Answer: d
Solution:
PMLE(curious | stay) = C(stay, curious) / C(stay)
0.4 = C(stay, curious) / 300
C(stay, curious) = 0.4 × 300 = 120
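For illustration, a minimal sketch of this calculation in Python (the numbers are the values given in the question, not counts from a real corpus):

    # MLE for a bigram: P(w2 | w1) = C(w1, w2) / C(w1)
    count_stay = 300            # C(stay), given
    p_curious_given_stay = 0.4  # P_MLE(curious | stay), given
    count_stay_curious = p_curious_given_stay * count_stay
    print(count_stay_curious)   # 120.0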
_________________________________________________________________________
QUESTION 3:
Which of the following are governing principles for Probabilistic Language Models?
a. Chain Rule of Probability
b. Markov Assumption
c. Fourier Transform
d. Gradient Descent
Correct Answer: a,b
Solution: Probabilistic Language Models rely on the Chain Rule of Probability and the
Markov Assumption to build a probability distribution over sequences of words.
_________________________________________________________________________
For Questions 4 and 5, consider the following corpus:
<s> the sunset is nice </s>
<s> people watch the sunset </s>
<s> they enjoy the beautiful sunset </s>
QUESTION 4:
Assuming a bi-gram language model, calculate the probability of the sentence:
<s> people watch the beautiful sunset </s>
Ignore the unigram probability P(<s>) in your calculation.
a. 2/27
b. 1/27
c. 2/9
d. 1/6
Correct Answer: a
Solution:
P(<s> people watch the beautiful sunset </s>) = P(<s>) * P(people | <s>) * P(watch |
people) * P(the | watch) * P(beautiful | the) * P(sunset | beautiful) * P(</s> | sunset)
Ignoring the leading unigram probability P(<s>), we have:
P(<s> people watch the beautiful sunset </s>) = P(people | <s>) * P(watch | people) * P(the
| watch) * P(beautiful | the) * P(sunset | beautiful) * P(</s> | sunset)
The conditional probability P(y | x) is calculated according to its MLE as:
P(y | x) = Count(x, y) / Count(x)
P(people | <s>) = 1/3
P(watch | people) = 1/1
P(the ∣ watch) = 1/1
P(beautiful ∣ the) = 1/3
P(sunset ∣ beautiful) = 1/1
P(</s> ∣ sunset) = 2/3
Thus, P(<s> people watch the beautiful sunset </s>) = 1/3 × 1 × 1 × 1/3 × 1 × 2/3 = 2/27
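For illustration, a minimal sketch in Python that recomputes this probability from the three-sentence corpus above (the helper names are hypothetical, not part of the course material):

    from collections import Counter

    corpus = [
        "<s> the sunset is nice </s>",
        "<s> people watch the sunset </s>",
        "<s> they enjoy the beautiful sunset </s>",
    ]

    unigrams, bigrams = Counter(), Counter()
    for sent in corpus:
        tokens = sent.split()
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))

    def p_mle(y, x):
        # MLE bigram probability: P(y | x) = Count(x, y) / Count(x)
        return bigrams[(x, y)] / unigrams[x]

    sentence = "<s> people watch the beautiful sunset </s>".split()
    prob = 1.0
    for x, y in zip(sentence, sentence[1:]):
        prob *= p_mle(y, x)
    print(prob)  # 0.0740... = 2/27
_________________________________________________________________________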
QUESTION 5:
Assuming a bi-gram language model, calculate the perplexity of the sentence:
<s> people watch the beautiful sunset </s>
Please do not consider <s> and </s> as words of the sentence.
a. 27^(1/4)
b. 27^(1/5)
c. 9^(1/6)
d. (27/2)^(1/5)
Correct Answer: d
Solution:
As calculated in the previous question,
P(<s> people watch the beautiful sunset </s>) = 2/27
Ignoring <s> and </s>, total number of words in the sentence = 5
Thus, Perplexity = P(sentence)^(-1/N) = (2/27)^(-1/5) = (27/2)^(1/5)
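Continuing the sketch from Question 4 (assumed values; N counts only the 5 words between <s> and </s>):

    prob = 2 / 27       # sentence probability from Question 4
    n_words = 5         # sentence length, excluding <s> and </s>
    perplexity = (1.0 / prob) ** (1.0 / n_words)
    print(perplexity)   # (27/2)^(1/5) ≈ 1.683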
_________________________________________________________________________
QUESTION 6:
What is the main intuition behind Kneser-Ney smoothing?
a. Assign higher probability to frequent words.
b. Use continuation probability to better model words appearing in a novel context.
c. Normalize probabilities by word length.
d. Minimize perplexity for unseen words.
Correct Answer: b
Solution: Please refer to lecture slides.
_________________________________________________________________________
QUESTION 7:
In perplexity-based evaluation of a language model, what does a lower perplexity score
indicate?
a. Worse model performance
b. Better language model performance
c. Increased vocabulary size
d. More sparse data
Correct Answer: b
Solution: Please refer to lecture slides.
_________________________________________________________________________
QUESTION 8:
Which of the following is a limitation of statistical language models like n-grams?
a. Fixed context size
b. High memory requirements for large vocabularies
c. Difficulty in generalizing to unseen data
d. All of the above
Correct Answer: d
Solution: N-gram models suffer from fixed context size, data sparsity, high memory usage,
and inability to generalize well to unseen data.
_________________________________________________________________________