
Introduction to Machine Learning

Week 12
Prof. B. Ravindran, IIT Madras

1. (1 Mark) What is the VC dimension of the class of linear classifiers in 2D space?


(a) 2
(b) 3
(c) 4
(d) None of the above
Soln. B - Any 3 points in general position can be shattered by a linear decision boundary, but no set of 4 points in 2D can be (the XOR-style labelling is not linearly separable), so the VC dimension is 3.
2. (1 Mark) Which of the following learning algorithms does NOT typically perform empirical
risk minimization?
(a) Linear regression
(b) Logistic regression
(c) Decision trees
(d) Support Vector Machines
Soln. D - SVMs are based on margin maximization (structural risk minimization) rather than directly minimizing the empirical risk. Refer to the lectures.
3. (2 Marks) Statement 1: As the size of the hypothesis class increases, the sample complexity
for PAC learning always increases.
Statement 2: A larger hypothesis class has a higher VC dimension.
Choose the correct option:
(a) Statement 1 is true. Statement 2 is true. Statement 2 is the correct reason for statement
1
(b) Statement 1 is true. Statement 2 is true. Statement 2 is not the correct reason for
statement 1
(c) Statement 1 is true. Statement 2 is false
(d) Both statements are false
Soln. B - Refer to the lectures
4. (1 Mark) When a model’s hypothesis class is too small, how does this affect the model’s
performance in terms of bias and variance?
(a) High bias, low variance
(b) Low bias, high variance
(c) High bias, high variance
(d) Low bias, low variance
Soln. A - A hypothesis class that is too small cannot capture the underlying pattern (underfitting), which gives high bias; since the few available hypotheses change little across training sets, variance is low.

5. (1 Mark) Imagine you’re designing a robot that needs to navigate through a maze to reach a
target. Which reward scheme would be most effective in teaching the robot to find the shortest
path?
(a) +5 for reaching the target, -1 for hitting a wall
(b) +5 for reaching the target, -0.1 for every second that passes before the robot reaches the
target.
(c) +5 for reaching the target, -0.1 for every second that passes before the robot reaches the
target, +1 for hitting a wall.
(d) -5 for reaching the target, +0.1 for every second that passes before the robot reaches the
target.
Soln. B - The +5 reward for reaching the target encourages goal achievement, while the -0.1 penalty for each second promotes finding the shortest path. A separate reward or penalty for hitting walls is unnecessary, since the question only asks about finding the shortest path.
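To make the arithmetic concrete, here is a tiny Python sketch of the return collected under scheme (b); the function name and the example durations are illustrative assumptions, not part of the question:

    # Return under scheme (b): +5 on reaching the target, -0.1 per second taken.
    def episode_return(seconds_to_target):
        return 5.0 - 0.1 * seconds_to_target

    print(episode_return(10))   # 4.0
    print(episode_return(20))   # 3.0 -> slower routes earn less, so the shortest path is preferred

Under scheme (a), by contrast, every wall-avoiding path earns the same return of +5 regardless of its length, so nothing pushes the robot towards the shortest path.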

For the rest of the questions, we will follow a simplistic game and see how a Reinforcement
Learning agent can learn to behave optimally in it.
This is our game, a row of seven states (layout inferred from the solutions below, as the original figure is not reproduced):

LE - X1 - X2 - Start - X3 - X4 - RE

At the start of the game, the agent is on the Start state and can choose to move left or right at each turn. If it reaches the right end (RE), it wins, and if it reaches the left end (LE), it loses.

Because we love maths so much, instead of saying the agent wins or loses, we will say that the agent gets a reward of +1 at RE and a reward of -1 at LE. Then the objective of the agent is simply to maximize the reward it obtains!
6. (1 Mark) For each state, we define a variable that will store its value. The value of the state
will help the agent determine how to behave later. First we will learn this value.

Let V be the mapping from state to its value.


Initially,
V(LE) = -1
V(X1) = V(X2) = V(X3) = V(X4) = V(Start) = 0
V(RE) = +1
For each state S ∈ {X1, X2, X3, X4, Start}, with S_L being the state to its immediate left and S_R being the state to its immediate right, repeat:

V(S) = 0.9 × max(V(S_L), V(S_R))

Till V converges (does not change for any state). (See the code sketch after question 8.)

What is V(X4) after one application of the given formula?

(a) 1
(b) 0.9
(c) 0.81
(d) 0
Soln. B -
V(X4) = 0.9 × max(V(X3), V(RE))
V(X4) = 0.9 × max(0, +1) = 0.9

7. (1 Mark) What is V(X1) after one application of given formula?


(a) -1
(b) -0.9
(c) -0.81
(d) 0
Soln. D -
V(X1) = 0.9 × max(V(LE), V(X2))
V(X1) = 0.9 × max(−1, 0) = 0

8. (2 Marks) What is V(X1) after V converges?


(a) 0.59
(b) -0.9
(c) 0.63
(d) 0
Soln. A - This is the sequence of changes in V:
V(X4) = 0.9 → V(X3) = 0.81 → V(Start) = 0.729 → V(X2) = 0.6561 → V(X1) = 0.59049
The final value for X1 is therefore approximately 0.59, i.e. option (a).
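As noted above, here is a minimal Python sketch of the value-update loop from question 6, assuming the chain layout LE - X1 - X2 - Start - X3 - X4 - RE inferred from the solutions (the variable names are my own, not from the question):

    # Value updates on the chain LE - X1 - X2 - Start - X3 - X4 - RE.
    # LE and RE are terminal, so only the interior states are updated.
    states = ["LE", "X1", "X2", "Start", "X3", "X4", "RE"]
    V = {s: 0.0 for s in states}
    V["LE"], V["RE"] = -1.0, 1.0

    while True:
        old = dict(V)
        for i in range(1, len(states) - 1):                  # skip the terminal ends
            left, right = states[i - 1], states[i + 1]
            V[states[i]] = 0.9 * max(old[left], old[right])  # synchronous update
        if V == old:                                         # converged: no state changed
            break

    print(V["X1"])  # ≈ 0.59049 = 0.9**5, matching option (a) in question 8

The first sweep of this loop reproduces V(X4) = 0.9 and V(X1) = 0 from questions 6 and 7, and at convergence V(X1) = 0.9^5 ≈ 0.59, in agreement with question 8.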
