Reinforcement Learning - Unit 7 - Week 4

The document outlines the details of an assignment for a Reinforcement Learning course offered through NPTEL, including submission deadlines and various questions related to Markov Decision Processes (MDPs). It includes multiple-choice questions on concepts such as state transition graphs, optimal policies, and the properties of finite MDPs.

Week 4: Assignment 4
Your last recorded submission was on 2025-08-19, 11:44 IST.
Due date: 2025-08-20, 23:59 IST.

1) State True/False 1 point

The state transition graph for any MDP is a directed acyclic graph.

True
False

2) Consider the following statements: 1 point

(i) The optimal policy of an MDP is unique.
(ii) We can determine an optimal policy for an MDP using only the optimal value function (v∗), without accessing the MDP parameters.
(iii) We can determine an optimal policy for a given MDP using only the optimal q-value function (q∗), without accessing the MDP parameters.

Which of these statements are false?

Only (ii)
Only (iii)
Only (i), (ii)
Only (i), (iii)
Only (ii), (iii)
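As background for statements (ii) and (iii), here is a minimal sketch (a hypothetical 2-state, 2-action MDP with made-up numbers, not a solved problem) contrasting the two extractions: a greedy policy falls out of q∗ by a plain argmax, while recovering one from v∗ requires a one-step lookahead through the MDP parameters p and r.

```python
import numpy as np

# Hypothetical toy MDP: 2 states, 2 actions (all values made up for illustration).
gamma = 0.9
P = np.array([[[0.8, 0.2], [0.1, 0.9]],    # P[s, a, s'] transition probabilities
              [[0.5, 0.5], [0.3, 0.7]]])
r = np.array([[1.0, 0.0],                   # r[s, a] expected immediate reward
              [0.5, 2.0]])

# Placeholder q-values (stand-ins for q*, not actually optimal here).
q_star = r + gamma * P @ np.array([5.0, 7.0])

# From q*: no model needed, just an argmax over actions in each state.
policy_from_q = q_star.argmax(axis=1)

# From v*: a one-step lookahead, which needs P, r and gamma (the MDP parameters).
v_star = q_star.max(axis=1)
policy_from_v = (r + gamma * P @ v_star).argmax(axis=1)

print(policy_from_q, policy_from_v)
```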
Week 2 ()
3) Which of the following statements are true for a finite MDP? (Select all that apply.) 1 point

The Bellman equation of a value function of a finite MDP defines a contraction in Banach space (using the max norm).
If 0 ≤ γ < 1, then the eigenvalues of γP_π are less than 1.
We call a normed vector space 'complete' if Cauchy sequences exist in that vector space.
The sequence defined by v_n = r_π + γP_π v_{n−1} is a Cauchy sequence in Banach space (using the max norm).

(P_π is a stochastic matrix)
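To make the contraction claim concrete, here is a small numerical sketch (random stochastic matrix and arbitrary rewards, all made up): iterating v_n = r_π + γP_π v_{n−1} and printing the max-norm distance to the fixed point, which shrinks by at least a factor of γ per step.

```python
import numpy as np

rng = np.random.default_rng(0)
n, gamma = 4, 0.9

# Random stochastic matrix P_pi (rows sum to 1) and arbitrary reward vector r_pi.
P = rng.random((n, n)); P /= P.sum(axis=1, keepdims=True)
r = rng.uniform(-1, 1, n)

v_fixed = np.linalg.solve(np.eye(n) - gamma * P, r)  # exact fixed point

v = np.zeros(n)
for k in range(10):
    v = r + gamma * P @ v                  # Bellman backup for policy pi
    err = np.max(np.abs(v - v_fixed))      # max-norm distance to the fixed point
    print(f"step {k}: ||v - v*||_inf = {err:.6f}")  # contracts by >= factor gamma
```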
4) Which of the following is a benefit of using RL algorithms for solving MDPs? 1 point

They do not require the state of the agent for solving an MDP.
They do not require the action taken by the agent for solving an MDP.
They do not require the state transition probability matrix for solving an MDP.
They do not require the reward signal for solving an MDP.
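For context on the model-free theme behind this question, here is a minimal tabular Q-learning update on a hypothetical sampled transition (all numbers invented): it uses only the observed state, action, reward and next state; a transition probability matrix never appears.

```python
import numpy as np

n_states, n_actions = 5, 2
Q = np.zeros((n_states, n_actions))
alpha, gamma = 0.1, 0.9

# One sampled transition (s, a, reward, s_next) from interacting with the
# environment; no transition matrix P is ever referenced. Values are made up.
s, a, reward, s_next = 0, 1, 1.0, 3

# Standard tabular Q-learning update.
Q[s, a] += alpha * (reward + gamma * Q[s_next].max() - Q[s, a])
print(Q[s, a])
```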

5) Consider the following equations: 1 point

(i) v_π(s) = E_π[ Σ_{i=t}^∞ γ^{i−t} R_{i+1} | S_t = s ]
(ii) q_π(s, a) = Σ_{s′} p(s′|s, a) v_π(s′)
(iii) v_π(s) = Σ_a π(a|s) q_π(s, a)

Which of the above are correct?

Only (i)
Only (i), (ii)
Only (ii), (iii)
Only (i), (iii)
(i), (ii), (iii)
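Identities like these can be checked numerically. The sketch below (a random MDP with a uniformly random policy, every quantity made up) computes v_π exactly, forms q_π with the full one-step backup (immediate reward plus discounted next-state value), and verifies identity (iii).

```python
import numpy as np

rng = np.random.default_rng(1)
nS, nA, gamma = 3, 2, 0.9

# Random MDP: P[s, a, s'] transitions, r[s, a] rewards, uniform random policy.
P = rng.random((nS, nA, nS)); P /= P.sum(axis=2, keepdims=True)
r = rng.uniform(-1, 1, (nS, nA))
pi = np.full((nS, nA), 1 / nA)

# Policy-induced transition matrix and reward vector.
P_pi = np.einsum('sa,sat->st', pi, P)
r_pi = (pi * r).sum(axis=1)

v_pi = np.linalg.solve(np.eye(nS) - gamma * P_pi, r_pi)  # exact v_pi
q_pi = r + gamma * P @ v_pi   # q_pi(s, a) = r(s, a) + gamma * E[v_pi(s')]

# Identity (iii): v_pi(s) should equal sum_a pi(a|s) * q_pi(s, a).
print(np.allclose(v_pi, (pi * q_pi).sum(axis=1)))        # True
```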
6) What is true about the γ (discount factor) in reinforcement learning? 1 point

Discount factor can be any real number
The value of γ cannot affect the optimal policy
The lower the value of γ, the more myopic the agent gets, i.e., the agent maximises rewards that it receives over a shorter horizon
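A quick worked illustration of the myopia point: with a constant reward of 1 per step, the fraction of the total discounted return collected in the first k steps is (1 − γ^k)/(1 − γ) divided by 1/(1 − γ), i.e. 1 − γ^k, so a small γ concentrates almost all value on the near term.

```python
# Fraction of the total discounted return (constant reward 1 per step)
# collected within the first 10 steps: 1 - gamma**10.
for gamma in (0.5, 0.9, 0.99):
    frac = 1 - gamma ** 10
    print(f"gamma={gamma}: first 10 steps hold {100 * frac:.1f}% of total return")
```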
7) Consider the following statements for a finite MDP (I is an identity matrix with dimensions |S| × |S|, S is the set of all states, and P_π is a stochastic matrix): 1 point

(i) An MDP with stochastic rewards may not have a deterministic optimal policy.
(ii) There can be multiple optimal stochastic policies.
(iii) If 0 ≤ γ < 1, then the rank of the matrix I − γP_π is equal to |S|.
(iv) If 0 ≤ γ < 1, then the rank of the matrix I − γP_π is less than |S|.

Which of the above statements are true?

Only (ii), (iii)
Only (ii), (iv)
Only (i), (iii)
Only (i), (ii), (iii)
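Statements (iii) and (iv) can be probed numerically (a spot check on a random example, not a proof). Since P_π is stochastic, its eigenvalues lie in the unit disc, so every eigenvalue of γP_π has magnitude at most γ < 1 and 1 is never an eigenvalue; the sketch below checks the resulting rank of I − γP_π.

```python
import numpy as np

rng = np.random.default_rng(2)
n, gamma = 6, 0.9

# Random stochastic matrix: eigenvalues of P_pi lie in the unit disc,
# so eigenvalues of gamma * P_pi have magnitude <= gamma < 1.
P = rng.random((n, n)); P /= P.sum(axis=1, keepdims=True)

M = np.eye(n) - gamma * P
print(np.linalg.matrix_rank(M))                    # n, i.e. |S|: full rank
print(np.abs(np.linalg.eigvals(gamma * P)).max())  # <= gamma < 1
```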

8) Consider an MDP with 3 states A, B, C. At each state we can go to either of the other two states, i.e., if we are in state A then we can perform 2 actions, going to state B or C. The rewards for each transition are r(A, B) = −3 (reward if we go from A to B), r(B, A) = −1, r(B, C) = 8, r(C, B) = 4, r(A, C) = 0, r(C, A) = 5, and the discount factor is 0.9. Find the fixed point of the value function for the policy π(A) = B (if we are in state A we choose the action to go to B), π(B) = C, π(C) = A. v_π([A, B, C]) = ? (round to 1 decimal place) 1 point

[20.6, 21.8, 17.6]
[30.4, 44.2, 32.4]
[30.4, 37.2, 32.4]
[21.6, 21.8, 17.6]
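The fixed point here can be obtained by solving the linear system v = r_π + γP_π v, i.e. v = (I − γP_π)^{−1} r_π. A minimal sketch for the stated deterministic policy (A→B, B→C, C→A):

```python
import numpy as np

gamma = 0.9
# Under policy pi: A -> B, B -> C, C -> A (deterministic transitions).
P_pi = np.array([[0, 1, 0],    # state order [A, B, C]
                 [0, 0, 1],
                 [1, 0, 0]], dtype=float)
r_pi = np.array([-3.0, 8.0, 5.0])  # r(A,B), r(B,C), r(C,A)

# Solve (I - gamma * P_pi) v = r_pi for the fixed point.
v = np.linalg.solve(np.eye(3) - gamma * P_pi, r_pi)
print(v.round(1))  # value of each state under pi
```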
9) Which of the following is not a valid norm function? (x is a D-dimensional vector) 1 point

max_{d∈{1,…,D}} |x_d|
√( Σ_{d=1}^D x_d² )
min_{d∈{1,…,D}} |x_d|
Σ_{d=1}^D |x_d|
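One axiom worth keeping in mind here is positive definiteness: a norm may equal zero only at the zero vector. The sketch below evaluates all four candidates on a nonzero vector that happens to have a zero coordinate, the standard probe for the min-based candidate.

```python
import numpy as np

x = np.array([1.0, 0.0, 2.0])  # nonzero vector with a zero coordinate

candidates = {
    "max |x_d|":       np.max(np.abs(x)),      # L-infinity norm
    "sqrt(sum x_d^2)": np.sqrt(np.sum(x**2)),  # L2 norm
    "min |x_d|":       np.min(np.abs(x)),      # 0 here despite x being nonzero
    "sum |x_d|":       np.sum(np.abs(x)),      # L1 norm
}
for name, val in candidates.items():
    print(f"{name}: {val}")
```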

10) For an operator L, which of the following properties must be satisfied by x for it to be a fixed point of L? (Multi-Correct) 1 point

Lx = x
L²x = x
∀λ > 0, Lx = λx
None of the above
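A fixed point of L is, by definition, a point with Lx = x, and the other conditions do not imply it. For instance, take L = −I on the reals: L²x = x holds for every x, yet no x ≠ 0 is a fixed point. A one-line check:

```python
L = lambda x: -x    # the operator L = -I on the reals

x = 3.0
print(L(L(x)) == x)  # True:  L^2 x = x for every x
print(L(x) == x)     # False: x = 3 is not a fixed point of L
```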

You may submit any number of times before the due date. The final submission will be considered for
grading.
Submit Answers
