Introduction to Reinforcement Learning
What is reinforcement learning?
▪ Sutton and Barto, 1998: “Reinforcement learning is learning what to do - how to map situations to
actions - so as to maximize a numerical reward signal”.
▪ ChatGPT, 2022: “Reinforcement learning is a type of machine learning in which an agent learns to
interact with its environment in order to maximize a reward signal”.
[Figure: agent-environment interaction loop. The agent observes state st, takes action at, receives reward r(st, at), and the environment moves to the next state st+1.]
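This interaction loop can be written in a few lines of code. Below is a minimal sketch assuming a hypothetical environment object exposing reset() and step(action) methods (here returning next state, reward, and a done flag) and an arbitrary policy function; it illustrates the loop only, not any specific library API.

```python
# Minimal sketch of the agent-environment loop (hypothetical `env` object;
# `policy` is any function mapping a state to an action).
def run_episode(env, policy, max_steps=1000):
    """Roll out one episode and return the total collected reward."""
    state = env.reset()                          # observe initial state s0
    total_reward = 0.0
    for t in range(max_steps):
        action = policy(state)                   # agent picks a_t given s_t
        state, reward, done = env.step(action)   # env returns s_{t+1} and r(s_t, a_t)
        total_reward += reward
        if done:                                 # episode terminated
            break
    return total_reward
```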
Example: Multi-agent game
2019: Learning to play hide and seek via
multi-agent reinforcement learning [1].
Recent advances
▪ 2013: Atari. Deep Q-learning for Atari games [2].
▪ 2016: Energy saving. DeepMind AI reduces Google data centre cooling bill by 40% [3].
▪ 2017: AlphaGo/AlphaZero. AI achieving grandmaster level in chess, go, and shogi [4,5].
▪ 2018: OpenAI Five. Training five artificial intelligence agents to play Dota 2 [6].
▪ 2019: AlphaStar. AI achieving grandmaster level in the StarCraft II game [7]. Rubik's Cube: solving Rubik's Cube with a human-like robot hand [8].
▪ 2022: AlphaTensor. Discovering faster matrix multiplication algorithms [9]. ChatGPT: a language model trained to generate human-like responses to text input [10].
(Potential) real world applications
▪ Robotics: teaching a robot how to walk in the wild [11].
▪ Autonomous driving.
▪ Control of power grids [12].
An interdisciplinary field
Many facets of reinforcement learning [13].
Markov decision processes
A Markov decision process is given by a tuple ℳ = (𝒮, 𝒜, P, ρ) where…
▪ 𝒮 is the set of all possible states.
▪ 𝒜 is the set of all possible actions.
▪ P is the transition law with P(s′| s, a) = Pr(st+1 = s′| st = s, at = a).
▪ ρ is the initial state distribution with ρ(s) = Pr(s0 = s).
Markov property
Pr(st+1 | st, st−1, …, s0, at, at−1, …, a0) = Pr(st+1 | st, at) → a stochastic dynamical system!
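To make the tuple ℳ = (𝒮, 𝒜, P, ρ) concrete, here is a minimal Python sketch of a toy two-state MDP; the states, actions, and probabilities are made up purely for illustration.

```python
# A toy MDP M = (S, A, P, rho) written out explicitly (all numbers are
# illustrative). P[s][a] maps a next state s' to Pr(s_{t+1}=s' | s_t=s, a_t=a).
S = ["s0", "s1"]                # state space
A = ["stay", "move"]            # action space

P = {
    "s0": {"stay": {"s0": 1.0}, "move": {"s0": 0.2, "s1": 0.8}},
    "s1": {"stay": {"s1": 1.0}, "move": {"s0": 0.7, "s1": 0.3}},
}

rho = {"s0": 1.0, "s1": 0.0}    # initial state distribution

# Sanity check: every transition distribution sums to 1.
for s in S:
    for a in A:
        assert abs(sum(P[s][a].values()) - 1.0) < 1e-9
```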
Example: Deterministic MDP
Time evolution
▪ Start in s0 ∼ ρ.
▪ At each time t:
• Take action at ∈ 𝒜.
• End up in state st+1 ∼ P( ⋅ | st, at).
Example: Stochastic MDP (Gridworld)
Time evolution
▪ Start in s0 ∼ ρ.
▪ At each time t:
• Take action at ∈ 𝒜.
• End up in state st+1 ∼ P( ⋅ | st, at).
Policy
Decision rule
▪ In state s ∈ 𝒮, we take action a ∈ 𝒜 with
probability π(a | s).
▪ π( ⋅ | s) is a probability distribution over 𝒜.
Objective
Reward and discount factor
▪ Reward function r : 𝒮 × 𝒜 → ℝ.
▪ Discount rate γ ∈ (0,1).
Objective function
The goal is to find an optimal policy π* maximizing
J(π) := 𝔼[ ∑_{t=0}^{∞} γ^t r(st, at) | s0 ∼ ρ, π ]
Example: Stochastic MDP (Gridworld)
Time evolution
▪ Start in s0 ∼ ρ.
▪ At each time t:
• Take action at ∼ π( ⋅ | st).
• Get reward r(st, at).
• End up in state st+1 ∼ P( ⋅ | st, at).
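A minimal sketch of this time evolution in Python, reusing the toy two-state MDP from above together with a made-up reward table and policy; it samples one trajectory and accumulates the discounted sum of rewards that the objective J(π) takes the expectation of.

```python
# Sample one trajectory from a toy MDP under a stochastic policy and compute
# its discounted return (all numbers are illustrative).
import random

P = {
    "s0": {"stay": {"s0": 1.0}, "move": {"s0": 0.2, "s1": 0.8}},
    "s1": {"stay": {"s1": 1.0}, "move": {"s0": 0.7, "s1": 0.3}},
}
rho = {"s0": 1.0, "s1": 0.0}
reward = {("s0", "stay"): 0.0, ("s0", "move"): 0.0,    # r(s, a), made up
          ("s1", "stay"): 1.0, ("s1", "move"): 1.0}
policy = {"s0": {"stay": 0.5, "move": 0.5},            # pi(a | s), made up
          "s1": {"stay": 0.9, "move": 0.1}}

def sample(dist):
    """Draw a key from a {outcome: probability} dictionary."""
    return random.choices(list(dist), weights=list(dist.values()))[0]

def discounted_return(gamma=0.99, horizon=200):
    s = sample(rho)                      # s0 ~ rho
    G = 0.0
    for t in range(horizon):             # truncate the infinite sum at `horizon`
        a = sample(policy[s])            # a_t ~ pi(. | s_t)
        G += gamma**t * reward[(s, a)]   # accumulate gamma^t r(s_t, a_t)
        s = sample(P[s][a])              # s_{t+1} ~ P(. | s_t, a_t)
    return G

print(discounted_return())
```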
Reinforcement learning vs. optimal control
Stochastic optimal control

max_π 𝔼[ ∑_{t=0}^{∞} γ^t r(st, at) | s0 ∼ ρ, π ]  →  For known P: dynamic programming. Still very hard for large 𝒮 and 𝒜.
Reinforcement learning
▪ In RL we can only sample from the MDP (in simulation or in the real world), but we do not know P.
▪ We need to explore the environment.
Many different approaches
Taxonomy of reinforcement learning approaches [14].
Policy gradient method

max_θ J(πθ) := 𝔼[ ∑_{t=0}^{∞} γ^t r(st, at) | s0 ∼ ρ, at ∼ πθ( ⋅ | st) ]

▪ Policy optimization: parameterize the policy as πθ(a | s) and then find the best policy.
▪ Direct parameterization
  πθ(a | s) = θs,a, where θ ∈ ℝ^{|𝒮|×|𝒜|}, θs,a ≥ 0 and ∑_{a∈𝒜} θs,a = 1.
Policy parameterization
▪ Softmax parameterization
  πθ(a | s) = exp(θs,a) / ∑_{a′∈𝒜} exp(θs,a′), where θ ∈ ℝ^{|𝒮|×|𝒜|}.
▪ Neural softmax parameterization
  πθ(a | s) = exp(fθ(s, a)) / ∑_{a′∈𝒜} exp(fθ(s, a′)), where fθ(s, a) represents a neural network.
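A minimal NumPy sketch of the tabular softmax parameterization (state and action counts are illustrative); a neural softmax parameterization would simply replace the table row θ[s] by the network outputs fθ(s, ⋅).

```python
# Tabular softmax policy: pi_theta(a | s) = exp(theta[s, a]) / sum_a' exp(theta[s, a']).
import numpy as np

n_states, n_actions = 4, 3                   # illustrative sizes
theta = np.zeros((n_states, n_actions))      # logits theta_{s,a}

def softmax_policy(theta, s):
    logits = theta[s] - theta[s].max()       # subtract max for numerical stability
    probs = np.exp(logits)
    return probs / probs.sum()               # a probability distribution over A

def sample_action(theta, s, rng=np.random.default_rng()):
    return rng.choice(n_actions, p=softmax_policy(theta, s))

print(softmax_policy(theta, 0))              # uniform at initialization: [1/3, 1/3, 1/3]
```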
Policy gradient method

max_θ J(πθ) := 𝔼[ ∑_{t=0}^{∞} γ^t r(st, at) | s0 ∼ ρ, at ∼ πθ( ⋅ | st) ]

▪ Parameterize the policy as πθ(a | s).
▪ Use gradient ascent to find the best policy π*.
▪ Pseudo-code for the policy gradient method (a sketch follows below).
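As a rough sketch, the policy gradient method simply iterates gradient ascent on θ. Here estimate_gradient is a placeholder (a toy concave objective) standing in for the Monte Carlo estimator derived later in these slides.

```python
# Generic policy gradient loop (a sketch; `estimate_gradient` is a stand-in so
# the loop runs end to end, not the real policy gradient estimator).
import numpy as np

def estimate_gradient(theta):
    # Placeholder: gradient of a toy concave objective J(theta) = -||theta - 1||^2.
    return -2.0 * (theta - 1.0)

def policy_gradient(theta0, alpha=0.1, K=100):
    theta = theta0.copy()
    for k in range(K):                       # k = 0, 1, ..., K-1
        grad = estimate_gradient(theta)      # estimate grad_theta J(pi_theta)
        theta += alpha * grad                # gradient ascent step
    return theta

print(policy_gradient(np.zeros(3)))          # converges towards the maximizer [1, 1, 1]
```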
Results for policy gradient method
▪ Can we converge to the optimal policy when K is big enough? Non-convexity may lead to a sub-optimal policy.
▪ Convergence for direct parametrization [15]:
  • Let ρ(s) > 0, ∀s ∈ 𝒮 and α ≤ (1 − γ)^3 / (2γ|𝒜|). Then min_{t≤K} J(π*) − J(πt) ≤ c1/√K.
▪ Convergence for softmax parametrization [16]:
  • Let ρ(s) > 0, ∀s ∈ 𝒮 and α ≤ (1 − γ)^3 / 8. Then J(π*) − J(πK) ≤ c2/K.
▪ Here, c1 and c2 are constants that depend on the MDP ℳ = (𝒮, 𝒜, P, ρ, r, γ).
How to compute the gradient ∇θ J(πθ)?
▪ For every random trajectory τ = (s0, a0, s1, a1, …), the probability of generating this trajectory is
  pθ(τ) := ρ(s0) ∏_{t=0}^{∞} πθ(at | st) P(st+1 | st, at).
▪ The reward we collect along this trajectory τ is
  R(τ) := ∑_{t=0}^{∞} γ^t r(st, at).
▪ Then,
  J(πθ) := 𝔼[ ∑_{t=0}^{∞} γ^t r(st, at) | s0 ∼ ρ, at ∼ πθ( ⋅ | st) ] = 𝔼_{τ∼pθ}[R(τ)].
▪ Therefore,
  ∇θ J(πθ) = ∇θ 𝔼_{τ∼pθ}[R(τ)].
How to compute the gradient?
Policy gradient theorem:
  ∇θ J(πθ) = 𝔼_{τ∼pθ}[ ( ∑_{t=0}^{∞} γ^t r(st, at) ) × ( ∑_{t=0}^{∞} ∇θ log πθ(at | st) ) ]
Proof: Because
  ∇θ J(πθ) = ∇θ 𝔼_{τ∼pθ}[R(τ)] = 𝔼_{τ∼pθ}[ R(τ) ∇θ log pθ(τ) ], where R(τ) = ∑_{t=0}^{∞} γ^t r(st, at),
and since ρ and P do not depend on θ, we have ∇θ log pθ(τ) = ∑_{t=0}^{∞} ∇θ log πθ(at | st).
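For completeness, a short derivation of the log-derivative step in LaTeX, assuming the interchange of gradient and integral is justified (e.g. bounded rewards):

```latex
\begin{align*}
\nabla_\theta J(\pi_\theta)
  &= \nabla_\theta \int R(\tau)\, p_\theta(\tau)\, d\tau
   = \int R(\tau)\, p_\theta(\tau)\, \nabla_\theta \log p_\theta(\tau)\, d\tau
   = \mathbb{E}_{\tau \sim p_\theta}\!\left[ R(\tau)\, \nabla_\theta \log p_\theta(\tau) \right],\\
\nabla_\theta \log p_\theta(\tau)
  &= \nabla_\theta \Big( \log \rho(s_0)
     + \sum_{t=0}^{\infty} \log \pi_\theta(a_t \mid s_t)
     + \sum_{t=0}^{\infty} \log P(s_{t+1} \mid s_t, a_t) \Big)
   = \sum_{t=0}^{\infty} \nabla_\theta \log \pi_\theta(a_t \mid s_t).
\end{align*}
```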
How to estimate the gradient?
Monte Carlo approximation:
▪ Consider a random variable X ∼ q.
▪ Given independent and identically distributed samples X1, …, XN ∼ q, we can estimate
  𝔼[ f(X) ] ≈ (1/N) ∑_{i=1}^{N} f(Xi).
In reinforcement learning: we do not know P, but we can approximate
  ∇θ J(πθ) = 𝔼_{τ∼pθ}[ ( ∑_{t=0}^{∞} γ^t r(st, at) ) × ( ∑_{t=0}^{∞} ∇θ log πθ(at | st) ) ]
           ≈ (1/N) ∑_{i=1}^{N} ( ∑_{t=0}^{∞} γ^t r(s_t^i, a_t^i) ) ( ∑_{t=0}^{∞} ∇θ log πθ(a_t^i | s_t^i) )
           ≈ (1/N) ∑_{i=1}^{N} ( ∑_{t=0}^{H} γ^t r(s_t^i, a_t^i) ) ( ∑_{t=0}^{H} ∇θ log πθ(a_t^i | s_t^i) ),
where τ^i = (s_0^i, a_0^i, s_1^i, a_1^i, …), i = 1, …, N, are trajectories sampled by running πθ and H is a finite truncation horizon.
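A minimal NumPy sketch of this estimator for a tabular softmax policy on a toy two-state MDP (the transition probabilities, rewards, horizon, and sample size are all illustrative):

```python
# Monte Carlo estimate of grad_theta J(pi_theta) for a tabular softmax policy
# on a toy 2-state / 2-action MDP (all numbers illustrative).
import numpy as np

rng = np.random.default_rng(0)
n_states, n_actions, gamma, H, N = 2, 2, 0.9, 50, 100
P = np.array([[[1.0, 0.0], [0.2, 0.8]],      # P[s, a, s'] transition probabilities
              [[0.7, 0.3], [0.0, 1.0]]])
r = np.array([[0.0, 0.0], [1.0, 1.0]])       # r[s, a], made up
theta = np.zeros((n_states, n_actions))

def pi(theta, s):
    z = np.exp(theta[s] - theta[s].max())
    return z / z.sum()

def grad_estimate(theta):
    grad = np.zeros_like(theta)
    for _ in range(N):                        # N independent trajectories
        s, G = 0, 0.0                         # initial state s0 = 0 (rho is a point mass)
        score = np.zeros_like(theta)          # sum_t grad_theta log pi_theta(a_t | s_t)
        for t in range(H):                    # truncated horizon H
            p = pi(theta, s)
            a = rng.choice(n_actions, p=p)
            G += gamma**t * r[s, a]
            score[s] -= p                     # grad of log softmax: one_hot(a) - pi(. | s)
            score[s, a] += 1.0
            s = rng.choice(n_states, p=P[s, a])
        grad += G * score
    return grad / N

print(grad_estimate(theta))
```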
Stochastic policy gradient method
Demonstration of policy gradient method
Using the policy gradient method to play CartPole.
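Below is a hedged sketch of what such a demonstration could look like: REINFORCE with a linear softmax policy on CartPole, assuming the gymnasium package is available; hyperparameters are untuned and only meant to illustrate the training loop.

```python
# REINFORCE on CartPole with a linear softmax policy (a sketch, not a tuned demo).
import numpy as np
import gymnasium as gym

env = gym.make("CartPole-v1")
n_obs = env.observation_space.shape[0]       # 4 state features
n_act = env.action_space.n                   # 2 actions
W = np.zeros((n_act, n_obs))                 # policy parameters theta
rng = np.random.default_rng(0)
gamma, alpha = 0.99, 0.01

def policy(obs):
    logits = W @ obs
    z = np.exp(logits - logits.max())
    return z / z.sum()

for episode in range(300):
    obs, _ = env.reset()
    grads, rewards = [], []
    done = False
    while not done:
        p = policy(obs)
        a = int(rng.choice(n_act, p=p))
        # grad of log pi(a | s) w.r.t. W: (one_hot(a) - p) outer obs
        grads.append(np.outer(np.eye(n_act)[a] - p, obs))
        obs, reward, terminated, truncated, _ = env.step(a)
        rewards.append(reward)
        done = terminated or truncated
    # Discounted return of the whole episode times the summed score function.
    G = sum(gamma**t * rew for t, rew in enumerate(rewards))
    W += alpha * G * sum(grads)
    if episode % 50 == 0:
        print(f"episode {episode}: return {sum(rewards):.0f}")
env.close()
```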
Pros and cons of reinforcement learning
Pros
▪ General methods for complex tasks
▪ Adapt to changing environments
▪ Model-free: no need to know the dynamics model
Cons
▪ Sample inefficiency for model-free approaches
▪ Lack of safety and convergence guarantees
▪ Hard to assign meaningful rewards
Reinforcement learning projects in Sycamore lab
Bachelor projects
▪ Policy optimization for Pacman
Semester and master projects
▪ Safe reinforcement learning
▪ Inverse reinforcement learning
References
▪ [1] Baker, Bowen, et al. "Emergent tool use from multi-agent autocurricula." arXiv preprint arXiv:1909.07528 (2019).
▪ [2] Mnih, Volodymyr, et al. "Playing Atari with deep reinforcement learning." arXiv preprint arXiv:1312.5602 (2013).
▪ [3] DeepMind AI Reduces Google Data Centre Cooling Bill by 40%. https://www.deepmind.com/blog/deepmind-ai-reduces-google-data-centre-cooling-bill-by-40. Accessed 15 Dec. 2022.
▪ [4] Silver, David, et al. "Mastering the game of Go with deep neural networks and tree search." Nature 529.7587 (2016): 484-489.
▪ [5] Silver, David, et al. "Mastering chess and shogi by self-play with a general reinforcement learning algorithm." arXiv preprint arXiv:1712.01815 (2017).
▪ [6] Berner, Christopher, et al. "Dota 2 with large scale deep reinforcement learning." arXiv preprint arXiv:1912.06680 (2019).
▪ [7] Arulkumaran, Kai, Antoine Cully, and Julian Togelius. "AlphaStar: An evolutionary computation perspective." Proceedings of the Genetic and Evolutionary Computation Conference Companion. 2019.
▪ [8] Akkaya, Ilge, et al. "Solving Rubik's Cube with a robot hand." arXiv preprint arXiv:1910.07113 (2019).
▪ [9] Fawzi, Alhussein, et al. "Discovering faster matrix multiplication algorithms with reinforcement learning." Nature 610.7930 (2022): 47-53.
▪ [10] "ChatGPT: Optimizing Language Models for Dialogue." OpenAI, 30 Nov. 2022, https://openai.com/blog/chatgpt/.
▪ [11] Miki, Takahiro, et al. "Learning robust perceptive locomotion for quadrupedal robots in the wild." Science Robotics 7.62 (2022): eabk2822.
▪ [12] Ibrahim, Muhammad Sohail, Wei Dong, and Qiang Yang. "Machine learning driven smart electric power systems: Current trends and new perspectives." Applied Energy 272 (2020): 115237.
▪ [13] Niao He, Lecture notes on "Introduction to Reinforcement Learning", ETH Zurich, 2021, https://odi.inf.ethz.ch/files/zinal/Lecture-1-RL-introduction.pdf
▪ [14] OpenAI, "Taxonomy of RL Algorithms", https://spinningup.openai.com/en/latest/spinningup/rl_intro2.html#citations-below
▪ [15] Agarwal, Alekh, et al. "Optimality and approximation with policy gradient methods in Markov decision processes." Conference on Learning Theory. PMLR, 2020.
▪ [16] Mei, Jincheng, et al. "On the global convergence rates of softmax policy gradient methods." International Conference on Machine Learning. PMLR, 2020.