Deep Learning with PyTorch
“What is deep learning?”
Machine learning is turning things (data)
into numbers and finding patterns in those
numbers.
The computer does this part.
How?
Code & math.
We’re going to be writing the code.
Machine Learning vs. Deep Learning
(diagram: nested fields. Deep Learning is a subset of Machine Learning, which is a subset of Artificial Intelligence.)
Traditional programming
Starts with: Inputs + Rules → Makes: Output

Machine learning algorithm
Starts with: Inputs + Output → Figures out: Rules

Rules (e.g. a recipe):
1. Cut vegetables
2. Season chicken
3. Preheat oven
4. Cook chicken for 30 minutes
5. Add vegetables
“Why use machine learning (or
deep learning)?”
Good reason: Why not?
Better reason: For a complex
problem, can you think of all the rules?
(probably not)
Source: 2020 Machine Learning Roadmap video.
“If you can build a simple rule-based system
that doesn’t require machine learning, do
that.”
— A wise software engineer… (actually rule 1 of Google’s Machine Learning Handbook)
(maybe not very simple…)
What deep learning is (typically) good for 🤖✅
• Problems with long lists of rules—when the traditional approach fails, machine learning/deep learning may help.
• Continually changing environments—deep learning can adapt (‘learn’) to new scenarios.
• Discovering insights within large collections of data—can you imagine trying to hand-craft rules for what 101 different kinds of food look like?
What deep learning is not good for 🤖🚫
• When you need explainability—the patterns learned by a deep
learning model are typically uninterpretable by a human.
• When the traditional approach is a better option — if you can
accomplish what you need with a simple rule-based system.
• When errors are unacceptable — since the outputs of a deep
learning model aren’t always predictable.
• When you don’t have much data — deep learning models
usually require a fairly large amount of data to produce great
results.
(though we’ll see how to get great results without huge amounts of data)
Machine Learning vs. Deep Learning
Machine Learning: typically used on structured data (example algorithm: gradient boosted machine)
Deep Learning: typically used on unstructured data (example algorithm: neural network)
Machine Learning vs. Deep Learning (common algorithms)
Machine learning:
• Random forest
• Gradient boosted models
• Naive Bayes
• Nearest neighbour
• Support vector machine
• …many more
(since the advent of deep learning, these are often referred to as “shallow algorithms”)

Deep learning:
• Neural networks
• Fully connected neural network
• Convolutional neural network
• Recurrent neural network
• Transformer
• …many more
What we’re focused on building
(with PyTorch)
(depending on how you represent your problem,
many algorithms can be used for both structured and unstructured data)
“What are neural networks?”
Neural Networks

Inputs → Numerical encoding → Learns representation (patterns/features/weights) → Representation outputs → Outputs

• Before data gets used with a neural network, it needs to be turned into numbers (a numerical encoding), e.g.
  [[116, 78, 15],
   [117, 43, 96],
   [125, 87, 23],
   …,
• Each of the nodes in the network is called a “hidden unit” or “neuron”.
• Choose the appropriate neural network for your problem.
• The representation outputs are numbers too, e.g.
  [[0.983, 0.004, 0.013],
   [0.110, 0.889, 0.001],
   [0.023, 0.027, 0.985],
   …,
  which get converted into human-understandable outputs (a human can understand these), such as “Ramen, Spaghetti”, “Not a disaster”, or an answer to “Hey Siri, what’s the weather today?”.
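As a small sketch of this pipeline in PyTorch, using the illustrative numbers from the diagram above (they aren’t real pixel values or model outputs):

```python
import torch

# Numerical encoding of an input (illustrative values from the diagram).
image_encoding = torch.tensor([[116, 78, 15],
                               [117, 43, 96],
                               [125, 87, 23]])

# Representation outputs (illustrative prediction probabilities per class).
prediction_probabilities = torch.tensor([[0.983, 0.004, 0.013],
                                         [0.110, 0.889, 0.001],
                                         [0.023, 0.027, 0.985]])

# The highest probability in each row gives the predicted class index.
print(prediction_probabilities.argmax(dim=1))  # tensor([0, 1, 2])
```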
Anatomy of Neural Networks
Overall architecture:
• Input layer (data goes in here): # units/neurons = 2
• Hidden layer(s) (learns patterns in data): # units/neurons = 3
• Output layer (outputs learned representation or prediction probabilities): # units/neurons = 1

Each layer is usually a combination of linear (straight line) and/or non-linear (not-straight line) functions.

Note: “patterns” is an arbitrary term; you’ll often hear “embedding”, “weights”, “feature representation”, “feature vectors” all referring to similar things.
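A minimal sketch of this 2 → 3 → 1 architecture in PyTorch; the choice of ReLU as the non-linear function is an assumption (the slide doesn’t name one):

```python
import torch
from torch import nn

# Input layer (2 units) -> hidden layer (3 units) -> output layer (1 unit)
model = nn.Sequential(
    nn.Linear(in_features=2, out_features=3),  # linear function
    nn.ReLU(),                                 # non-linear function
    nn.Linear(in_features=3, out_features=1),
)

x = torch.rand(5, 2)   # 5 samples, 2 features each
print(model(x).shape)  # torch.Size([5, 1])
```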
Types of Learning
• Supervised Learning
• Unsupervised & Self-supervised Learning
• Transfer Learning
We’ll be writing code to do these,
but the style of code can be adopted across learning paradigms.
“What is deep learning actually
used for?”
Source: 2020 Machine Learning Roadmap video.
(Some) Deep Learning Use Cases
• Recommendation
• Translation (sequence to sequence, seq2seq)
• Speech recognition, e.g. “Hey Siri, who’s the biggest big dog of them all?” (sequence to sequence, seq2seq)
• Computer Vision (classification/regression)
• Natural Language Processing (NLP) (classification/regression), e.g. spam detection:
  To: daniel@mrdbourke.com “Hey Daniel, This deep learning course is incredible! I can’t wait to use what I’ve learned!” → Not spam
  To: daniel@mrdbourke.com “Hay daniel… C0ongratu1ations! U win $1139239230” → Spam
“What is PyTorch?”
What is PyTorch?
• Most popular research deep learning framework*
• Write fast deep learning code in Python (able to run on a GPU/many GPUs)
• Able to access many pre-built deep learning models (Torch Hub/torchvision.models)
• Whole stack: preprocess data, model data, deploy model in your application/cloud
• Originally designed and used in-house by Facebook/Meta (now open-source and used by companies such as Tesla, Microsoft, OpenAI)
*Source: paperswithcode.com/trends February 2022
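As a sketch of the pre-built models bullet, loading a pretrained model from torchvision.models (the weights argument assumes torchvision 0.13 or newer):

```python
import torchvision

# Load a pretrained ResNet-18 from torchvision.models
# ("IMAGENET1K_V1" refers to weights pretrained on ImageNet).
model = torchvision.models.resnet18(weights="IMAGENET1K_V1")
```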
Why PyTorch?
Research favourite
Source: paperswithcode.com/trends February 2022
Why PyTorch?
(figure: tweet about PyTorch; source: @fchollet Twitter)
Why PyTorch?
What is a GPU/TPU?
TPU (Tensor Processing Unit)
GPU (Graphics Processing Unit)
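A small sketch of checking for a GPU in PyTorch and writing device-agnostic code (only the CUDA/GPU check is shown; TPUs need extra packages such as torch_xla, which isn’t covered here):

```python
import torch

# Check if an NVIDIA GPU is available to PyTorch.
print(torch.cuda.is_available())

# Device-agnostic setup: use the GPU if present, else the CPU.
device = "cuda" if torch.cuda.is_available() else "cpu"
x = torch.rand(3, 3).to(device)  # tensors/models can be moved to the device
print(x.device)
```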
“What is a tensor?”
Recall the neural network diagram from earlier:

Inputs → Numerical encoding → Learns representation (patterns/features/weights) → Representation outputs → Outputs

Before data gets used with an algorithm, it needs to be turned into numbers, and a human can understand the final outputs (choose the appropriate neural network for your problem). The numerical encodings on the input side, e.g.

[[116, 78, 15],
 [117, 43, 96],
 [125, 87, 23],
 …,

and the representation outputs on the other side, e.g.

[[0.983, 0.004, 0.013],
 [0.110, 0.889, 0.001],
 [0.023, 0.027, 0.985],
 …,

(which become outputs such as “Ramen, Spaghetti”, “Not spam”, or an answer to “Hey Siri, what’s the weather today?”): these are tensors!
“What are we going to
cover?”
Source: @elonmusk Twitter
What we’re going to cover
(broadly)
• Now:
• PyTorch basics & fundamentals (dealing with tensors and tensor operations)
• Later:
• Preprocessing data (getting it into tensors)
• Building and using pretrained deep learning models
• Fitting a model to the data (learning patterns)
• Making predictions with a model (using patterns)
• Evaluating model predictions
• Saving and loading models
• Using a trained model to make predictions on custom data
👩🍳 👩🔬
(we’ll be cooking up lots of code!)
What we’re going to cover
How: a PyTorch workflow (one of many)
“How should I approach
this course?”
How to approach this course
Motto #1: if in doubt, run the code!
Motto #2: Experiment, experiment, experiment!
Motto #3: Visualize, visualize, visualize!

1. Code along
2. Explore and experiment
3. Visualize what you don’t understand
4. Ask questions (including the “dumb” ones)
5. Do the exercises 🛠
6. Share your work 🤗
How not to approach this course
Avoid: 🔥🧠🔥 “I can’t learn ______”
This course:
• Course materials: https://www.github.com/mrdbourke/pytorch-deep-learning
• Course Q&A: https://www.github.com/mrdbourke/pytorch-deep-learning/discussions
• Course online book: https://learnpytorch.io

Resources:
• PyTorch website & forums: all things PyTorch
Let’s code!
Tensor dimensions

tensor([[[1, 2, 3],
         [3, 6, 9],
         [2, 4, 5]]])  →  torch.Size([1, 3, 3])

• dim=0: the outermost dimension (size 1)
• dim=1: the rows (size 3)
• dim=2: the columns (size 3)
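A short sketch of indexing into each dimension of that tensor:

```python
import torch

x = torch.tensor([[[1, 2, 3],
                   [3, 6, 9],
                   [2, 4, 5]]])

print(x.shape)     # torch.Size([1, 3, 3])
print(x[0])        # index dim=0 -> the whole 3x3 matrix
print(x[0][0])     # index dim=1 -> tensor([1, 2, 3])
print(x[0][0][0])  # index dim=2 -> tensor(1)
```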
Dot product

torch.matmul(
[[A, B, C],     [[J, K],      [[A*J + B*L + C*N, A*K + B*M + C*O],
 [D, E, F],  ,   [L, M],  ) =  [D*J + E*L + F*N, D*K + E*M + F*O],
 [G, H, I]]      [N, O]]       [G*J + H*L + I*N, G*K + H*M + I*O]]
   3x3            3x2              3x2

Numbers on the inside must match; the new size is the same as the outside numbers.

torch.matmul(
[[5, 0, 3],     [[4, 7],      [[ 44, 38],
 [3, 7, 9],  ,   [6, 8],  ) =  [126, 86],
 [3, 5, 2]]      [8, 1]]       [ 58, 63]]
   3x3            3x2             3x2

e.g. the first element: 5*4 + 0*6 + 3*8 = 20 + 0 + 24 = 44

For a live demo, check out www.matrixmultiplication.xyz
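The same example in PyTorch:

```python
import torch

# Reproducing the worked example above with torch.matmul.
A = torch.tensor([[5, 0, 3],
                  [3, 7, 9],
                  [3, 5, 2]])
B = torch.tensor([[4, 7],
                  [6, 8],
                  [8, 1]])

print(torch.matmul(A, B))
# tensor([[ 44,  38],
#         [126,  86],
#         [ 58,  63]])

# Inner dimensions must match: (3, 3) @ (3, 2) -> (3, 2)
print(A.shape, B.shape, torch.matmul(A, B).shape)
```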
Supervised learning (overview)

1. Initialise with random weights (only at beginning), e.g.
   [[0.092, 0.210, 0.415],
    [0.778, 0.929, 0.030],
    [0.019, 0.182, 0.555],
    …,
2. Show examples: inputs get a numerical encoding, e.g.
   [[116, 78, 15],
    [117, 43, 96],
    [125, 87, 23],
    …,
3. Update representation outputs, e.g.
   [[0.983, 0.004, 0.013],
    [0.110, 0.889, 0.001],
    [0.023, 0.027, 0.985],
    …,  → “Ramen, Spaghetti”
4. Repeat with more examples.

Inputs → Numerical encoding → Learns representation (patterns/features/weights) → Representation outputs → Outputs
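Sketched as toy PyTorch code; every concrete choice here (linear model, mean squared error loss, SGD, random data) is illustrative rather than taken from the slide:

```python
import torch
from torch import nn

X = torch.rand(100, 3)   # inputs (numerical encoding)
y = torch.rand(100, 1)   # known outputs (labels)

model = nn.Linear(3, 1)  # 1. initialised with random weights
loss_fn = nn.MSELoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

for epoch in range(100):      # 4. repeat with more examples
    y_pred = model(X)         # 2. show examples
    loss = loss_fn(y_pred, y)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()          # 3. update representation (weights)
```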
Tensor attributes
Attribute: Shape
Meaning: The length (number of elements) of each of the dimensions of a tensor.
Code: tensor.shape or tensor.size()

Attribute: Rank/dimensions
Meaning: The total number of tensor dimensions. A scalar has rank 0, a vector has rank 1, a matrix has rank 2, a tensor has rank n.
Code: tensor.ndim

Attribute: Specific axis or dimension (e.g. “1st axis” or “0th dimension”)
Meaning: A particular dimension of a tensor.
Code: tensor[0], tensor[:, 1]…
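A short sketch trying each attribute on a [1, 3, 3] tensor:

```python
import torch

x = torch.rand(1, 3, 3)

print(x.shape)   # torch.Size([1, 3, 3]) -- same as x.size()
print(x.ndim)    # 3 (rank / number of dimensions)
print(x[0])      # a particular dimension: index 0 of the 0th dimension
print(x[:, 1])   # all of dim 0, index 1 of dim 1
```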