Lecture 1: Introduction to Deep Learning
CSE599W: Spring 2018
Lecturers
ML Applications need more than algorithms
Learning Systems: this course
What’s this course
● Not about the learning aspect of deep learning (except for the first two lectures)
● System aspect of deep learning: faster training, efficient serving, lower
memory consumption.
Logistics
● Location/Date: Tue/Thu 11:30 am - 12:50pm MUE 153
● Join slack: https://uw-cse.slack.com dlsys channel
● We may use other times and locations for invited speakers.
● Compute Resources: AWS Educate, instructions sent via email.
● Office hour by appointment
Homeworks and Projects
● Two code assignments
● Group project
○ Two- to three-person teams
○ Poster presentation and write-up
A Crash Course on Deep Learning
Elements of Machine Learning
Model
Objective
Training
What’s Special About Deep Learning
(Figure: layer-1 feature extractor → layer-2 feature extractor → predictor)
● Compositional model
● End-to-end training
Ingredients in Deep Learning
● Model and architecture
● Objective function, training techniques
○ Which feedback should we use to guide the algorithm?
○ Supervised, RL, adversarial training.
● Regularization, initialization (coupled with modeling)
○ Dropout, Xavier
● Get a sufficient amount of data
Major Architectures
● Image modeling: convolutional nets
● Language/speech: recurrent nets
Image Modeling and Convolutional Nets
Breakthrough of Image Classification
Evolution of ConvNets
• LeNet (LeCun, 1998)
– Basic structures: convolution, max-pooling, softmax
• AlexNet (Krizhevsky et al., 2012)
– ReLU, Dropout
• GoogLeNet (Szegedy et al., 2014)
– Multiple independent pathways (sparse weight matrix)
• Inception BN (Ioffe et al., 2015)
– Batch normalization
• Residual net (He et al., 2015)
– Residual pathways
Fully Connected Layer
(Figure: a fully connected layer — every input unit connects to every output unit)
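As a rough NumPy sketch (not from the slides), a fully connected layer is just a dense matrix-vector product; the sizes below are arbitrary assumptions:

```python
import numpy as np

# Fully connected layer: every output unit depends on every input unit.
x = np.random.randn(784)        # e.g. a flattened 28x28 image (assumption)
W = np.random.randn(100, 784)   # one weight per (output, input) pair
b = np.zeros(100)
h = W @ x + b                   # 100 output units
print(h.shape)                  # (100,)
```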
Convolution = Spatial Locality + Sharing
(Figure: each output depends only on a local spatial window of the input; with sharing, the same filter weights are reused at every position instead of separate weights per position)
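A toy 1-D NumPy sketch (illustrative only) of these two ingredients: each output looks at a small local window, and the same filter weights are shared across all positions:

```python
import numpy as np

def conv1d(x, w):
    """Slide the same (shared) filter w over every local window of x."""
    k = len(w)
    return np.array([np.dot(x[i:i + k], w) for i in range(len(x) - k + 1)])

x = np.arange(6, dtype=float)       # input signal
w = np.array([0.25, 0.5, 0.25])     # one small filter, reused everywhere
print(conv1d(x, w))                 # each output depends on only 3 inputs
```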
Convolution with Multiple Channels
Source: http://cs231n.github.io/convolutional-networks/
Pooling Layer
Can be replaced by strided convolution
Source: http://cs231n.github.io/convolutional-networks/
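As a rough illustration (not from the slides), max pooling keeps the largest value in each window; a strided convolution reduces spatial resolution in a similar way:

```python
import numpy as np

def maxpool1d(x, size=2, stride=2):
    """Max pooling: keep the largest value in each (non-overlapping) window."""
    return np.array([x[i:i + size].max()
                     for i in range(0, len(x) - size + 1, stride)])

print(maxpool1d(np.array([1., 5., 2., 4., 3., 6.])))  # -> [5. 4. 6.]
```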
LeNet (LeCun 1998)
• Convolution
• Pooling
• Flatten
• Fully connected
• Softmax output (a sketch of this layer stack follows below)
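A rough sketch of a LeNet-style stack, assuming MXNet Gluon (the framework used in Lab 1); the filter counts and layer sizes here are illustrative guesses, not the exact 1998 architecture:

```python
from mxnet.gluon import nn

# LeNet-style stack: convolution -> pooling -> convolution -> pooling
# -> flatten -> fully connected layers -> 10-way output (softmax in the loss).
net = nn.Sequential()
net.add(nn.Conv2D(channels=6, kernel_size=5, activation='tanh'),
        nn.MaxPool2D(pool_size=2, strides=2),
        nn.Conv2D(channels=16, kernel_size=5, activation='tanh'),
        nn.MaxPool2D(pool_size=2, strides=2),
        nn.Flatten(),
        nn.Dense(120, activation='tanh'),
        nn.Dense(84, activation='tanh'),
        nn.Dense(10))   # softmax is applied inside SoftmaxCrossEntropyLoss
net.initialize()
```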
AlexNet (Krizhevsky et al., 2012)
Challenges: From LeNet to AlexNet
● Need much more data: ImageNet
● A much heavier computation burden: GPUs
● Overfitting prevention
○ Dropout regularization
● Stable initialization and training
○ Exploding/vanishing gradient problems
○ Requires careful tuning of initialization and data normalization
ReLU Unit
• ReLU: f(x) = max(0, x)
• Why ReLU?
– Cheap to compute
– It is roughly (piecewise) linear, which keeps gradients simple (see the small example below)
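A tiny NumPy illustration (not from the slides) of the ReLU activation and its cheap gradient:

```python
import numpy as np

def relu(x):
    return np.maximum(0, x)           # just a comparison and a select

def relu_grad(x):
    return (x > 0).astype(x.dtype)    # gradient is 1 where x > 0, else 0

x = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(relu(x))       # [0.  0.  0.  0.5 2. ]
print(relu_grad(x))  # [0.  0.  0.  1.  1. ]
```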
Dropout Regularization
● Randomly zero out neurons with probability 0.5
● During prediction, use the expected value (keep all neurons but scale their outputs by 0.5)
(Figure: a dropout mask zeroing out a random subset of hidden units)
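A minimal NumPy sketch (an illustration, not the original code) of dropout as described above: a random mask at training time, the expected value at prediction time:

```python
import numpy as np

p_keep = 0.5                                     # keep each neuron with prob 0.5

def dropout_train(h):
    mask = (np.random.rand(*h.shape) < p_keep)   # random 0/1 mask
    return h * mask                              # zero out the dropped neurons

def dropout_predict(h):
    return h * p_keep                            # expected value: keep all, scale by 0.5

h = np.random.randn(4)
print(dropout_train(h))    # roughly half the units are zeroed
print(dropout_predict(h))  # all units kept, scaled by 0.5
```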
GoogLeNet: Multiple Pathways, Fewer Parameters
Vanishing and Exploding Value Problem
● Imagine each layer multiplies its input by the same weight matrix W
○ W > 1: exponential explosion
○ W < 1: exponential vanishing
● In ConvNets the weights are not tied across layers, but their magnitudes still matter
○ Training deep nets was very sensitive to initialization (see the toy example below)
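A toy NumPy illustration (not from the slides) of how repeated multiplication blows up or shrinks values as depth grows:

```python
import numpy as np

x = np.ones(4)
for w in (1.5, 0.5):                 # "weights" slightly above / below 1
    h = x.copy()
    for layer in range(30):          # 30 layers of h -> w * h
        h = w * h
    print(w, h[0])                   # 1.5 -> ~1.9e5 (explodes); 0.5 -> ~9e-10 (vanishes)
```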
Batch Normalization: Stabilize the Magnitude
• Subtract mean
• Divide by standard deviation
• Output is invariant to input scale!
– Scale input by a constant
– Output of BN remains the same
• Impact
– Easy to tune learning rate
– Less sensitive to initialization
(Ioffe et al., 2015)
The Scale Normalization (Assumes zero mean)
(Figure: scale normalization — dividing by the standard deviation makes the output invariant to the magnitude of the input)
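A small NumPy sketch (illustrative, not the paper's code) showing that normalizing by the batch mean and standard deviation makes the output invariant to a constant rescaling of the input:

```python
import numpy as np

def batch_norm(x, eps=1e-5):
    # Per-feature normalization over the batch: subtract mean, divide by std.
    mean = x.mean(axis=0)
    var = x.var(axis=0)
    return (x - mean) / np.sqrt(var + eps)

x = np.random.randn(8, 3)            # a batch of 8 examples, 3 features
print(np.allclose(batch_norm(x), batch_norm(10 * x), atol=1e-3))  # True
```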
Residual Net (He et al., 2015)
● Instead of replacing the input with its transformation, add the transformation result to the input: output = x + F(x)
● Partly solves the vanishing/exploding value problem (a sketch follows below)
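A minimal NumPy sketch (illustrative) of a residual connection: the output is the input plus a learned transformation, so the identity path carries the signal through directly:

```python
import numpy as np

def residual_block(x, W):
    F = np.tanh(W @ x)        # some transformation of the input
    return x + F              # add it back onto the input (skip connection)

x = np.random.randn(4)
W = 0.01 * np.random.randn(4, 4)   # even a tiny transformation...
print(residual_block(x, W))        # ...leaves the output close to x
```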
Evolution of ConvNets
• LeNet (LeCun, 1998)
– Basic structures: convolution, max-pooling, softmax
• AlexNet (Krizhevsky et al., 2012)
– ReLU, Dropout
• GoogLeNet (Szegedy et al., 2014)
– Multiple independent pathways (sparse weight matrix)
• Inception BN (Ioffe et al., 2015)
– Batch normalization
• Residual net (He et al., 2015)
– Residual pathways
More Resources
● Deep learning book (Goodfellow et al.)
● Stanford CS231n: Convolutional Neural Networks for Visual Recognition
● http://dlsys.cs.washington.edu/materials
Lab1 on Thursday
● Walk through how to implement a simple model for digit recognition using MXNet Gluon
● Focus is on data I/O, model definition, and the typical training loop (a rough sketch follows below)
● Familiarize yourself with typical framework APIs for vision tasks
● Before class: sign up for AWS Educate credits
● https://aws.amazon.com/education/awseducate/apply/
● Create AWS Educate Starter Account to avoid getting charged
● Will email out instructions, but very simple to DIY, so do it today!
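For reference, a minimal sketch of the kind of Gluon training loop Lab 1 walks through; the model, dataset transforms, and hyperparameters here are assumptions, not the lab's actual code:

```python
import mxnet as mx
from mxnet import autograd, gluon
from mxnet.gluon import nn
from mxnet.gluon.data.vision import MNIST, transforms

# Model: a small multilayer perceptron (an assumption; Lab 1 may use a different model).
net = nn.Sequential()
net.add(nn.Flatten(),
        nn.Dense(128, activation='relu'),
        nn.Dense(10))
net.initialize(mx.init.Xavier())

# Data I/O: MNIST digits, converted to tensors and batched.
train_data = gluon.data.DataLoader(
    MNIST(train=True).transform_first(transforms.ToTensor()),
    batch_size=64, shuffle=True)

loss_fn = gluon.loss.SoftmaxCrossEntropyLoss()
trainer = gluon.Trainer(net.collect_params(), 'sgd', {'learning_rate': 0.1})

# Typical training loop: forward pass under autograd, backward pass, parameter update.
for epoch in range(3):
    for data, label in train_data:
        with autograd.record():
            loss = loss_fn(net(data), label)
        loss.backward()
        trainer.step(data.shape[0])
```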