CSE 445 - Lecture 1 - Machine Learning Introduction

CSE 445: Machine Learning is a course taught by Intisar Tahmid Naheen at North South University, covering essential concepts, resources, and project requirements. The course emphasizes various machine learning techniques, including supervised, unsupervised, and reinforcement learning, while addressing common challenges like overfitting and model evaluation. Recommended resources include textbooks and online courses, with a focus on practical applications and group projects.

CSE 445: Machine Learning

Introduction

Instructor: Intisar Tahmid Naheen, North South University

Resources
▪ Slides provided in course should be enough – but there is a plethora of
fantastic resources available, so use them!
▪ Recommended Books:
▪ Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow by Aurélien
Géron (will be followed extensively in the course with code examples from
https://github.com/ageron/handson-ml )
▪ Pattern Recognition and Machine Learning by Christopher Bishop (excellent
resource for mathematical foundations)
▪ The Elements of Statistical Learning by Trevor Hastie, Robert Tibshirani, and
Jerome Friedman (good reference)
▪ Additional Material:
▪ Andrew Ng’s course on Machine Learning available on Coursera
▪ CS 189, Berkeley
▪ CS 229, Stanford
Helpful Prerequisites
▪ MAT361 – Probability & Statistics
▪ Probability distribution, Random Variable, Conditional Probability, Variance (a
few of the important concepts to recall)
▪ MAT125 – Linear Algebra
▪ Matrix Multiplication, Eigenvalues, Eigenvectors
▪ Basic programming background in Python (an OK understanding of Python
syntax is all that’s necessary – Geron’s textbook has excellent code examples)
▪ None of these are compulsory, but the material is easier to grasp if you have
completed them

Course Project
▪ Groups of up to 3 members (3 is a hard maximum)
▪ Video demo submission and in-person/online presentation at the end of the
semester
▪ 4-6 page Report due at semester end, IEEE format – must include link to GitHub
repo
▪ Potential Topics (few examples):
▪ Covid-19
▪ Computer Vision
▪ Natural Language Processing
▪ Reinforcement Learning
▪ Speech & Music Recognition
▪ Biomedical Imaging and Biosignals
What is Machine Learning?
Tom Mitchell (1997): a computer program is
said to learn from experience E with respect
to some class of tasks T and performance
measure P, if its performance at tasks in T, as
measured by P, improves with experience E.

Example:
Task: Playing Checkers
Experience (data): games played by the
program (with itself)
Performance measure: winning rate
Definition of Machine Learning
Arthur Samuel (1959): Machine Learning is the
field of study that gives the computer the ability
to learn without being explicitly programmed.


Traditional Programming

• Traditional Programming: writing a set of RULES to find ANSWERS from DATA
The ML Approach
Machine Learning: Use DATA and ANSWERS to learn the underlying set of RULES

Great for:
• Problems that require a lot of fine-tuning or a long list of rules
• Changing environments – ML systems can ADAPT
• Getting insights from large amounts of data
• Complex problems that yield no good solution with a traditional approach
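
The contrast between the two approaches can be sketched in a few lines of Python. Everything here is made up for illustration (the toy spam task, the data, and the function names): the first function hard-codes a rule, while the second picks the rule from DATA and ANSWERS.

```python
# Toy task (hypothetical): flag a message as spam from the number of
# exclamation marks it contains.

def rule_based_is_spam(exclamations):
    """Traditional programming: a human writes the RULE by hand."""
    return exclamations >= 5  # threshold chosen by the programmer


def learn_threshold(counts, labels):
    """ML approach: pick the threshold that agrees best with the labelled data."""
    best_threshold, best_correct = None, -1
    for t in sorted(set(counts)):
        # Count how many examples the rule "spam if count >= t" gets right.
        correct = sum((c >= t) == y for c, y in zip(counts, labels))
        if correct > best_correct:
            best_threshold, best_correct = t, correct
    return best_threshold


# Made-up training data: exclamation counts and whether each message was spam.
counts = [0, 1, 1, 2, 3, 4, 6, 7]
labels = [False, False, False, False, True, True, True, True]

threshold = learn_threshold(counts, labels)
print("learned rule: spam if exclamations >=", threshold)
```

If the data changes (say, spammers start using fewer exclamation marks), re-running `learn_threshold` adapts the rule, whereas the hand-written rule has to be edited by a person.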
Deep Learning
▪ Subset of ML - loosely mimics
structure/function of human brain
▪ Unlike traditional ML, does not require
manual feature extraction
▪ Keeps getting better with more data
(typically)
▪ Computer Vision (CNN, GAN)
▪ Natural Language Processing (RNN,
LSTM)
▪ Automatic Speech Recognition (RNN)
Summary – AI vs ML vs DL
▪ Nested subsets: DL is a subset of ML, which is a subset of AI
▪ 1950 – 1990: AI in the form of expert systems (airplane
autopilot) and games (checkers, chess)
▪ 1990 – : Statistical approaches with ML end the AI winter
▪ 2010 – : Deep Learning revolutionizes CV and NLP, among
other applications
▪ Narrow AI
▪ Systems can do a few defined things (such as playing
chess, or driving a car) as well as, or better than, humans
▪ Can’t do EVERYTHING a human being can do – yet
▪ AI is not “taking over the world” anytime soon
▪ Tell your uncles to relax and stop using Whatsapp
What kind of ML system is it?
▪ Useful to classify ML systems based on the following criteria:
1. Does it require human supervision?
➢ Supervised Learning
➢ Semisupervised Learning
➢ Unsupervised Learning
➢ Reinforcement Learning
2. Can it learn incrementally on the fly?
➢ Online Learning
➢ Batch Learning
3. Does the system build a predictive model?
➢ Model-based Learning
➢ Instance-based Learning
• These are not exclusive – can be combined
• e.g. Spam filter may learn on the fly with a deep neural network – online,
model-based, supervised learning system
Supervised Learning
▪ Training data fed to algorithm
includes the desired
answers/solutions (labels)
▪ Example algorithms:
▪ Linear Regression
▪ Logistic Regression
▪ SVM
▪ Decision Tree
▪ Neural Network
Unsupervised Learning
▪ Training data is unlabeled
▪ System learns without direct human
supervision
▪ Widely used in:
▪ Clustering
▪ Anomaly detection
▪ Association mining
▪ Data preprocessing
▪ Example algorithms:
▪ K-means
▪ PCA
▪ SVD
▪ ICA
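
As a rough illustration of clustering unlabeled data, here is a minimal K-means sketch in plain Python. The 1-D points and the starting centroids are made up, and the code assumes the simplest variant of the algorithm (fixed number of iterations, no random restarts); a library implementation would normally be used in practice.

```python
def kmeans_1d(points, centroids, iterations=10):
    """K-means (Lloyd's algorithm) on 1-D data, starting from given centroids."""
    centroids = list(centroids)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its old centroid).
        centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters


# Made-up, unlabeled data with two visible groups.
points = [1.0, 1.5, 2.0, 8.0, 8.5, 9.0]
centroids, clusters = kmeans_1d(points, centroids=[1.0, 2.0])
print(centroids)
print(clusters)
```

No labels are supplied anywhere: the two groups emerge only from the distances between points.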
Semisupervised Learning
▪ Partially labeled data
▪ Unsupervised learning used
to cluster similar data
together
▪ Human input taken to label
the clusters
▪ e.g. Google Photos will
cluster similar faces, and ask
the user if they are the
same person
Reinforcement Learning
▪ The learning system (agent) can:
▪ Observe the environment
▪ Select and perform an action
▪ Get rewards/penalties as a result
▪ Learns what the best policy should be
▪ Policy defines what actions should be
chosen in a certain situation
▪ Very effective in controlled
environments (such as a game of chess)
▪ With the progress in deep learning,
increasingly used in more complex
tasks (such as driving the Mars rover)
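
The observe / act / reward loop can be sketched with tabular Q-learning on a tiny corridor world. This is an illustrative setup, not one from the slides: the environment, the constants, and the choice of Q-learning are all assumptions, and the agent tries every action in every state in turn (instead of exploring randomly) so that the run is deterministic.

```python
N_STATES = 5          # corridor cells 0..4; cell 4 is the goal
ACTIONS = (-1, +1)    # index 0 = move left, index 1 = move right
GAMMA = 0.9           # discount factor for future rewards
ALPHA = 1.0           # learning rate (1.0 is acceptable: the world is deterministic)


def step(state, action):
    """Environment: returns (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


# q[s][a] estimates the long-term reward of taking action a in state s.
q = [[0.0, 0.0] for _ in range(N_STATES)]

for _ in range(20):                       # repeated sweeps over the corridor
    for s in range(N_STATES - 1):         # the goal state needs no action
        for a_idx, action in enumerate(ACTIONS):
            s2, reward, done = step(s, action)
            # Q-learning target: immediate reward plus discounted best future value.
            target = reward + (0.0 if done else GAMMA * max(q[s2]))
            q[s][a_idx] += ALPHA * (target - q[s][a_idx])

# The learned policy: in each state, choose the action with the higher value.
policy = ["left" if q[s][0] > q[s][1] else "right" for s in range(N_STATES - 1)]
print(policy)
```

The reward is only given at the goal, yet the values propagate backwards so that the policy points towards the goal from every cell.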
Batch Learning vs Online Learning
▪ Batch Learning
▪ Not capable of learning after
deployment
▪ Must be retrained from scratch –
computationally expensive!
▪ Online Learning
▪ Can continue to learn after
deployment
▪ Can take advantage of parallel
computing – no down time
▪ Preferred choice in production
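
A minimal sketch of the difference, using a deliberately trivial "model" (the mean of the values seen so far). The class and function names are hypothetical; the point is only that the batch version needs the whole dataset each time, while the online version updates from one instance at a time.

```python
def batch_fit(values):
    """Batch learning: recompute the model from the full dataset."""
    return sum(values) / len(values)


class OnlineMean:
    """Online learning: update the model incrementally, one instance at a time."""

    def __init__(self):
        self.n = 0
        self.mean = 0.0

    def learn_one(self, x):
        self.n += 1
        # Incremental update: nudge the estimate towards the new value.
        self.mean += (x - self.mean) / self.n
        return self.mean


stream = [2.0, 4.0, 6.0, 8.0]   # made-up data arriving over time

model = OnlineMean()
history = [model.learn_one(x) for x in stream]

print(history)              # the online estimate after each new instance
print(batch_fit(stream))    # the batch result needs all the data at once
```

Both approaches end at the same estimate here, but the online model never had to store or revisit earlier instances.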
Example ML Task: Does money make people happy?

• Life Satisfaction data from OECD
• GDP per capita data from IMF

What relationship can we infer between life satisfaction and GDP per capita
from the graph?
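
One way to make the question concrete is to fit a straight line, life_satisfaction = intercept + slope × GDP_per_capita, with ordinary least squares. The sketch below assumes a linear relationship and uses placeholder numbers, not the actual OECD/IMF figures.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Placeholder data (NOT real OECD/IMF values): GDP per capita in thousands
# of USD, and a life-satisfaction score for five hypothetical countries.
gdp_per_capita = [10.0, 20.0, 30.0, 40.0, 50.0]
life_satisfaction = [5.1, 5.4, 6.0, 6.6, 6.9]

intercept, slope = fit_line(gdp_per_capita, life_satisfaction)

# Predict the score for a hypothetical country with GDP per capita of 35k USD.
prediction = intercept + slope * 35.0
print(intercept, slope, prediction)
```

This is model-based, supervised learning: the labelled examples are summarised into two parameters, which are then used to predict for a new instance.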
Problems with Machine Learning
▪ 3 V’s of Big Data
▪ Volume, Variety, Velocity
▪ Problem #1: Training data!
▪ Insufficient quantity
▪ Nonrepresentative data
▪ Poor-quality data
▪ Problem #2: How “fit” is it?
▪ Overfitting data
▪ Underfitting data
▪ Problem #3: Which features should be used?
▪ Deep Learning automates feature selection
Overfitting
▪ Most common problem in ML – do not overgeneralize!
▪ The polynomial model fits the training data better than the linear model
▪ But how does it do on test data?
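
The effect can be reproduced on a handful of made-up points. In this sketch the "polynomial model" is the degree-4 polynomial that passes exactly through all five training points (evaluated with Lagrange interpolation), and the linear model is a least-squares line; both choices and all the numbers are illustrative assumptions.

```python
def fit_line(xs, ys):
    """Least-squares line; returns a prediction function."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return lambda x: intercept + slope * x


def fit_interpolating_polynomial(xs, ys):
    """Polynomial through every training point (Lagrange form)."""
    def predict(x):
        total = 0.0
        for i, (xi, yi) in enumerate(zip(xs, ys)):
            weight = 1.0
            for j, xj in enumerate(xs):
                if j != i:
                    weight *= (x - xj) / (xi - xj)
            total += yi * weight
        return total
    return predict


def mse(predict, xs, ys):
    """Mean squared error of a model on a dataset."""
    return sum((predict(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


# Made-up data: the underlying trend is y = x, the training labels are noisy.
train_x, train_y = [0.0, 1.0, 2.0, 3.0, 4.0], [0.0, 2.0, 1.0, 4.0, 3.0]
test_x, test_y = [0.5, 1.5, 2.5, 3.5], [0.5, 1.5, 2.5, 3.5]

line = fit_line(train_x, train_y)
poly = fit_interpolating_polynomial(train_x, train_y)

line_train, line_test = mse(line, train_x, train_y), mse(line, test_x, test_y)
poly_train, poly_test = mse(poly, train_x, train_y), mse(poly, test_x, test_y)

print("linear     train/test MSE:", line_train, line_test)
print("polynomial train/test MSE:", poly_train, poly_test)
```

The polynomial reaches zero training error by memorising the noise, and pays for it with a much larger error on the unseen points.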
How to avoid overfitting
▪ Tip #1: REGULARIZATION – USE IT
▪ Constrain model to keep it simple – reduce risk of overfitting
▪ If you can stand on one leg, you’ll be able to stay balanced with two legs
▪ Hyperparameters – control level of regularization
▪ Tip #2: Get more training data, and reduce noise in it
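
As one possible illustration of Tip #1, the sketch below uses ridge (L2) regularization on a straight-line model. The slides do not name a specific technique, so ridge is an assumption, as are the data and the name `lam` for the regularization hyperparameter. The penalty is applied to the slope only, which gives the closed form slope = Sxy / (Sxx + lam).

```python
def fit_ridge_line(xs, ys, lam):
    """Line fit that minimises squared error + lam * slope**2."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / (sxx + lam)          # lam = 0 gives ordinary least squares
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Made-up noisy training data.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [0.0, 2.0, 1.0, 4.0, 3.0]

# A larger hyperparameter value constrains the model more strongly.
slopes = [fit_ridge_line(xs, ys, lam)[1] for lam in (0.0, 10.0, 30.0)]
print(slopes)
```

The hyperparameter is not learned from the training data itself; it is set beforehand and controls how simple the model is forced to be.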
Model Evaluation
▪ How good is your model?
▪ Test it on new data – data not seen by the model ever before!
▪ Keep 80% for training, set 20% for testing
▪ NEVER go below 10% test data – a genuinely better model matters more than a
better-looking “accuracy” figure
▪ How to regularize?
▪ Keep a portion of training data held out for validation
▪ Alternatively, use cross-validation (many validation sets instead of one)
▪ Pick the hyperparameters that work best on validation, then evaluate the
final model on the test dataset
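
The splitting logic itself is short. The helpers below are a hypothetical sketch in plain Python (libraries such as scikit-learn ship their own utilities for this); the default fractions follow a 60/20/20 split, and the function names and the fixed seed are assumptions made for the example.

```python
import random


def train_val_test_split(data, val_frac=0.2, test_frac=0.2, seed=0):
    """Shuffle once, then carve out test, validation, and training sets."""
    items = list(data)
    random.Random(seed).shuffle(items)   # fixed seed keeps the split reproducible
    n_test = round(len(items) * test_frac)
    n_val = round(len(items) * val_frac)
    test = items[:n_test]
    val = items[n_test:n_test + n_val]
    train = items[n_test + n_val:]
    return train, val, test


def k_fold_indices(n, k):
    """Yield (train_idx, val_idx) pairs for k-fold cross-validation."""
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        val_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n))
        yield train_idx, val_idx
        start += size


train, val, test = train_val_test_split(range(100))
print(len(train), len(val), len(test))

# With cross-validation, every instance is used for validation exactly once.
folds = list(k_fold_indices(10, 5))
for train_idx, val_idx in folds:
    print(val_idx)
```

Hyperparameters are compared on `val` (or across the folds); `test` is touched only once, at the very end.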
Ratios
▪ A great model
▪ trained with 60% training data, 20% validation data, and 20% testing data
▪ An okay model
▪ trained with 70% training data, 15% validation data, and 15% testing data
▪ A barely-acceptable model
▪ trained with 80% training data, 10% validation data, and 10% testing data
▪ Models with worse ratios – hacks
▪ Unless there are millions of instances in the dataset
▪ “No Free Lunch” theorem
▪ Only way to know for sure which model works best is to evaluate them
▪ Make reasonable assumptions about your data to select model
