CSCS 460 – Machine Learning
Faizad Ullah
Traditional Computer Science
Tasks like:
Play an audio/video file
Display a text file on screen
Perform a mathematical operation on two numbers
Sort an array of numbers using Insertion Sort
Search for a string in a text file
…
Data + Program → Output
Problems that Traditional CS Can’t Handle
Tumor? Y/N
Price?
What was said?
Summarize text
Data + Output → Program?
Machine Learning
Regression
Classification
Traditional CS: Data + Program → Output
Machine Learning: Data + Output → Program
What is Machine Learning?
Formally:
"A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E." (Tom Mitchell, 1997)
Informally:
Algorithms that improve on some task with experience.
To train a classifier, we need labelled data (called a dataset)
Machine Learning Pipeline
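As a concrete illustration of the typical pipeline stages (data, train/test split, training, evaluation), here is a minimal sketch. The scikit-learn library, the Iris dataset, and the logistic regression model are illustrative assumptions, not part of the slides.

```python
# Minimal ML pipeline sketch (assumes scikit-learn is installed).
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)                  # labelled dataset
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)           # train/test split

model = LogisticRegression(max_iter=1000)          # pick a hypothesis class
model.fit(X_train, y_train)                        # train on labelled data
print(accuracy_score(y_test, model.predict(X_test)))  # evaluate on held-out data
```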
Data – Big, Big,… data!
How do we obtain these massive datasets to train our Machine Learning models?
From real interactions, e.g., call centers
Expert annotators, e.g., hired teams of annotators
Crowdsourcing
reCAPTCHA tagging
Task-Label Relationship
Labels are dictated by the task to be performed.
Example: Speech Technologies
What was said? → Speech Recognition
Who said it? → Speaker Recognition
Was it John Doe? → Speaker Verification
Did it mention "hey Google"? → Keyword Detection
What's the language? → Language Identification
Is the language native for the speaker?
What is their height?
What is the age of the speaker?
What is the emotional state?
What was the sentiment?
Is the voice fake?
Task-Label Relationship
Example: Text Technologies
Who wrote it?
Summary of what was written?
Was it plagiarized?
What was the intent?
What language is this?
Is the language native for the speaker?
What is the author's literacy level?
What is the topic of this document?
What is the emotional state?
What was the sentiment?
Can we fake this writing style?
Challenges of ML - Explainability
A classifier can learn to classify on the basis of features that humans would not consider meaningful
If all dogs in the training data wear a collar while no cat does, the model simply learns to separate based on the collar
If all horse images carry a copyright notice, the model simply learns to recognize horses by the copyright notice
Explainable ML: the results should be understandable by humans
As opposed to a black-box system
Challenges of ML – Fairness
AI tends to reflect the biases of society
Human taggers who mark a recording as misinformation based on accent or gender
Court decisions in some countries that make a rich person's acquittal more likely
Automated standardized testing in the US could yield unfavorable results for certain demographic groups
AI plays a decisive role in hiring decisions, with up to 72% of resumes in the US never being viewed by a human (automation bias)
Decisions on immigration, bank loans, credit history checks, criminal profiling
ML in Low-resource settings
Problems where large datasets and tools are not available
Natural Language Processing and Speech
Pakistan has 71 languages
We barely have speech recognition capabilities for Urdu!
Types of Learning
Supervised
The outcome is provided along with the data.
Unsupervised
The outcome is NOT provided along with the data.
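A short sketch of the distinction, assuming scikit-learn (the dataset and algorithms are illustrative choices, not from the slides): a classifier is given the outcome y, while a clustering algorithm only sees X.

```python
# Supervised vs. unsupervised on the same data (illustrative sketch).
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier
from sklearn.cluster import KMeans

X, y = load_iris(return_X_y=True)

clf = DecisionTreeClassifier().fit(X, y)     # supervised: outcome y is provided
km = KMeans(n_clusters=3, n_init=10).fit(X)  # unsupervised: only X is provided

print(clf.predict(X[:2]))  # predicted labels
print(km.labels_[:2])      # discovered cluster assignments
```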
Supervised Learning
What does a classifier see?
• Features
[Slide exercise: list the features that distinguish day images from night images]
Day vs. Night Classifier
Unsupervised Learning
Supervised Learning Setup
Feature Space: Tabular Data
Features/Dimensions: Height (inches), Weight (kgs), B.P. Sys, B.P. Dia. Label/Class/Category: Heart disease. A record is a 4-dimensional feature vector.

Training Data/Training Split:
Height (inches) | Weight (kgs) | B.P. Sys | B.P. Dia | Heart disease
62 | 70 | 120 | 80 | No
72 | 90 | 110 | 70 | No
74 | 80 | 130 | 70 | No
65 | 120 | 150 | 90 | Yes
67 | 100 | 140 | 85 | Yes
64 | 110 | 130 | 90 | No
69 | 150 | 170 | 100 | Yes

Testing Data/Testing Split:
66 | 125 | 145 | 90 | ?
74 | 67 | 110 | 60 | ?

As labels are discrete, this is a classification task.
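A minimal sketch of this classification task in Python, assuming scikit-learn; the decision tree is an illustrative model choice, not the slides' method.

```python
# Heart-disease table as a classification problem (illustrative sketch).
from sklearn.tree import DecisionTreeClassifier

# Training split: [height, weight, bp_sys, bp_dia] -> discrete label
X_train = [[62, 70, 120, 80], [72, 90, 110, 70], [74, 80, 130, 70],
           [65, 120, 150, 90], [67, 100, 140, 85], [64, 110, 130, 90],
           [69, 150, 170, 100]]
y_train = ["No", "No", "No", "Yes", "Yes", "No", "Yes"]

# Testing split: the "?" rows whose labels we want to predict
X_test = [[66, 125, 145, 90], [74, 67, 110, 60]]

clf = DecisionTreeClassifier().fit(X_train, y_train)
print(clf.predict(X_test))  # predicted labels for the "?" rows
```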
Feature Space: Tabular Data
Features/Dimensions: Height (inches), Weight (kgs), B.P. Sys, B.P. Dia. Label: Cholesterol Level. A record is a 4-dimensional feature vector.

Training Data/Training Split:
Height (inches) | Weight (kgs) | B.P. Sys | B.P. Dia | Cholesterol Level
62 | 70 | 120 | 80 | 150
72 | 90 | 110 | 70 | 165
74 | 80 | 130 | 70 | 135
65 | 120 | 150 | 90 | 210
67 | 100 | 140 | 85 | 195
64 | 110 | 130 | 90 | 125
69 | 150 | 170 | 100 | 250

Testing Data/Testing Split:
66 | 125 | 145 | 90 | ?
74 | 67 | 110 | 60 | ?

As labels are continuous, this is a regression task.
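The same table with a continuous target becomes regression; a sketch assuming scikit-learn, with linear regression as an illustrative model choice.

```python
# Cholesterol table as a regression problem (illustrative sketch).
from sklearn.linear_model import LinearRegression

X_train = [[62, 70, 120, 80], [72, 90, 110, 70], [74, 80, 130, 70],
           [65, 120, 150, 90], [67, 100, 140, 85], [64, 110, 130, 90],
           [69, 150, 170, 100]]
y_train = [150, 165, 135, 210, 195, 125, 250]  # continuous targets

X_test = [[66, 125, 145, 90], [74, 67, 110, 60]]

reg = LinearRegression().fit(X_train, y_train)
print(reg.predict(X_test))  # predicted cholesterol levels for the "?" rows
```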
Feature Space: Image Data
Images are nothing but 2D/3D arrays of color-intensity values, typically ranging $0$–$255$.
But we said a record should be 1D!
A color image is a 3D array ($Width \times Height \times Channels$): a color image has 3 channels, while a grayscale image has 1 channel.
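The usual fix for the 1D requirement is to flatten the array; a sketch assuming NumPy, with the image size chosen arbitrarily.

```python
# Flattening a W x H x C image into a 1D feature vector (illustrative sketch).
import numpy as np

image = np.random.randint(0, 256, size=(32, 32, 3), dtype=np.uint8)  # fake color image
feature_vector = image.reshape(-1)  # flatten: 32 * 32 * 3 = 3072 features

print(image.shape, "->", feature_vector.shape)  # (32, 32, 3) -> (3072,)
```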
Feature Space: Text Data
Suppose you are given labelled textual data in an Excel sheet:

Document# | Text | Class
Training: 1 | The Best movie best | Pos
Training: 2 | The Best best ever | Pos
Training: 3 | The Best film | Pos
Training: 4 | The Worst cast ever | Neg
Testing: 5 | The Best best best worst ever | ?

the | best | movie | ever | film | worst | cast | label
1 | 1 | 1 | 0 | 0 | 0 | 0 | 1
1 | 1 | 0 | 1 | 0 | 0 | 0 | 1
1 | 1 | 0 | 0 | 1 | 0 | 0 | 1
1 | 0 | 0 | 1 | 0 | 1 | 1 | 0
1 | 1 | 0 | 1 | 0 | 1 | 0 | ?

These are called "Binary Occurrence" features.
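These features can be computed automatically; a sketch assuming scikit-learn's CountVectorizer with binary=True (note it lowercases text and orders the vocabulary alphabetically, so the columns differ from the table above).

```python
# Binary-occurrence features for the toy corpus (illustrative sketch).
from sklearn.feature_extraction.text import CountVectorizer

train_docs = ["The Best movie best", "The Best best ever",
              "The Best film", "The Worst cast ever"]
test_docs = ["The Best best best worst ever"]

vec = CountVectorizer(binary=True)       # 1 if a word occurs in the document, else 0
X_train = vec.fit_transform(train_docs)  # learn the vocabulary from the training split
X_test = vec.transform(test_docs)        # reuse the same vocabulary on the test split

print(vec.get_feature_names_out())       # ['best' 'cast' 'ever' 'film' 'movie' 'the' 'worst']
print(X_train.toarray())
```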
Rules vs. Learning
Suppose we are working on classifying emails into "spam" and "ham" (not spam)
We can write a complicated set of rules
Works well for a while
Cannot adapt well to new emails
Program could be reverse-engineered and circumvented
Learn the mapping between an email and its label using past labelled data
Can be retrained on new emails
Not easy to reverse-engineer and circumvent in all cases
Easier to plug the leaks
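To make the learning alternative concrete, here is a sketch of learning the email-to-label mapping from past labelled data; Naive Bayes and the toy emails are illustrative choices, not the slides' method.

```python
# Spam vs. ham from past labelled data (illustrative sketch).
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

emails = ["win a free prize now", "meeting at noon tomorrow",
          "free money click here", "lunch with the team"]
labels = ["spam", "ham", "spam", "ham"]  # toy past labelled data

model = make_pipeline(CountVectorizer(), MultinomialNB())
model.fit(emails, labels)                     # learn the email -> label mapping
print(model.predict(["free prize meeting"]))  # retraining on new emails is another fit()
```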
Formalizing the Setup
$$D = \{(x_1, y_1), (x_2, y_2), \dots, (x_n, y_n)\} \subseteq X \times Y$$
Where:
$D$ is the dataset
$x_i$ (or $x^i$) is the input (feature) vector of the $i$-th sample/record/instance
$X$ is the $d$-dimensional feature space ($\mathbb{R}^d$)
$Y$ is the label space
Any categorical attribute can be converted to a numerical representation.

The data points are drawn from an unknown distribution $P$:
$$(x_i, y_i) \sim P(x, y)$$
If we don't know the distribution, let's approximate it using the samples we gathered!

We want to learn a function $h \in H$ such that, for a new instance $(x, y) \sim P$,
$h(x) = y$ with high probability, or at least $h(x) \approx y$.
The new instance also has to come from the same distribution as the $(x_i, y_i)$; in plain words, don't train on dogs and ask for predictions on cats.
Training and Testing: Formally
Training: the Machine Learning algorithm takes the training data $x_1, x_2, \dots, x_n$ together with their labels/ground truth $y_1, y_2, \dots, y_n$ and produces a model $h$.
Testing: like a Traditional CS program, the model $h$ takes testing data $x \sim P$ and outputs a prediction $h(x)$.
$h(x) = y$ (ideal)
$h(x) \approx y$ (plausible)
Label Space
Binary (binary classification)
Sentiment: positive / negative
Email: spam / ham
Online transaction fraud: yes / no
Tumor: malignant / benign
$y \in \{0, 1\}$ or $y \in \{-1, 1\}$
Multi-class (multi-class classification)
Sentiment: positive / negative / neutral
Emotion: happy / sad / surprised / angry / …
Part-of-speech tag: noun / verb / adjective / adverb / …
$y \in \{0, 1, 2, \dots\}$
Real-valued (regression)
Temperature, height, age, length, weight, duration, price, …
$y \in \mathbb{R}$
Hypothesis Space
The hypothesis $h$ is sampled from a hypothesis space $H$:
$h \in H$, where $H \in \{H_D, H_R, H_{SVM}, H_{DL}, \dots\}$
$H$ can be thought of as containing families of hypotheses that share sets of assumptions, e.g.:
Support Vector Machines: $H_{SVM} = \{h_1, h_2, \dots\}$
Decision Trees: $H_D = \{h_1, h_2, \dots\}$, $h \in H_D$
Perceptron: $H_P = \{h_1, h_2, \dots\}$
Neural Networks: $H_{NN} = \{h_1, h_2, \dots\}$
…
The selection of $H$ is done manually (by the ML engineer); the selection of $h \in H$ is done automatically.
For example, for $H$ = decision trees, the $h \in H$ would be instances of decision trees of different height, arity, thresholds, etc.
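A sketch of the decision-tree example, assuming scikit-learn: each max_depth value constrains $H$ manually, while fitting picks a concrete $h$ automatically.

```python
# Different h in H for H = decision trees (illustrative sketch).
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

for depth in [1, 2, 5]:                                    # constrain H manually ...
    h = DecisionTreeClassifier(max_depth=depth).fit(X, y)  # ... h is found automatically
    print(depth, h.score(X, y))                            # training accuracy of each h
```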
So, how do we choose our $h$?
Randomly?
Exhaustively?
How do we evaluate $h$?
How to choose $h$?
Randomly
May not work well
Like using a random program to solve your sorting problem!
May work if $H$ is constrained enough
Exhaustively
Would be very slow!
The space $H$ is usually very large (if not infinite)
$H$ is usually chosen by ML engineers (you!) based on their experience
$h \in H$ is estimated efficiently using various optimization techniques (math alert!)
Before moving on to finding $h$, let's first evaluate the labels.
Book Reading
Murphy – Chapter 1
References
Murphy, Chapter 1
Alpaydin, Chapter 1
TM, Chapter 1
Lectures of Andrew Ng, Dr. Ali Raza, and Kilian Weinberger's "Machine Learning for Intelligent Systems" (CS4780/CS5780).
This disclaimer should serve as adequate citation.