ECE 6254
Statistical Machine Learning
Spring 2024
Mark A. Davenport
Electrical & Computer Engineering
Disclaimer!
None of what I just said was written by me…
Me at 11pm last night:
– Lesson learned: If you want comedy (and nonsense), use Bard
Caveats
– I disavow the statement “this course is not all about math and equations”
– Natural language processing and self-driving cars are not going to be a central focus
Statistical machine learning
• How can we
– learn effective models from data?
– apply these models to practical inference and signal processing problems?
• Example problems: classification, regression, prediction, data modeling,
clustering, and data exploration/visualization
• Our approach: statistical inference
• Main subject of this course
– how to reason about and work with probabilistic models to help us make inferences
from data
What is machine learning?
learn: gain or acquire knowledge of or skill in (something) by
study, experience, or being taught
How do we learn that this is a tree?
My daddy told me that a tree is
a perennial plant with an
elongated stem, or trunk,
supporting leaves or branches.
This has a trunk and branches.
Therefore it is a tree.
What is machine learning?
learn: gain or acquire knowledge of or skill in (something) by
study, experience, or being taught
How do we learn that this is a tree?
EXAMPLES!
A good definition of learning for this course:
“using a set of examples to infer
something about an underlying process”
Why learn from data?
Traditional signal processing is “top down”
Given a model for our data, derive the optimal algorithm
A learning approach is more “bottom up”
Given some examples, derive a good algorithm
Sometimes a good model is really hard to derive from first principles
Examples of learning
The Netflix prize (2007)
Predict how a user will rate a movie
10% improvement = $1 million prize
• Some pattern exists
– users do not assign ratings completely at random – if you like Godfather I, you’ll
probably like Godfather II
• It is hard to pin down the pattern mathematically
• We have lots and lots of data
– we know how a user has rated other movies, and we know how other users have
rated this (and other) movies
Examples of learning
• Recommendation systems
• Speech recognition
• Image classification
• Object detection
• Language modeling
• Spam filtering
• Machine translation
• Time series forecasting (traffic, weather, markets, etc.)
• Search
• Fraud detection
• Medical diagnosis
• …
Supervised learning
We are given input data $x_1, \ldots, x_n \in \mathcal{X}$
Each $x_i$ represents a measurement or observation of some natural or man-made
phenomenon
– $x_i$ may be called an input, pattern, signal, feature vector, instance, or independent
variable
– the coordinates of $x_i$ may be called features, attributes, predictors, or covariates
In the supervised case, we are also given output data $y_1, \ldots, y_n \in \mathcal{Y}$
– $y_i$ may be called an output, label, response, or dependent variable
The data $(x_1, y_1), \ldots, (x_n, y_n)$ are called the training data
Supervised learning
We can think of a pair $(x_i, y_i)$ as obeying a (possibly noisy) input-output
relationship
The goal of supervised learning is usually to generalize the input-output
relationship so that we can predict the output associated with a previously
unseen input
The primary supervised learning
problems are (illustrated in the sketch below)
– classification: the label space $\mathcal{Y}$ is a finite set (e.g., $\mathcal{Y} = \{0, 1\}$)
– regression: the label space $\mathcal{Y}$ is continuous (e.g., $\mathcal{Y} = \mathbb{R}$)
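To make this concrete, here is a minimal sketch of my own (not a course example) showing both problems with scikit-learn; the synthetic data and the choice of logistic/linear regression are arbitrary illustrations:

```python
# A minimal sketch (not from the course) of the two primary supervised
# learning problems, using scikit-learn on synthetic data.
import numpy as np
from sklearn.linear_model import LogisticRegression, LinearRegression

rng = np.random.default_rng(0)

# Classification: y takes values in a finite set ({0, 1} here)
X = rng.normal(size=(100, 2))            # inputs x_i in R^2
y = (X[:, 0] + X[:, 1] > 0).astype(int)  # labels y_i in {0, 1}
clf = LogisticRegression().fit(X, y)
x_new = np.array([[0.5, -0.2]])          # a previously unseen input
print("predicted class:", clf.predict(x_new))

# Regression: y is real-valued
y_real = 3.0 * X[:, 0] - 2.0 * X[:, 1] + 0.1 * rng.normal(size=100)
reg = LinearRegression().fit(X, y_real)
print("predicted value:", reg.predict(x_new))
```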
Unsupervised learning
The inputs $x_1, \ldots, x_n$ are not accompanied by labels
The goal of unsupervised learning is typically not related to future observations
Instead, we want to understand the structure of the data sample itself, or to infer
some characteristic of the underlying probability distribution (a small clustering
sketch follows the list below)
Examples of unsupervised learning problems include
– clustering
– density estimation
– dimensionality reduction/feature selection
– visualization
– generative modeling
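As a concrete instance of the first item, here is a small sketch of my own (not from the course) of clustering with scikit-learn's KMeans on synthetic unlabeled data:

```python
# A minimal sketch (not from the course): clustering as an example of
# unsupervised learning -- inputs only, no labels.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
# Two unlabeled groups of points in R^2
X = np.vstack([rng.normal(loc=-2.0, size=(50, 2)),
               rng.normal(loc=+2.0, size=(50, 2))])

km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print("cluster centers:\n", km.cluster_centers_)
print("first few assignments:", km.labels_[:5])
```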
Other variants of learning
• semi-supervised learning
• self-supervised learning
• active learning
• online learning
• reinforcement learning
• anomaly detection
• transfer learning
• multi-task learning
• …
In general, most learning problems can be thought of as variants of traditional
signal processing problems, but where we have no idea (a priori) how to model
our signals
Prerequisites
• Probability
– random variables, expectation, joint distributions, independence, conditional
distributions, Bayes rule, multivariate normal distribution, …
• Linear algebra
– norms, inner products, orthogonality, linear independence, eigenvalues/vectors,
eigenvalue decompositions, …
• Multivariable calculus
– partial derivatives, gradients, the chain rule, …
• Python or similar programming experience (C or MATLAB)
Text
There is no formally required textbook for this course, but I will draw material
heavily from a few key sources
A list of other useful books and links to relevant papers will be posted on the
course webpage
Lecture notes and slides will also be posted on the course webpage
Grading
• Pre-test (5%)
• Homework (25%)
• Data challenges (10%)
• Midterm exam (20%)
• Final exam (20%)
• Final project (20%)
Distance learning
Welcome to our online students!
Recorded lectures will be available to all students
(including on-campus students)
I need your help to make this a success
Online resources:
• Course website
• Canvas
• Piazza
A brief interlude
Gradus
Descendo!
Could you learn this trick?
Suppose that
• $y$ denotes the color of the card
– 0 = black
– 1 = red
• $x$ denotes which card is hidden
– E.g., Ace of Spades, Queen of Hearts, …
You observe me doing this trick many times and form a dataset: $(x_1, y_1), \ldots, (x_n, y_n)$
Can you learn a function $f$ such that $f(x)$ is a reliable predictor of $y$?
Another approach
You watch me do this trick a couple of times and notice I always hand out 5 cards
Suppose you instead take the input $x$ to be the five cards that were handed out
Now, can you learn a function $f$ such that $f(x)$ is a reliable predictor of $y$?
Is learning even possible?
or: How I learned to stop worrying and love statistics
Supervised learning
Given training data $(x_1, y_1), \ldots, (x_n, y_n)$, we would like to learn an (unknown)
function $f$ such that $f(x) \approx y$ for inputs $x$ other than $x_1, \ldots, x_n$
but…
as we have just seen, this is impossible: without any additional assumptions, we
can conclude nothing about $f$ except (maybe) for its value on $x_1, \ldots, x_n$
Probability to the rescue!
Any $f$ agreeing with the training data may be possible
but that does not mean that every such $f$ is equally probable
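One way to see the impossibility concretely (a sketch of my own, with an arbitrary toy training set): on the 3-bit input space there are $2^8 = 256$ Boolean functions, and after observing 5 training examples, 8 of them remain consistent with the data, with the unseen inputs completely unconstrained:

```python
# A small illustration (my own, not a course example) of why learning is
# impossible without assumptions: every function agreeing with the training
# data is still "possible", and the unseen inputs remain unconstrained.
from itertools import product

inputs = list(product([0, 1], repeat=3))   # all 8 possible 3-bit inputs
# An arbitrary toy training set of 5 labeled examples
train = {(0,0,0): 0, (0,0,1): 1, (0,1,0): 1, (0,1,1): 0, (1,0,0): 1}

# Enumerate every f: {0,1}^3 -> {0,1} as a tuple of 8 output bits
consistent = [f for f in product([0, 1], repeat=8)
              if all(f[inputs.index(x)] == y for x, y in train.items())]

print("functions consistent with the training data:", len(consistent))  # 8
for x in (x for x in inputs if x not in train):
    vals = {f[inputs.index(x)] for f in consistent}
    print(f"possible values of f{x}: {sorted(vals)}")  # always [0, 1]
```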
A short digression
• Suppose that Javier has a biased coin, which lands on heads with some unknown
probability $p$
– $\mathbb{P}[\text{heads}] = p$
– $\mathbb{P}[\text{tails}] = 1 - p$
• Javier tosses the coin $n$ times
– let $\hat{p}$ denote the observed fraction of heads
Does $\hat{p}$ tell us anything about $p$?
What can we learn from $\hat{p}$?
Given enough tosses (large $n$), we expect that $\hat{p} \approx p$
Law of large numbers: $\hat{p} \to p$ in probability as $n \to \infty$
Clearly, at least in a very limited sense, we can learn something about $p$ from
observations
There is always the possibility that we are totally wrong, but given enough data,
the probability should be very small
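A quick simulation (my own sketch; the bias $p = 0.3$ and the sample sizes are arbitrary choices) shows the empirical fraction concentrating around the unknown $p$ as $n$ grows:

```python
# A simulation (not from the course) of the law of large numbers: the
# empirical fraction of heads concentrates around the unknown bias p.
import numpy as np

rng = np.random.default_rng(0)
p = 0.3                                    # the "unknown" bias
for n in [10, 100, 1000, 100000]:
    tosses = rng.random(n) < p             # n Bernoulli(p) coin tosses
    p_hat = tosses.mean()                  # empirical fraction of heads
    print(f"n = {n:6d}   p_hat = {p_hat:.4f}   "
          f"|p_hat - p| = {abs(p_hat - p):.4f}")
```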