What is machine learning?
Machine learning (ML) is a branch of artificial intelligence (AI) focused on enabling
computers and machines to imitate the way that humans learn, to perform tasks
autonomously, and to improve their performance and accuracy through experience and
exposure to more data.
UC Berkeleybreaks out the learning system of a machine learning algorithm into three main
parts.
1. A Decision Process: In general, machine learning algorithms are used to make a
prediction or classification. Based on some input data, which can be labeled or
unlabeled, your algorithm will produce an estimate about a pattern in the data.
2. An Error Function: An error function evaluates the prediction of the model. If there
are known examples, an error function can make a comparison to assess the accuracy
of the model.
3. A Model Optimization Process: If the model can fit better to the data points in the
training set, then weights are adjusted to reduce the discrepancy between the known
example and the model estimate. The algorithm will repeat this iterative “evaluate and
optimize” process, updating weights autonomously until a threshold of accuracy has
been met.
Industry newsletter
The latest AI trends, brought to you by experts
Get curated insights on the most important—and intriguing—AI news. Subscribe to our
weekly Think newsletter. See the IBM Privacy Statement.
Business email
Subscribe
Machine learning versus deep learning versus neural networks
Since deep learning and machine learning tend to be used interchangeably, it’s worth noting
the nuances between the two. Machine learning, deep learning, and neural networks are all
sub-fields of artificial intelligence. However, neural networks is actually a sub-field of
machine learning, and deep learning is a sub-field of neural networks.
The way in which deep learning and machine learning differ is in how each algorithm learns.
"Deep" machine learning can use labeled datasets, also known as supervised learning, to
inform its algorithm, but it doesn’t necessarily require a labeled dataset. The deep learning
process can ingest unstructured data in its raw form (e.g., text or images), and it can
automatically determine the set of features which distinguish different categories of data from
one another. This eliminates some of the human intervention required and enables the use of
large amounts of data. You can think of deep learning as "scalable machine learning" as Lex
Fridman notes in this MIT lecture1.
Classical, or "non-deep," machine learning is more dependent on human intervention to learn.
Human experts determine the set of features to understand the differences between data
inputs, usually requiring more structured data to learn.
Neural networks, or artificial neural networks (ANNs), are comprised of node layers,
containing an input layer, one or more hidden layers, and an output layer. Each node, or
artificial neuron, connects to another and has an associated weight and threshold. If the output
of any individual node is above the specified threshold value, that node is activated, sending
data to the next layer of the network. Otherwise, no data is passed along to the next layer of
the network by that node. The “deep” in deep learning is just referring to the number of layers
in a neural network. A neural network that consists of more than three layers—which would
be inclusive of the input and the output—can be considered a deep learning algorithm or a
deep neural network. A neural network that only has three layers is just a basic neural
network.
Deep learning and neural networks are credited with accelerating progress in areas such
as computer vision, natural language processing (NLP), and speech recognition.
See the blog post “AI vs. Machine Learning vs. Deep Learning vs. Neural Networks: What’s
the Difference?” for a closer look at how the different concepts relate.