MACHINE LEARNING
BHAVYA SETHI
16csu082
CONTENTS
What is Machine Learning? A definition
Machine learning is an application of artificial
intelligence (AI) that gives systems the ability to
automatically learn and improve from experience
without being explicitly programmed. Machine
learning focuses on the development of computer
programs that can access data and use it to learn for
themselves.
The process of learning begins with observations or
data, such as examples, direct experience, or
instruction, in order to look for patterns in the data and
make better decisions in the future based on the
examples that we provide. The primary aim is to
allow computers to learn automatically, without
human intervention or assistance, and to adjust their
actions accordingly.
TYPES OF LEARNING
1. Supervised learning, as the name indicates, involves
a supervisor acting as a teacher. The machine is trained
on well-labeled data, meaning that each training example
is already tagged with the correct answer. The supervised
learning algorithm analyses this training data (the set of
training examples) and is then given a new set of
examples, for which it predicts an outcome based on what
it learned from the labeled data.
For instance, suppose you are given a basket filled with
different kinds of fruit. The first step is to train the
machine on each kind of fruit, one by one, like this:
If the shape of the object is a long, curving cylinder
and its color is green-yellow, then it is labelled
Banana.
Now suppose that, after training, you are given a new fruit
from the basket, say a banana, and asked to identify it.
The machine has already learned from the previous data and
now has to apply that knowledge. It first characterizes the
fruit by its shape and color, then identifies it as BANANA
and puts it in the Banana category. Thus the machine
learns from the training data (the basket of fruits)
and then applies that knowledge to the test data (the new fruit).
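The fruit example can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the numeric features (length in cm and a curvature score from 0 to 1), the sample values, and the choice of a 1-nearest-neighbour rule are all hypothetical and are not taken from the slides.

```python
from math import dist

# Hypothetical labelled training data: ((length_cm, curvature), label).
training_data = [
    ((18.0, 0.8), "Banana"),
    ((20.0, 0.9), "Banana"),
    ((7.0, 0.1), "Apple"),
    ((8.0, 0.0), "Apple"),
]


def predict(features):
    """Return the label of the closest training example (1-nearest neighbour)."""
    nearest = min(training_data, key=lambda example: dist(example[0], features))
    return nearest[1]


# A new, unlabelled fruit (the "test data" in the text above).
new_fruit = (19.0, 0.7)
print(predict(new_fruit))
```

The training list plays the role of the basket of labelled fruits, and `predict` applies what was learned to a fruit the machine has not seen before.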
2. Unsupervised learning is the training of a machine using
information that is neither classified nor labeled,
allowing the algorithm to act on that information without
guidance. Here the task of the machine is to group unsorted
information according to similarities, patterns and
differences without any prior training on the data.
For instance, suppose the machine is given pictures of dogs
and cats that it has never seen before. It has no idea about
the features of dogs and cats, so it cannot label them as dogs
or cats. But it can group them according to their similarities,
patterns, and differences, splitting the pictures into two
parts: the first may contain all the pictures of dogs and the
second all the pictures of cats. Nothing was learned
beforehand, which means there is no training data and there
are no examples.
Unsupervised learning algorithms are classified into two
categories:
Clustering: A clustering problem is where you want to
discover the inherent groupings in the data, such as
grouping customers by purchasing behavior.
Association: An association rule learning problem is
where you want to discover rules that describe large
portions of your data, such as people that buy X also
tend to buy Y.
4. Reinforcement learning is a learning
method in which an agent interacts
with its environment by producing
actions and discovering errors or
rewards. Trial-and-error search and
delayed reward are the most relevant
characteristics of reinforcement
learning. This method allows machines
and software agents to automatically
determine the ideal behavior within a
specific context in order to maximize
their performance. Simple reward
feedback is required for the agent to
learn which action is best; this is
known as the reinforcement signal.
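Trial-and-error learning from a reward signal can be illustrated with a small epsilon-greedy agent. This is a minimal sketch, not a full reinforcement learning setup: the three actions, their fixed reward values, and the parameters `epsilon`, `steps` and `seed` are hypothetical, and rewards here are immediate, so delayed reward is not shown.

```python
import random


def run_agent(rewards, steps=500, epsilon=0.2, seed=0):
    """Estimate the value of each action by trial and error (epsilon-greedy)."""
    rng = random.Random(seed)
    estimates = [0.0] * len(rewards)  # current value estimate per action
    counts = [0] * len(rewards)       # how often each action was tried
    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random action.
            action = rng.randrange(len(rewards))
        else:
            # Exploit: pick the action that currently looks best.
            action = max(range(len(rewards)), key=lambda a: estimates[a])
        reward = rewards[action]  # the reinforcement signal
        counts[action] += 1
        # Incremental running mean of the rewards seen for this action.
        estimates[action] += (reward - estimates[action]) / counts[action]
    return estimates


# Hypothetical environment: action 1 pays the most.
estimates = run_agent([0.2, 0.8, 0.5])
best_action = max(range(len(estimates)), key=lambda a: estimates[a])
print(best_action, estimates)
```

The agent is never told which action is best; it only sees the reward for each action it tries and gradually shifts toward the one with the highest estimated value.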
TYPES OF ALGORITHMS
1. REGRESSION
2. CLUSTERING
In basic terms, the objective of clustering is to
find different groups within the elements in the
data. To do so, clustering algorithms find the
structure in the data so that elements of the
same cluster (or group) are more similar to each
other than to those from different clusters.
K-Means Clustering
K-Means algorithms are easy to implement and
computationally efficient, which are the main
reasons for their popularity. However, they are
not very good at identifying classes when the
groups do not have a spherical distribution
shape.
Key Concepts
The most commonly used distance in K-Means is the
squared Euclidean distance. For two points
$x = (x_1, \dots, x_n)$ and $y = (y_1, \dots, y_n)$:
$D(x, y) = \sum_{i=1}^{n} (x_i - y_i)^2$
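As a quick worked example, the squared Euclidean distance between two points is the sum of the squared differences of their coordinates. The function name and sample points below are illustrative assumptions.

```python
def squared_euclidean(x, y):
    """Sum of squared coordinate differences between points x and y."""
    if len(x) != len(y):
        raise ValueError("points must have the same number of coordinates")
    return sum((xi - yi) ** 2 for xi, yi in zip(x, y))


# (1 - 4)^2 + (2 - 6)^2 = 9 + 16 = 25
print(squared_euclidean((1, 2), (4, 6)))
```

Skipping the square root keeps the computation cheap and does not change which centroid is nearest to a point.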
Algorithm Steps
6. Back to step 3.
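Only the last step of the list survives here, so the sketch below assumes the standard K-Means loop (often called Lloyd's algorithm): assign each point to its nearest centroid, move each centroid to the mean of its points, and repeat until nothing changes. The function names, the sample points, and the choice to pass the initial centroids explicitly (they are usually picked at random) are assumptions made for illustration.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def k_means(points, centroids, max_iterations=100):
    """Cluster points around the given initial centroids."""
    centroids = [tuple(c) for c in centroids]
    clusters = [[] for _ in centroids]
    for _ in range(max_iterations):
        # Assignment step: put each point in the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: squared_distance(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        new_centroids = []
        for old, cluster in zip(centroids, clusters):
            if cluster:
                mean = tuple(sum(coord) / len(cluster) for coord in zip(*cluster))
                new_centroids.append(mean)
            else:
                new_centroids.append(old)  # keep an empty cluster's centroid
        # Stop when the centroids no longer move; otherwise repeat.
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, clusters


points = [(1.0, 1.0), (1.0, 2.0), (2.0, 1.0),
          (8.0, 8.0), (9.0, 8.0), (8.0, 9.0)]
centroids, clusters = k_means(points, centroids=[(1.0, 1.0), (8.0, 8.0)])
print(centroids)
print(clusters)
```

With these two well-separated groups the loop settles after a couple of iterations; on real data the result depends on the initial centroids, so K-Means is often run several times with different starting points.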
3. ASSOCIATION
In association rule mining, if a person buys a
product, then another product that complements
the first one appears in the wish list.
E.g., if a person buys a mobile phone online, they
might get a wish list containing items such as a back
cover, earphones, a power bank and many more
items.
It is an unsupervised learning technique.
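The strength of a rule such as "mobile phone, therefore back cover" is commonly measured with support and confidence. The sketch below computes both on a handful of made-up orders; the order data and function names are hypothetical and serve only to illustrate the idea.

```python
# Hypothetical purchase history: each order is a set of items.
orders = [
    {"mobile phone", "back cover", "earphones"},
    {"mobile phone", "back cover"},
    {"mobile phone", "power bank"},
    {"earphones"},
]


def support(itemset):
    """Fraction of orders that contain every item in itemset."""
    matching = sum(1 for order in orders if itemset <= order)
    return matching / len(orders)


def confidence(antecedent, consequent):
    """Of the orders containing antecedent, the fraction that also contain consequent."""
    base = support(antecedent)
    if base == 0:
        return 0.0  # the rule never applies in this data
    return support(antecedent | consequent) / base


print(support({"mobile phone"}))
print(confidence({"mobile phone"}, {"back cover"}))
```

A rule with high confidence, such as a back cover being bought in two of the three phone orders here, is a candidate for the kind of wish-list suggestion described above.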
4. CLASSIFICATION