AI machine learning is becoming the
most training field of the 21st century it is starting to redefine the way we
live and it's time we understood what it is and why it matters in this session
we'll be discussing the different types of machine learning and we'll compare
them to each other so let me run you through today's agenda we're going to
begin the session with an introduction to machine learning next we will discuss
the types of machine learning after that we'll compare supervised unsupervised
and reinforcement learning based on a few key parameters we'll finally end the
session by discussing a few example problems that can be solved using
supervised unsupervised and reinforcement learning algorithms so without any
further delay let's get started so guys machine learning is the science of
getting computers to act by feeding them data and letting them learn a few
tricks on their own without being explicitly programmed now this sounds awfully
a lot like a human child so let's consider a small scenario to understand
machine learning now as a child if you had to distinguish between fruits such
as cherries apples and oranges you wouldn't even know where to start because
you're not familiar with how the fruits look now as we grow up we collect more
information and start developing the capability to distinguish between various
fruits the only reason why we are able to make this distinction is because we
absorb our surroundings we gathered more data and we learn from our past
experiences it's because our brain is capable enough to think and make
decisions since we have been feeding it a lot of data and this is exactly how
machine learning works it involves continuously feeding data to a machine so
that it can interpret this data understand the useful insides detect patterns
and ident my key features to solve
problems this is very similar to how our brain works now let's move ahead and
take a look at the different types of machine learning.
Supervised Learning
First of all we have supervised learning now guys supervised means to oversee or direct a certain activity and make sure it's done correctly in this type of learning the machine learns under guidance so at school or teachers guided us and taught us similarly in supervised learning machines learn by feeding them label data and explicitly telling them hey this is the input and this is exactly how the output must look okay so the teacher in this case is the training data.
Unsupervised Learning
Next we have unsupervised learning
unsupervised means to act without anyone's supervision or without anybody's direction now here the data is not
labeled there is no guide and the machine has to figure out the data set given
and it has to find hidden patterns in order to make predictions about the output
an example of unsupervised learning is an adult like you and me we don't need a
guide to help us with our daily activities we can figure things out on our own
without any supervision.
Reinforcement Learning
finally we have reinforcement
learning now guys reinforcement means to establish or encourage a pattern of
behavior let's say that you were dropped off at an isolated island what would
you do now initially you'd panic and you'd be unsure of what to do where to get
food from how to live and so on but after a while you will have to adapt you
must learn how to live in the island adapt to the changing climates learn more
to eat and what not to eat so here you're basically following the hit and trial
concept because you new to the surrounding and the only way to learn is experience
and then learn from your experience this is what reinforcement learning is it
is a learning method wherein an agent which is basically you stuck on the
island interacts with its environment which is the island by producing actions
and discovers errors or rewards and once the agent gets trained it gets ready
to predict the new data presented to it.
Differences Between Supervised Unsupervised And Reinforcement Learning
now let's move ahead and look at
the differences between supervised answer and reinforcement learning so let's
begin by looking at their definitions now like I mentioned earlier supervised
learning is a type of machine learning wherein we teach the machine using label
data so an input and your output is label next we have unsupervised learning
over here the data provided to the machine is not labeled and the machine has
to learn without any supervision so that's why it should discover hidden
patterns and trends in the data finally we have reinforcement learning now the
basic concept behind reinforcement learning is that there is an agent now this
agent is put in an unknown environment so the agent has to explore the
environment by taking actions and transitioning from one state to the other so
that he can get maximum rewards now the next parameter to consider is the type
of problems that are solved using supervised unsupervised and reinforcement
learning so under supervised learning we have two main categories of problems
we have regression problems and we have classification problems now guys there
is an important difference between classification and regression basically
classification is about predicting a label or a class whereas regression is
about predicting a continuous quantity now let's say that you have to classify
your emails into two different routes so here basically we'll be labeling our
emails as spam and non-spam mails for this kind of problem where we have to
assign our input data into different classes we make use of classification
algorithms on the other hand regression is used to predict a continuous
quantity now a continuous variable is a variable that has infinite number of
possibilities for example a person's weight so someone could be 180 pounds or
they could be 180 point 10 pounds or 180 point 1 1 0 pounds now the number of
possibilities for weight are limitless and this is exactly what a continuous
variable is so regression is a predictive analysis used to predict continuous
variables here you don't have to label data in two different classes instead
you have to predict a final outcome like let's say that you want to predict the
price of a stock over a period for such problems you can make use of regression
algorithms coming to unsupervised learning this type of learning can be used to
solve association problems and clustering problems association problems
basically involve discovering patterns in data finding co-occurrences and so on
a classic example of Association rule mining is a relationship between bread
and jam so people who tend to buy bread also tend to buy jam over here it's all
about finding associations between items that frequently co-occur or items are
similar to each other apart from Association problems unsupervised learning
also deals with clustering and anomaly detection problems clustering is used
for cases that involve targeted marketing wherein you are given a list of
customers and some information about them and what you have to do is you have
to cluster these customers based on their similarity now guys Digital AdWords
use a clustering technique to cluster potential
buyers into different categories based on their interests and their intent
anomaly detection on the other hand is used for tracking unusual activities an
example of this is credit card fraud where in various unsupervised algorithms
are used to detect suspicious activities then there is reinforcement learning
now this type of learning is comparatively different in reinforcement learning
the key difference is that the input itself depends on the actions we take for
example in robotics we might start in a situation where the robot does not know
anything above the surrounding it is in so after it performs certain actions it finds out more
about the world but the world it sees depends on whether it chooses to move
right or whether it shows to move forward or backward in this case the robot is
known as the agent and its surrounding is the environment so for each action it
takes it can receive a reward or it might receive a punishment now the next
parameter is the type of data used to train a machine when it comes to
supervised learning it's quite clear and simple the machine will be provided
with a label set of input and output data in the training phase itself so
basically you feed the output of your algorithm into the system this means that
in supervised learning the machine already knows the output of the algorithm
before it starts working on it now an example is classifying a data set into
either cats or dogs alright so if the algorithm is fed an image of a cat the
image is labeled as a cat similarly for a dog so guys this is how the model is
taught it's told that this is a cat by labeling it after the algorithm is
taught it is then tested using a new data set but a point to remember here is
that in the training phase for a supervised learning algorithm the beta is
labeled alright the input is also labeled and the output is also labeled in
unsupervised learning the machine is only given the input data so here we don't
tell the system where to go the system has to understand itself from the input
data that we give to it so it does this by finding patterns in the data so if
we try to classify images into cats and dogs in unsupervised learning the
machine will be fed images of cats and dogs and at the end it will form two
groups one containing cats and the other containing dogs now the only
difference here is that it won't add labels to the output okay it will just
understand how cats look and cluster them into one group and similarly for dogs
coming to reinforcement learning there is no predefined data the input depends
on the actions taken by the agent now these actions are then recorded in the form of matrices so that it
can serve as a memory to the agent so basically as the agent explodes the
environment it will collect data which was then being used to get the output so
guys in reinforcement learning there is no predefined data set given to the
machine the agent does all the work from scratch the next parameter to consider
is training in supervised learning the training phase is well defined and very
explicit the machine is fed training data where both the input and output is
labeled and the only thing the algorithm has to do is map the input to the
output so the training data act like a teacher or a guide.
check out my Fiverr: https://www.fiverr.com/share/qPPZWd
Comments
Post a Comment
If you have any queries please let me know.