Skip to main content

Supervised Vs Unsupervised Vs Reinforcement Learning

AI machine learning is becoming the most training field of the 21st century it is starting to redefine the way we live and it's time we understood what it is and why it matters in this session we'll be discussing the different types of machine learning and we'll compare them to each other so let me run you through today's agenda we're going to begin the session with an introduction to machine learning next we will discuss the types of machine learning after that we'll compare supervised unsupervised and reinforcement learning based on a few key parameters we'll finally end the session by discussing a few example problems that can be solved using supervised unsupervised and reinforcement learning algorithms so without any further delay let's get started so guys machine learning is the science of getting computers to act by feeding them data and letting them learn a few tricks on their own without being explicitly programmed now this sounds awfully a lot like a human child so let's consider a small scenario to understand machine learning now as a child if you had to distinguish between fruits such as cherries apples and oranges you wouldn't even know where to start because you're not familiar with how the fruits look now as we grow up we collect more information and start developing the capability to distinguish between various fruits the only reason why we are able to make this distinction is because we absorb our surroundings we gathered more data and we learn from our past experiences it's because our brain is capable enough to think and make decisions since we have been feeding it a lot of data and this is exactly how machine learning works it involves continuously feeding data to a machine so that it can interpret this data understand the useful insides detect patterns and ident  my key features to solve problems this is very similar to how our brain works now let's move ahead and take a look at the different types of machine learning.

 

Supervised Learning

First of all we have supervised learning now guys supervised means to oversee or direct a certain activity and make sure it's done correctly in this type of learning the machine learns under guidance so at school or teachers guided us and taught us similarly in supervised learning machines learn by feeding them label data and explicitly telling them hey this is the input and this is exactly how the output must look okay so the teacher in this case is the training data.




Unsupervised Learning

Next we have unsupervised learning unsupervised means to act without anyone's supervision or without  anybody's direction now here the data is not labeled there is no guide and the machine has to figure out the data set given and it has to find hidden patterns in order to make predictions about the output an example of unsupervised learning is an adult like you and me we don't need a guide to help us with our daily activities we can figure things out on our own without any supervision.

 


Reinforcement Learning

finally we have reinforcement learning now guys reinforcement means to establish or encourage a pattern of behavior let's say that you were dropped off at an isolated island what would you do now initially you'd panic and you'd be unsure of what to do where to get food from how to live and so on but after a while you will have to adapt you must learn how to live in the island adapt to the changing climates learn more to eat and what not to eat so here you're basically following the hit and trial concept because you new to the surrounding and the only way to learn is experience and then learn from your experience this is what reinforcement learning is it is a learning method wherein an agent which is basically you stuck on the island interacts with its environment which is the island by producing actions and discovers errors or rewards and once the agent gets trained it gets ready to predict the new data presented to it.

 


Differences Between Supervised Unsupervised And Reinforcement Learning

now let's move ahead and look at the differences between supervised answer and reinforcement learning so let's begin by looking at their definitions now like I mentioned earlier supervised learning is a type of machine learning wherein we teach the machine using label data so an input and your output is label next we have unsupervised learning over here the data provided to the machine is not labeled and the machine has to learn without any supervision so that's why it should discover hidden patterns and trends in the data finally we have reinforcement learning now the basic concept behind reinforcement learning is that there is an agent now this agent is put in an unknown environment so the agent has to explore the environment by taking actions and transitioning from one state to the other so that he can get maximum rewards now the next parameter to consider is the type of problems that are solved using supervised unsupervised and reinforcement learning so under supervised learning we have two main categories of problems we have regression problems and we have classification problems now guys there is an important difference between classification and regression basically classification is about predicting a label or a class whereas regression is about predicting a continuous quantity now let's say that you have to classify your emails into two different routes so here basically we'll be labeling our emails as spam and non-spam mails for this kind of problem where we have to assign our input data into different classes we make use of classification algorithms on the other hand regression is used to predict a continuous quantity now a continuous variable is a variable that has infinite number of possibilities for example a person's weight so someone could be 180 pounds or they could be 180 point 10 pounds or 180 point 1 1 0 pounds now the number of possibilities for weight are limitless and this is exactly what a continuous variable is so regression is a predictive analysis used to predict continuous variables here you don't have to label data in two different classes instead you have to predict a final outcome like let's say that you want to predict the price of a stock over a period for such problems you can make use of regression algorithms coming to unsupervised learning this type of learning can be used to solve association problems and clustering problems association problems basically involve discovering patterns in data finding co-occurrences and so on a classic example of Association rule mining is a relationship between bread and jam so people who tend to buy bread also tend to buy jam over here it's all about finding associations between items that frequently co-occur or items are similar to each other apart from Association problems unsupervised learning also deals with clustering and anomaly detection problems clustering is used for cases that involve targeted marketing wherein you are given a list of customers and some information about them and what you have to do is you have to cluster these customers based on their similarity now guys Digital AdWords use a clustering technique to cluster  potential buyers into different categories based on their interests and their intent anomaly detection on the other hand is used for tracking unusual activities an example of this is credit card fraud where in various unsupervised algorithms are used to detect suspicious activities then there is reinforcement learning now this type of learning is comparatively different in reinforcement learning the key difference is that the input itself depends on the actions we take for example in robotics we might start in a situation where the robot does not know anything above the surrounding it is in so after it  performs certain actions it finds out more about the world but the world it sees depends on whether it chooses to move right or whether it shows to move forward or backward in this case the robot is known as the agent and its surrounding is the environment so for each action it takes it can receive a reward or it might receive a punishment now the next parameter is the type of data used to train a machine when it comes to supervised learning it's quite clear and simple the machine will be provided with a label set of input and output data in the training phase itself so basically you feed the output of your algorithm into the system this means that in supervised learning the machine already knows the output of the algorithm before it starts working on it now an example is classifying a data set into either cats or dogs alright so if the algorithm is fed an image of a cat the image is labeled as a cat similarly for a dog so guys this is how the model is taught it's told that this is a cat by labeling it after the algorithm is taught it is then tested using a new data set but a point to remember here is that in the training phase for a supervised learning algorithm the beta is labeled alright the input is also labeled and the output is also labeled in unsupervised learning the machine is only given the input data so here we don't tell the system where to go the system has to understand itself from the input data that we give to it so it does this by finding patterns in the data so if we try to classify images into cats and dogs in unsupervised learning the machine will be fed images of cats and dogs and at the end it will form two groups one containing cats and the other containing dogs now the only difference here is that it won't add labels to the output okay it will just understand how cats look and cluster them into one group and similarly for dogs coming to reinforcement learning there is no predefined data the input depends on the actions taken by the agent now these actions are then  recorded in the form of matrices so that it can serve as a memory to the agent so basically as the agent explodes the environment it will collect data which was then being used to get the output so guys in reinforcement learning there is no predefined data set given to the machine the agent does all the work from scratch the next parameter to consider is training in supervised learning the training phase is well defined and very explicit the machine is fed training data where both the input and output is labeled and the only thing the algorithm has to do is map the input to the output so the training data act like a teacher or a guide.

check out my Fiverr: https://www.fiverr.com/share/qPPZWd

Comments