Supervised and Unsupervised learning

September 19, 2014 Saimadhu Polamuri

In the world of data science supervised, and unsupervised learning algorithms were the famous words, we could hear more frequently these while we were talking with the people who are working in data science field. Furthermore, the key differences between these two learning algorithms are the must learn concepts for differentiating the real world problems.

Supervised Learning Wiki Definition

Supervised learning is a data mining task of inferring a function from labeled training data.The training data consist of a set of training examples. In supervised learning, each example is a pair consisting of an input object (typically a vector) and the desired output value (also called the supervisory signal).

A supervised learning algorithm analyzes the training data and produces an inferred function, which can used for mapping new examples. An optimal scenario will allow for the algorithm to correctly determine the class labels for unseen instances. This requires the learning algorithm to generalize from the training data to unseen situations in a “reasonable” way.

Unsupervised Learning Wiki Definition

In data mining or even in data science world, the problem of an unsupervised learning task is trying to find hidden structure in unlabeled data. Since the examples given to the learner are unlabeled, there is no error or reward signal to evaluate a potential solution.

Supervised and unsupervised learning explanation with a real-life examples Click To Tweet

Supervised and unsupervised learning with a real-life example

Suppose you had a basket and f illed it with different kinds of fruits.
Your task is to arrange them into groups.
For understanding let me explain the names of the fruits in our basket.
We have four types of fruits. They are

APPLE

BANANA

GRAPE

CHERRIES

Supervised Learning:

You already learn from your previous work about the physical characters of fruits
So arranging the same type of fruits at one place is easy now
In data mining terminology the earlier work is called as training the data
You already learn the things from your train data. This is because of response variable
Response variable means just a decision variable
You can observe response variable below (FRUIT NAME)

No.	SIZE	COLOR	SHAPE	FRUIT NAME
1	Big	Red	Rounded shape with depression at the top	Apple
2	Small	Red	Heart-shaped to nearly globular	Cherry
3	Big	Green	Long curving cylinder	Banana
4	Small	Green	Round to oval,Bunch shape Cylindrical	Grape

Suppose you have taken a new fruit from the basket then you will see the size, color, and shape of that particular fruit.
If size is Big, color is Red, the shape is rounded shape with a depression at the top, you will confirm the fruit name as apple and you will put in apple group.
Likewise for other fruits also.
The job of grouping fruits was done and the happy ending.
You can observe in the table that a column was labeled as “FRUIT NAME“. This is called as a response variable.
If you learn the thing before from training data and then applying that knowledge to the test data(for new fruit), This type of learning is called as Supervised Learning.

Supervised Learning Algorithms:

All classification and regression algorithms come under supervised learning.

Logistic Regression
Decision trees
Support vector machine (SVM)
k-Nearest Neighbors
Naive Bayes
Random forest
Linear regression
polynomial regression
SVM for regression

Unsupervised Learning:

Suppose you have a basket and it is filled with some different types of fruits and your task is to arrange them as groups.
This time, you don’t know anything about the fruits, honestly saying this is the first time you have seen them. You have no clue about those.
So, how will you arrange them?
What will you do first???
You will take a fruit and you will arrange them by considering the physical character of that particular fruit.
Suppose you have considered color.
- Then you will arrange them on considering base condition as color.
- Then the groups will be something like this.
  - RED COLOR GROUP: apples & cherry fruits.
  - GREEN COLOR GROUP: bananas & grapes.
So now you will take another physical character such as size.
- RED COLOR AND BIG SIZE: apple.
- RED COLOR AND SMALL SIZE: cherry fruits.
- GREEN COLOR AND BIG SIZE: bananas.
- GREEN COLOR AND SMALL SIZE: grapes.
The job has done, the happy ending.
Here you did not learn anything before ,means no train data and no response variable.
In data mining or machine learning, this kind of learning is known as unsupervised learning.

Unsupervised learning algorithms:

All clustering algorithms come under unsupervised learning algorithms.

K – means clustering
Hierarchical clustering
Hidden Markov models

Summary:

Let’s summarize what we have learned in supervised and unsupervised learning algorithms post.

Supervised learning: Learning from the know label data to create a model then predicting target class for the given input data.

Unsupervised learning: Learning from the unlabeled data to differentiating the given input data.

Title of the course	Course Link	What You Will Learn
Pattern Discovery in Data Mining	Pattern Discovery in Data Mining	Will learn the basic concepts of data mining and it’s real world applications. Will also learn data-driven methods and some interesting of pattern discovery. Practice the scalable pattern discovery methods on massive transaction data.
Introduction to machine learning	Machine Learning	Introduce the basic machine learning, data mining, and pattern recognization concepts. In details differences of supervised and unsupervised learning algorithms. Lot more case studies and machine learning applications.
Data Mining with Python	Data Mining with Python: Classification and Regression	Understand the key concepts in data mining and will learn how to apply these concepts to solve the real world problems. Will get hands on experience with python programming language. Hands on experience with numpy, pandas, matplotlib libraries (Python libraries)

43 Responses to “Supervised and Unsupervised learning”

Saleem Afzali
5 months ago
Reply

Really appreciated of such material!
- Saimadhu Polamuri
  5 months ago
  Reply
  
  Thanks! Saleem
loubna
7 years ago
Reply

hi!
thanks a million for this explanation. Please can you tell us about used techniques to evaluate the result in case of unsupervised learning.??
- Saimadhu Polamuri
  7 years ago
  Reply
  
  Hi Loubna,
  
  Thanks for your compliment.Will write a post on evaluating the unsupervised learning results.
Anonymous
8 years ago
Reply

Perfect
Anonymous
8 years ago
Reply

its very nice to remember for new beginers
Akshat SInha
8 years ago
Reply

Reblogged this on "Unique Facts" a blog by Akshat and commented:
MACHINE LEARNING
TV Mohini
8 years ago
Reply

This is fantastic Madhu. Wish to see many more posts from you my dear.
- saimadhu
  8 years ago
  Reply
  
  Hi TV Mohini
  Thanks for your complement.
Anonymous
9 years ago
Reply

Nice Explanation…:)
kennethhenrymale
9 years ago
Reply

this is really wonderful it has explained me very well
- saimadhu
  9 years ago
  Reply
  
  Thanks 🙂
Farid Saud Rolleri
9 years ago
Reply

Excellent post, this is also usefull for artficial intelligence
- saimadhu
  9 years ago
  Reply
  
  Hi Farib Saud Rolleri,
  Thanks for your complement.
Anonymous
9 years ago
Reply

Excellent..! Keep it up
- saimadhu
  9 years ago
  Reply
  
  Thanks 🙂
Anonymous
9 years ago
Reply

Nice answer please give a*search ans also
Jaggu
9 years ago
Reply

Good explanation madhu. Keep it goes on with aspiration
- saimadhu
  9 years ago
  Reply
  
  Hi Jaggu
  Thanks for your compliment.
  - Rajendiran P
    9 years ago
    Reply
    
    good example with explanation regarding difference between clustering and classification difference. thanks
    - saimadhu
      9 years ago
      Reply
      
      Hi Rajendiran
      Thanks for you compliment
Anonymous
9 years ago
Reply

good work 🙂
- saimadhu
  9 years ago
  Reply
  
  Thanks 🙂
Anonymous
9 years ago
Reply

very good article.. please give explanations on feature sections and feature extractions.
- saimadhu
  9 years ago
  Reply
  
  Hi someone 🙂
  Thank’s for your compliment. sure before that i want to explain all classification algorithms then i will explain about feature extraction.
Anonymous
9 years ago
Reply

Your explanation was so interesting. Appreciate your thoughts for bringing in amazing example. Try to work on examples of this sort for other techniques also.
- saimadhu
  9 years ago
  Reply
  
  Thanks for your compliment. i will do it.
Al-Ahmadgaid Asaad
9 years ago
Reply

Great post saimadhu, can I use your first image above (robot image) for my presentation? I will link it back to your blog to give credit on your work.
- saimadhu
  9 years ago
  Reply
  
  Hi Al-Ahmadgaid Asaad
  Thanks for your complement. you can use what ever you want.
redserpent
9 years ago
Reply

Reblogged this on Redserpent's Weblog and commented:
Nice article for Data newbies
- saimadhu
  9 years ago
  Reply
  
  Hi redserpent
  Thank’s for your compliment.
Shafi
9 years ago
Reply

Great Job Sai Madhu…:)
- saimadhu
  9 years ago
  Reply
  
  Thank’s
svr541
9 years ago
Reply

Great Job 🙂
- saimadhu
  9 years ago
  Reply
  
  Thank’s
Anonymous
9 years ago
Reply

best
- saimadhu
  9 years ago
  Reply
  
  Thank’s
challarao
9 years ago
Reply

Nice explanations….please keep posting…love to learn…
- saimadhu
  9 years ago
  Reply
  
  Thank’s… Challarao
Sneha
9 years ago
Reply

can u plss explain me classification and clustering differences with some examples…am fully unaware of data mining???plssss
- saimadhu
  9 years ago
  Reply
  
  Sure. i will write one post on it. 🙂
  - hardi
    8 years ago
    Reply
    
    nice explanation…!! can you plz tell me Difference between cluster and classification in a simple way.
    - saimadhu
      8 years ago
      Reply
      
      Hi Hardi,
      Thanks for your compliment. Sure soon I will write a post on the key difference between the clustering and classification. For the time being please a have look at https://dataaspirant.com/2014/09/27/classification-and-prediction/