What exactly is the difference between supervised and unsupervised learning?

Question Detail: Right now I am trying to understand Clustering-methods. What I understood by now:

  1. In supervised learning the categories, data is assigned to are known before computation. So they are being used in order to ‘learn’ the parameters that are really significant for those Clusters.
  2. In unsupervised learning Datasets are assigned to segments, without the clusters being known.

Does that mean that if I don’t even know which parameters are crucial for a segmentation I should prefer supervised learning?

Asked By : Prot
Best Answer from StackOverflow

Question Source : http://cs.stackexchange.com/questions/2907

Answered By : Dave Clarke

The difference is that in supervised learning the ‘categories’ are known. In unsupervised learning, they are not, and the learning process attempts to find appropriate ‘categories’. In both kinds of learning all parameters are considered to determine which are most appropriate to perform the classification. Whether you chose supervised or unsupervised should be based on whether or not you know what the ‘categories’ of your data are. If you know, use supervised learning. If you do not know, then use unsupervised. As you have a large number of parameters and you do not know which ones are relevant,
You could use something like principle component analysis to help determine the relevant ones.

Leave a Reply