5 Single layer networks: classification
In regression problems the output is modeled as a linear function of the model parameters. In classification problems, because the target is discrete, we need some extra step to convert the linear function output to match it. We talk about this extra step in this chapter, and its related complications. And since the linear function output is continuous while the target is discrete, we need to decide how to cut the continuous space into discrete, disjoint regions.
based on how we achieve this:
- discriminative learning learns a function f: input –> label.
- probabilistic discriminative learning learns a distribution p(label|input).
- probabilistic generative learning learns a joint distribution p(input, label).