Classification
Given data, learn a function that will output (predict) a categorical response (target) variable given input feature variables.
With the penguins:
- Input: Bill length and depth
- Output: Penguin species
Question: How do we know if a learned function is doing a good job?
306 |
Chinstrap |
40.9 |
16.6 |
341 |
Chinstrap |
49.6 |
18.2 |
291 |
Chinstrap |
50.5 |
19.6 |
102 |
Adelie |
37.7 |
16.0 |
289 |
Chinstrap |
52.0 |
18.1 |
267 |
Gentoo |
55.1 |
16.0 |
125 |
Adelie |
40.6 |
19.0 |
Regression
Given data, learn a function that will output (predict) a numeric response variable (target) given input feature variables.
With the penguins:
- Input: Bill length (and Species)
- Output: Bill Depth
Question: How do we know if a learned function is doing a good job?
306 |
16.6 |
Chinstrap |
40.9 |
341 |
18.2 |
Chinstrap |
49.6 |
291 |
19.6 |
Chinstrap |
50.5 |
102 |
16.0 |
Adelie |
37.7 |
289 |
18.1 |
Chinstrap |
52.0 |
267 |
16.0 |
Gentoo |
55.1 |
125 |
19.0 |
Adelie |
40.6 |
Classification and Regression
Commonality: When learning (training) we will have access to a response or target variable.
Importantly, in both cases, we want to make predictions for unseen data!
How will we evaluate how well we predict data we haven’t seen?!