Problem Definition

Cow
Milk(Y)
Age(X1)
Weight(X2)

#1

10

1

2

#2

11

3

3

#3

12

4

1

We want to find the model which well explains our target variable(yy) with xx variables. The model looks like this

Yi=β1X1i+β2X2i+ϵiY_i =\beta_1X_{1i}+\beta_2X_{2i}+\epsilon_i

We can evaluate how precise our model it is with a fluctuation of our error. When we assume that our expected error is zero, the fluctuation represents the size of precision.

  • Good for Intuition: E[∣ϵ−E(ϵ)∣]=E[∣ϵ∣] E[|\epsilon-E(\epsilon)|]=E[|\epsilon|]

  • Good for calculation: E[ϵ2]=σϵ \sqrt{E[\epsilon^2]}=\sigma_\epsilon​

If we make a probabilistic assumption for error, we can easily find the fluctuation. For example, Error can be −2,−1,0,1,2-2, -1,0,1,2 with the probability 15 \dfrac{1}{5}. Then E[∣ϵ∣]=1E[|\epsilon|]=1. However, in a real world problem, we couldn't make a probabilistic assumption for error. Even if we do, we just assume the normal with unknown variance. So to know the precision we need to estimate the sigma of error.

MLE: σϵ^=ϵ12+⋯+ϵn2n \hat{\sigma_\epsilon}=\sqrt{\dfrac{\epsilon_1^2+\cdots+\epsilon_n^2}{n}} | σϵ^=ϵ12+⋯ϵn−factor  num2n−factor  num \hat{\sigma_\epsilon}=\sqrt{\dfrac{\epsilon_1^2+\cdots\epsilon^2_{n-factor \;num}}{n-factor\;num}}

Last updated

Was this helpful?