Linear regression
Description:
- Prediction by Regression Model
- We might not know the PDF of and , but if we know the mean and standard variance, we can determine the best linear predictor of with respect to
Finding line of best fit
- 2 main methods:
- Scattergraph method:
- Draw a line through data points with about an equal number of points above and below the line.
- Linear regression:
- Using least squares method
- Using correlation:
- We need to chooseΒ aΒ andΒ bΒ to minimize the Mean square error
- Taking partial derivative and set them equal to 0 we have and when it is at minimum point
- Cant have maximum due to the nature of the problem
- The best linear predictor (lowest mean square error) is:
- Happens when and is the Correlation of and
- The Mean square error of this predictor is given by
- Using
- Scattergraph method:
Assumptions about error term ϡ in linear regression model:
- . This impliesΒ Β andΒ βΒ are constants, and henceΒ
- The variance ofΒ Ξ΅, denoted byΒ , is the same for allΒ x
- The values of Ρ are independent.
- Ρ is a normally distributed random variable for all values of x
Testing for significance
-
IfΒ , thenΒ . In this case, we would conclude thatΒ xΒ andΒ yΒ are not linearly related
-
IfΒ , we would conclude that the two variables are related.
-
Thus, to test for a significant regression relationship, we must conduct a hypothesis test to determine whetherΒ
-
TheΒ t-testΒ is commonly used.


