Linear regression

Description:

  • Prediction by Regression Model
  • We might not know the PDF of and , but if we know the mean and standard variance, we can determine the best linear predictor of with respect to

Finding line of best fit

  • 2 main methods:
    1. Scattergraph method:
      • Draw a line through data points with about an equal number of points above and below the line.
    2. Linear regression:
      • Using least squares method
      • Using correlation:
        • We need to chooseΒ aΒ andΒ bΒ to minimize the Mean square error
        • Taking partial derivative and set them equal to 0 we have and when it is at minimum point
          • Cant have maximum due to the nature of the problem
        • The best linear predictor (lowest mean square error) is:
          • Happens when and is the Correlation of and
          • The Mean square error of this predictor is given by
      • Using

Assumptions about error term ϡ in linear regression model:

  1. . This impliesΒ Β and ​ are constants, and henceΒ 
  2. The variance ofΒ Ξ΅, denoted byΒ , is the same for allΒ x
  3. The values of Ρ are independent.
  4. Ρ is a normally distributed random variable for all values of x

Testing for significance

  • IfΒ , thenΒ . In this case, we would conclude thatΒ xΒ andΒ yΒ are not linearly related

  • IfΒ , we would conclude that the two variables are related.

  • Thus, to test for a significant regression relationship, we must conduct a hypothesis test to determine whetherΒ 

  • TheΒ t-testΒ is commonly used.