# Chapter 11: Simple Linear Regression

State Math SAT scores. Refer to the simple linear regression relating y = 2014 Math SAT scores to x = 2010 Math SAT scores, Exercise 11.19 (p. 654). A portion of the SPSS printout of the analysis is shown below.

a. Locate the values of SSE, $s^2$, and $s$ on the SPSS printout.

b. Give a practical interpretation of the value of s.
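
As a reminder of how these three quantities are usually related in simple linear regression, here is a short sketch. The symbol $n$ (the number of observations) is an assumption here, since the exercise text does not state it; $\widehat{y}_i$ denotes the value predicted by the least-squares line.

$$
\text{SSE} = \sum_{i=1}^{n} \left(y_i - \widehat{y}_i\right)^2, \qquad s^2 = \frac{\text{SSE}}{n - 2}, \qquad s = \sqrt{s^2}
$$

Under the usual model assumptions, a common rule of thumb is that roughly 95% of the observed $y$-values fall within $2s$ of their least-squares predicted values.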

Congress voting on women’s issues. The American Economic Review (March 2008) published research on how the gender mix of a U.S. legislator’s children can influence the legislator’s votes in Congress. The American Association of University Women (AAUW) uses voting records of each member of Congress to compute an AAUW score, where higher scores indicate more favorable voting for women’s rights. The researcher wants to use the number of daughters a legislator has to predict the legislator’s AAUW score.

a. In this study, identify the dependent and independent variables.

b. Explain why a probabilistic model is more appropriate than a deterministic model.

c. Write the equation of the straight-line, probabilistic model.
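
For orientation, the general form of a straight-line probabilistic model is sketched below. The symbols $\beta_0$ (y-intercept), $\beta_1$ (slope), and $\varepsilon$ (random error) are the conventional ones and are assumed here rather than taken from the exercise text.

$$
y = \beta_0 + \beta_1 x + \varepsilon
$$

The deterministic part is $\beta_0 + \beta_1 x$; the term $\varepsilon$ accounts for variation in $y$ that $x$ alone does not explain.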

The following table is similar to Table 11.2. It is used for making the preliminary computations for finding the least-squares line for the given pairs of x- and y-values.

a. Complete the table.

b. Find $SS_{xy}$.

c. Find $SS_{xx}$.

d. Find $\widehat{\beta}_1$.

e. Find $\overline{x}$ and $\overline{y}$.

f. Find $\widehat{\beta}_0$.

g. Find the least-squares line.
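
The quantities in parts b through g are typically computed with the standard least-squares formulas, sketched here for reference. The index $i$ and the sample size $n$ are notation assumed for this summary; the table for this exercise is not reproduced in the text.

$$
SS_{xy} = \sum_{i=1}^{n} (x_i - \overline{x})(y_i - \overline{y}) = \sum x_i y_i - \frac{\left(\sum x_i\right)\left(\sum y_i\right)}{n}, \qquad
SS_{xx} = \sum_{i=1}^{n} (x_i - \overline{x})^2 = \sum x_i^2 - \frac{\left(\sum x_i\right)^2}{n}
$$

$$
\widehat{\beta}_1 = \frac{SS_{xy}}{SS_{xx}}, \qquad
\widehat{\beta}_0 = \overline{y} - \widehat{\beta}_1 \overline{x}, \qquad
\widehat{y} = \widehat{\beta}_0 + \widehat{\beta}_1 x
$$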

Refer to Exercise 11.14. After the least-squares line has been obtained, the table below (which is similar to Table 11.2) can be used for (1) comparing the observed and the predicted values of y and (2) computing SSE.

a. Complete the table.

b. Plot the least-squares line on a scatterplot of the data. Plot the following line on the same graph:

$\widehat{y} = 14 - 2.5x$

c. Show that SSE is larger for the line in part b than for the least-squares line.
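
The comparison in part c rests on the definition of SSE, restated here as a sketch. For any candidate line with intercept $b_0$ and slope $b_1$ (names assumed for illustration):

$$
\text{SSE}(b_0, b_1) = \sum_{i=1}^{n} \left(y_i - (b_0 + b_1 x_i)\right)^2
$$

The least-squares line is, by definition, the choice of $b_0$ and $b_1$ that makes this sum as small as possible, so any other line fitted to the same data cannot have a smaller SSE.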

Construct a scatterplot for the data in the following table.

| x | 0.5 | 1 | 1.5 |
|---|-----|---|-----|
| y | 2 | 1 | 3 |

a. Plot the following two lines on your scatterplot: $y = 3 - x$ and $y = 1 + x$.

b. Which of these lines would you choose to characterize the relationship between x and y? Explain.

c. Show that the sum of the prediction errors for both of these lines equals 0.

d. Which of these lines has the smaller SSE?

e. Determine the data's least-squares line and compare it to the two lines described in part a.
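
One way to check the arithmetic in parts c through e is a short Python sketch like the one below. It uses the three data points from the table and the two lines from part a; the function names (`errors`, `sse`, `least_squares`) are hypothetical helpers introduced for illustration, not part of the exercise.

```python
# Data from the table in this exercise.
x = [0.5, 1.0, 1.5]
y = [2.0, 1.0, 3.0]


def errors(xs, ys, b0, b1):
    """Prediction errors y_i - (b0 + b1 * x_i) for the line y = b0 + b1 * x."""
    return [yi - (b0 + b1 * xi) for xi, yi in zip(xs, ys)]


def sse(xs, ys, b0, b1):
    """Sum of squared prediction errors for the line y = b0 + b1 * x."""
    return sum(e ** 2 for e in errors(xs, ys, b0, b1))


def least_squares(xs, ys):
    """Return (b0, b1) using the SS_xy / SS_xx formulas."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    ss_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(xs, ys))
    ss_xx = sum((xi - x_bar) ** 2 for xi in xs)
    b1 = ss_xy / ss_xx
    b0 = y_bar - b1 * x_bar
    return b0, b1


# The two candidate lines from part a, given as (intercept, slope).
candidates = {"y = 3 - x": (3.0, -1.0), "y = 1 + x": (1.0, 1.0)}

for label, (b0, b1) in candidates.items():
    print(label,
          "| sum of errors:", sum(errors(x, y, b0, b1)),
          "| SSE:", sse(x, y, b0, b1))

print("least-squares (b0, b1):", least_squares(x, y))
```

Running the script prints the sum of errors and the SSE for each candidate line, followed by the least-squares intercept and slope, which can then be compared against hand calculations.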

Best-paid CEOs. Refer to Glassdoor Economic Research firm’s 2015 ranking of the 40 best-paid CEOs in Table 2.1 (p. 65). Recall that data were collected on a CEO’s age and ratio of salary to a typical worker’s pay at the firm. One objective is to predict the ratio of salary to worker pay based on the CEO’s age.

a. In this study, identify the dependent and independent variables.

b. Explain why a probabilistic model is more appropriate than a deterministic model.

c. Write the equation of the straight-line, probabilistic model.

