NAME:                                  

Exam 3

Fall 2000

You may use additional sheets of paper to solve the following questions, but you please report your results and conclusions in the space provided.  Whenever possible, show your work for potential partial credit.  NOTE:  When performing numerical calculations, keep at least 4 digits after a decimal.  (I.e., do NOT round .2265 to .23 or .227)  BUDGET YOUR TIME WISELY!

1.  Women who are union members earn $2.50 per hour more than women who are not union members (Wall St. Journal, July 26, 1994).  Suppose independent random samples of 15 unionized women and 20 nonunionized women in manufacturing have been selected and the following summary statistics are found.

Union Workers

Nonunion Workers

Sample mean hourly wage = $17.54

Sample mean hourly wage = $15.36

Sample variance = $5.02

Sample variance =  $3.95

Sample size = 15

Sample size = 20

a.  Suppose we want to develop an interval estimate of the difference in the mean wage rates between unionized and nonunionzed women working in manufacturing.  What assumptions must be made about the two populations?  (4 points)

 

 

 

 

b.  Construct a 95% confidence interval estimate of the difference between the two population means.  Interpret this confidence interval.  (8 points)

 

 

 

 

 

 

 

c.  Does there appear to be any difference in the mean wage rate between these two groups?  Explain.  (6 points)

 

 

 

 

 

2.  A large automobile insurance company selected samples of single and married male policyholders and recorded the number who had made an insurance claim over the preceding three-year period. Using a = .05, test to determine whether the claim rates differ between single and married male policyholders.  Be sure to clearly state your null and alternative hypotheses, rejection range(s), test statistic and conclusion.  Explain the conclusion of your test to someone who is unfamiliar with the problem.  (16 points)

Single Policyholders

Married Policyholders

Sample size = 400

Sample size = 900

# making claims = 76

# making claims = 90

 

 


3.  As a sales manager who supervises a team of salespeople, you’re interested in estimating the relationship between years of sales experience and annual sales (in $1000’s) for each employee. 

a.  Which of these two variables would you use as your independent and dependent variables in a simple linear regression equation?  Why?  (2 points)

 

 

 

 

 

b.  What sign would you expect on the estimated coefficient for your chosen independent variable?  Why?  (3 points)

 

 

 

 

c.  Suppose your statistical program produces the following estimated regression equation.  Interpret both of the estimated coefficients.  (6 points)

 

 

 

 

d.  What other pieces of information would you like to have for this regression output?  Why are they important?  (9 points)

 

 

 

 

 

 

 

 

 

 

 

 

 

 

e.  What other independent variables would be useful in developing a more effective model?  Why?  (4 points)

 

 

 

 

 

 

 

 

 

 

 

 

 

 


EMPIRICAL CASE:  Two experts have provided you with subjective lists of school districts that they think are among the best in the country.  For each school district, the following data were obtained:

Using Excel, you get the following regression output.  Use this output to answer the questions below.

SUMMARY OUTPUT

 

 

 

 

 

Regression Statistics

 

 

 

 

Multiple R

0.772320924

 

 

 

 

R Square

0.596479609

 

 

 

 

Adjusted R Square

0.428346113

 

 

 

 

Standard Error

11.21911189

 

 

 

 

Observations

18

 

 

 

 

ANOVA

 

 

 

 

 

 

df

SS

MS

F

Significance F

Regression

5

2232.689452

446.5378903

3.547654822

0.033561655

Residual

12

1510.421659

125.8684716

 

 

Total

17

3743.111111

 

 

 

 

Coefficients

Standard Error

t Stat

P-value

 

Intercept

33.70855183

63.27729519

0.532711642

0.603958149

 

Average Class Size

-1.557651146

1.090141646

 

0.17855701

 

Spending Per Student ($)

-0.002427352

0.001655643

 

0.168327702

 

Average Teacher Salary ($)

-0.000258576

0.000639237

 

0.692956957

 

Combined SAT Score

0.076885289

0.03737104

 

0.06206088

 

% Taking SAT

0.285200804

0.115445124

 

0.029469471

 

a.  Now that you have some Excel output, interpret each estimated coefficient in terms of both size and sign.  Does the sign of each make economic sense?  (10 points)

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

b.  Use the information above to calculate the t-statistics for the missing coefficients.  The intercept is calculated for you.  Fill these into your table of results.  (5 points)

 


c.  What do your t-statistics tell you with regards to the significance of each coefficient?  Be thorough and refer to hypothesis tests.  (10 points)

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

d.  Use the appropriate measure to comment upon the ability of the estimated model to fit the data.  (4 points)

 

 

 

 

 

 

 

e.  What can you say about overall significance for this model?  What does this mean?  Again, be thorough and refer to hypothesis tests.  (4 points)

 

 

 

 

 

 

 

 

 

f.  Would you suspect multicollinearity in this model?  Where is it most likely to exist?  What kinds of statistical information would you need to determine whether or not multicollinearity is present?  (9 points)