SAS代写 STAT 4110/7110 Statistical Software and Data Analysis

STAT 4110/7110 Statistical Software and Data Analysis

Homework Assignment #5 (30 pts.)

 

Note:

  1. You may perform this assignment by yourself alone or a team up to four
  2. If you decided to work as a team, you should form your own team in the Canvas. (Go to People on the menu and click Group on the top.)
  3. The grade given for the project will be given to all team members.
  4. You should turn in both SAS program and SAS output along with the comments for the whole credit.
  5. It should include ONLY the SAS output needed to illustrate the key points in your discussion and it should merge neatly with the body of your discussion.

 

  1. (10 pts.) Using the data set BASEBALL in the sashelp library of SAS, propose a multiple regression model to predict the salary (Salary). Discuss why the proposed model is the appropriate and (maybe) the best. Make sure you include the discussion of the model assumption checking. [Hint: use PROC REG.]

 

  1. (10 pts.) Determine whether the variables division at the end of 1986 (division), league at the end of 1986 (league), and position(s) in 1986 (position) are significant factor(s) to predict 1987 salary (in $ Thousands) (salary). [Hint: use PROC GLM.]

 

  1. (10 pts.) The data set ‘admit (Micro Excel Worksheet)’ contains four variables:
    1. admit = 1 if the individual was admitted to graduate school, and 0 otherwise.(binary variable)
    2. gre = GRE (Graduate Record Examinations) score (continuous variable)
    3. gpa = GPA (Grade Point Average) score (continuous variable)
    4. rank = institution rank = {1, 2, 3, 4} Note that institutions with a rank of 1 have the highest prestige, while those with a rank of 4 have the lowest. (categorical variable)

Fit a regression model to estimate the proportion of admission to graduate school. Interpret your result. [Hint: use PROC GENMOD.]