Statistics 511

 

Course Personnel:   

Professor:                          Naomi Altman                naomi@stat.psu.edu          312 Thomas              865-3791

Teaching Assistant:            Megan Romer                    

 

Web Page: http:// www.stat.psu.edu/~naomi/stat511/

 office hours, homework and exam solutions, practice exams, exam times, homework hints and corrections, etc. will be posted here

 

Texts:

Required:  Applied Linear Statistical Models, 5th Edition   Kutner, Nachtsheim, Neter and Li 2005 Irwin, Inc.  (Note: This is a real improvement on the 4th edition, but you can use the 4th if you prefer.  I will make sure that I have page numbers for both for readings and homework.)

 

Auditors

Auditors are encouraged to participate fully in lectures but may not attend Lab sessions or turn in material for grading.

 

Homework

 

Weekly assignments are designed to help you assimilate and practice the techniques covered in class and to prepare you for the material to come.  Some small assignments may be given in any class.  These are to prepare you for class and are due as you come into class.  1 point will be given for turning each of these in on time.  Longer assignments are due at the start of Friday’s lecture (for the bonus point) but may be turned in as late as Friday at 2:30.  (They can be placed in my mailbox.)

 

Homework, data and sample SAS commands will be posted to the Web.  Homework should be typed.  SAS output should be downloaded to a word processor and edited appropriately.  Equations and other mathematical notation can be added by hand as needed.

 

Students are encouraged to discuss homework, but the work you hand in must be your own.   Copying will be penalized.

 

Late assignments will be allowed only with the permission from Prof. Altman, documented by a note or e-mail.  Permission will be granted only in cases of medical or family emergencies, or conflicting university commitments.  If in doubt - ask.

 

 

Project

 

Students will work in groups of 2 or 3 on a data analysis project to answer a question of interest which requires several predictors.  E.g. Is gender a significant predictor of faculty salary, after accounting for years in rank and publication record?  Can freshman performance be predicted by SAT and high school scores at admission.  More details will follow.

 

Exams (exact times will be posted)

 

Mid-term 1: Week of Oct. 11

Mid-term 2: Week of Nov. 15

Final: Finals Week

 

Make-up exams, if necessary, will follow the scheduled exams.  Prof. Altman must be informed in writing of exam conflicts at least 1 week in advance.  Medical and family emergencies will be handled on a case by case basis.  Please do not come to an exam if you are sick - you may infect the rest of us, and I cannot allow you to take the make-up exam if you have already taken the exam. 

 

Regrades

 

Grading errors on homeworks should be brought to the attention of your TA within 2 weeks of the date on which it was returned.  Your failure to pick up your homework does not extend this deadline, unless the cause was a lengthy medical or family emergency, in which case you need to contact Prof. Altman.

 

Grading errors on exams should be noted in writing.  The exam paper and explanation of regrade should be given to Prof. Altman within 2 weeks of the date on which it was returned.

 

Homework and exam solutions will be available on the Web.  Homework solutions will usually be posted the Monday after the homework is turned in.

 

Grade Breakdown

 

homework                                     20%

project                                          15%

mid-term 1                                    20%

mid-term 2                                    20%

final exam                                      25%

 

Computing

 

SAS PC will be the official package used in the course.  Students may use Splus/R.  However, SAS output will be used on exams and in class. 

 

Syllabus

 

Topic

Section in NWNK

 

 

Normality, t-test and other basics

 

Estimating the Mean

 

What is Regression?

1.1, 1.2, 3.10

Least squares Regression

1.3 – 1.8, 2.7

Inference for Regression

2.1 – 2.6, 2.8 –2.10

Regression Diagnostics

3.1 – 3.3, 3.9

Matrices

5.1-5.8

Regression in Matrix Notation

5.9 – 5.13

Regression Through the Origin

4.4

Regression and Correlation

15.1

Multiple Regression

6, 7.1 – 7.4

Regression Diagnostics

9

Polynomial Regression

7.7, 7.8

Variable Selection

8

Using Categorical Predictors in Regression

11

Advanced Topics (time permitting)

 

 

A detailed list of topics and readings is posted weekly to the course web page under "Readings".

 

Academic Integrity

 

Academic integrity is the pursuit of scholarly activity free from fraud and deception and is an educational objective of this institution. All University policies regarding academic integrity apply to this course. Academic dishonesty includes, but is not limited to, cheating, plagiarizing, fabricating of information or citations, facilitating acts of academic dishonesty by others, having unauthorized possession of examinations, submitting work of another person or work previously used without informing the instructor, or tampering with the academic work of other students.

 

 What You Should Know to Succeed in Stat 511

 

 

Matrix Notation:

1.      Be familiar with matrix notation.

2.      Be familiar with use of matrix notation to summarize linear equations.

3.      Know how to form the inner product of vectors.

4.      Know how to multiply two matrices.

5.      Know how to multiply a matrix or vector times a scalar.

6.      Be familiar with matrix inverse.

7.      When does a matrix have an inverse?

 

Univariate Statistics

 

1.      Distribution (especially Normal, t and F).

2.      Use of tables for the Normal, t and F distributions.

3.      Normal probability plot to assess normality.

4.      Population mean and variance.

5.      Sample mean and variance and how to compute them.

6.      The one-sample t-test and confidence interval for the mean.

7.      The two-sample t-tests (including paired) and confidence intervals for the difference in means.

 

Regression

 

1.      Correlation - sample and population.

2.      Regression equation.

3.      Linear function.

4.      The ordinary linear regression model.

5.      Formulas for estimating the sample intercept and slope in ordinary linear regression.

6.      The ANOVA table for regression.

7.      R2.

8.      The F and t-tests for the statistical significance of the regression relationship.

9.      T-tests and confidence intervals for the population intercept and slope.

10.  Confidence intervals for the regression line.

11.  Prediction intervals.

12.  Checking the residuals for curvature, constant variance, outliers and normality.

13.  Transformations of  X and/or Y to satisfy regression assumptions.

 

ANOVA

1.      One-way and two-way ANOVA models.

2.      ANOVA table.

3.      Checking model assumptions.

4.      F-tests for treatments and factors.