Shapiro-Wilk test

The test tells you whether the data are likely normally distributed.

  • H0: the data are normally distributed
  • Ha: the data are not normally distributed
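In R this is `shapiro.test()` from base stats. A minimal sketch on simulated data (the sample below is my own illustration, not from the notes):

```r
# shapiro.test() is base R; the simulated sample is for illustration only
set.seed(42)
x <- rnorm(50)              # draw a sample that really is normal
res <- shapiro.test(x)      # H0: the data are normally distributed
res$p.value                 # a p-value between 0 and 1
```

A high p-value here means we fail to reject normality; it does not prove the data are normal.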

Null hypothesis

  • Closer to “false” when the p-value is low (more evidence against the null), i.e. the hypothesis looks wrong
  • Closer to “true” when the p-value is high (less evidence against the null)
  • In a residual check, a higher p-value means the errors are plausibly normally distributed

Warning

We never say that the null hypothesis is “True”, only that we fail to reject the null

A high p-value means we don’t have enough evidence to reject the null. It doesn’t mean the null is true; it means support for Ha isn’t strong enough to reject H0.

p-value

A high p-value only tells you that you had limited evidence against the null. However, with a very large sample it may be reasonable to conclude that the null is either true or that the true value differs from it only by a small amount (any true effect is small).

Residuals

Residuals = observed value − predicted value.
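A quick check of this identity using base R’s built-in `cars` data (the dataset and model choice are mine, for illustration):

```r
fit <- lm(dist ~ speed, data = cars)   # simple linear model
r   <- cars$dist - fitted(fit)         # observed - predicted
# this matches what resid() extracts from the fit
all.equal(unname(r), unname(resid(fit)))
```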

Adjusted vs Multiple

where quad.lm and spruce.lm are fitted linear models.

  • Multiple R²: the raw proportion of variance in the response explained by the model
summary(quad.lm)$r.squared
  • Adjusted R²: multiple R² penalized for the number of predictors, so it only increases when an added term genuinely improves the fit
summary(spruce.lm)$adj.r.squared
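A sketch comparing the two on the built-in `cars` data (the quadratic model is my own example): adjusted R² is never larger than multiple R².

```r
fit <- lm(dist ~ speed + I(speed^2), data = cars)  # quadratic model
s <- summary(fit)
s$r.squared      # multiple R^2: raw proportion of variance explained
s$adj.r.squared  # penalized for the number of predictors
```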

Cook’s Distance

Cook’s distance measures how much each observation influences the fitted model.
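`cooks.distance()` in base R returns one influence value per observation. The 4/n cutoff below is a common rule of thumb, not something stated in the notes:

```r
fit <- lm(dist ~ speed, data = cars)  # example model on built-in data
cd  <- cooks.distance(fit)            # one influence value per observation
which(cd > 4 / length(cd))            # flag unusually influential points
```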

Piecewise Regression
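One common way to fit a piecewise (broken-stick) regression in R is with an indicator term built via I(). The knot location and coefficients below are simulated assumptions for illustration:

```r
set.seed(1)
x  <- seq(0, 10, length.out = 100)
xk <- 5                                  # knot location (assumed known here)
y  <- 2 + 1 * x + 3 * pmax(x - xk, 0) + rnorm(100, sd = 0.5)
# (x > xk) plays the role of the 0/1 indicator I(x > xk)
pw <- lm(y ~ x + I((x - xk) * (x > xk)))
coef(pw)  # third coefficient estimates the change in slope after the knot
```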

lowess smoother?
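`lowess()` is base R’s locally weighted scatterplot smoother; a sketch on the built-in `cars` data (dataset choice is mine):

```r
lw <- lowess(cars$speed, cars$dist)  # returns smoothed (x, y) coordinates
# plot(cars); lines(lw)              # would overlay the smoother on the scatter
str(lw)
```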

fitted values? fitted()
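`fitted()` extracts the model’s predicted values at the training data; on those rows it agrees with `predict()` (model below is my own example):

```r
fit <- lm(dist ~ speed, data = cars)  # example model, built-in data
head(fitted(fit))                     # predicted dist for the first rows
all.equal(fitted(fit), predict(fit))  # identical on the training data
```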

anova
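`anova()` on two nested lm fits performs an F-test of whether the extra term improves the fit; the models here are assumptions for illustration:

```r
fit1 <- lm(dist ~ speed, data = cars)               # reduced model
fit2 <- lm(dist ~ speed + I(speed^2), data = cars)  # full model
a <- anova(fit1, fit2)  # F-test on the added quadratic term
a
```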

Proof

Prove, using LaTeX, that [formula omitted in the notes], where I() is 1 when the condition holds and 0 otherwise.

Code

normcheck(spruce.lm)
# from the s20x package: plots two normality graphs for the residuals
# side by side, with a Shapiro-Wilk p-value

  • H0 in normcheck: the residuals are normally distributed
  • Ha: they are not
  • The p-value reported is from the Shapiro-Wilk normality test
I(...)
# use the expression AS IS inside a model formula: I(x^2) means x squared,
# rather than letting formula syntax reinterpret the ^ operator

Predict the Height of spruce when the Diameter is 15, 18, and 20 cm (use predict()):

predict(quad.lm, data.frame(BHDiameter = c(15, 18, 20)))

Adjusted R² can be used to decide which model is “better” (the one with the higher value):

summary(spruce.lm)$adj.r.squared