Hypothesis Testing 1Bora Jin1 / 21

Material

🎥 Watch Hypothesis Testing

Slides

Optional: 📖 Read IMS: Chapter 11 - Hypothesis Testing With Randomization

2 / 21

Today's Goal

Understand important concepts in hypothesis testing framework including the null hypothesis, alternative hypothesis, and p-value, etc.
Conduct simulation-based hypothesis testing for a population proportion and a mean manually and also with the tidymodels package

3 / 21

Statistical Inference

Statistical inference is the process of using sample data to make conclusions about the underlying population the sample came from.

Estimation: using the sample to estimate a plausible range of values for the unknown parameter
Testing: evaluating whether our observed sample provides evidence for or against some claim about the population

4 / 21

Statistical Inference

Statistical inference is the process of using sample data to make conclusions about the underlying population the sample came from.

Estimation: using the sample to estimate a plausible range of values for the unknown parameter
Testing: evaluating whether our observed sample provides evidence for or against some claim about the population

Today we will focus on Testing.

We will conduct hypothesis testing using simulation-based methods (bootstrapping, again).

4 / 21

Hypothesis testing framework

Defining the hypotheses
Collecting and summarizing data
Assessing the observed evidence
Making a conclusion

5 / 21

Quiz: 1. Defining the hypotheses

Q - Choose the correct description in the following sentences:

Two hypotheses are about the ( population / sample ).
The null and the alternative hypotheses are defined for ( statistics / parameters ).
The null hypothesis states ( "there is nothing unusual going on" / "there is something interesting going on" ).
The alternative hypothesis states ( the status quo / the research question ).
The alternative hypothesis is denoted by ( $H_{1}$ / $H_{A}$ ).

6 / 21

Quiz: 1. Defining the hypotheses

Q - Choose the correct description in the following sentences:

Two hypotheses are about the ( population / sample ).
The null and the alternative hypotheses are defined for ( statistics / parameters ).
The null hypothesis states ( "there is nothing unusual going on" / "there is something interesting going on" ).
The alternative hypothesis states ( the status quo / the research question ).
The alternative hypothesis is denoted by ( $H_{1}$ / $H_{A}$ ).

7 / 21

Quiz: 1. Defining the hypotheses

Q - Which of the following is the correct set of hypotheses?

(a) $H_{0} : p = 0.10$ ; $H_{A} : p \neq 0.10$

(b) $H_{0} : p = 0.10$ ; $H_{A} : p > 0.10$

(d) $H_{0} : \hat{p} = 0.10$ ; $H_{A} : \hat{p} < 0.10$

$\hat{θ}$ used to denote the associated statistic to the parameter $θ$ .

8 / 21

Quiz: 1. Defining the hypotheses

Q - Which of the following is the correct set of hypotheses?

(a) $H_{0} : p = 0.10$ ; $H_{A} : p \neq 0.10$

(b) $H_{0} : p = 0.10$ ; $H_{A} : p > 0.10$

(d) $H_{0} : \hat{p} = 0.10$ ; $H_{A} : \hat{p} < 0.10$

$\hat{θ}$ used to denote the associated statistic to the parameter $θ$ .

9 / 21

Types of Alternative Hypotheses

One sided alternatives: the parameter is hypothesized to be less than or greater than the null value
- $p > 0.10$ or $p < 0.10$
Two sided alternatives: the parameter is hypothesized to be not equal to the null value
- $p \neq 0.10$
- more objective, and hence more widely preferred

10 / 21

Quiz: 1. Defining the hypotheses

Q - Identify the null and alternative hypothesis in the following research questions.

Average systolic blood pressure of people with Stage 1 Hypertension is 150 mm Hg. We wonder whether a new blood pressure medication has an effect on the average blood pressure of heart patients.

11 / 21

Quiz: 1. Defining the hypotheses

Q - Identify the null and alternative hypothesis in the following research questions.

Average systolic blood pressure of people with Stage 1 Hypertension is 150 mm Hg. We wonder whether a new blood pressure medication has an effect on the average blood pressure of heart patients.

$H_{0}$ : A new blood pressure medication does not have an effect on the average blood pressure of heart patients.
$H_{1}$ : A new blood pressure medication has an effect on the average blood pressure of heart patients.

With $μ$ being the average blood pressure of heart patients who take a new blood pressure medication, $H_{0}$ : $μ = 150$ vs. $H_{1}$ : $μ \neq 150$

11 / 21

Quiz: 1. Defining the hypotheses

Q - Identify the null and alternative hypothesis in the following research questions.

A principal at a certain school claims that the students in the school are above average intelligence. The mean population IQ is 100.

12 / 21

Quiz: 1. Defining the hypotheses

Q - Identify the null and alternative hypothesis in the following research questions.

A principal at a certain school claims that the students in the school are above average intelligence. The mean population IQ is 100.

$H_{0}$ : The mean IQ for students attending the school is equal to the mean population IQ.
$H_{1}$ : The mean IQ for students attending the school is above the mean population IQ.

With $μ$ being the mean IQ for students attending the school,

$H_{0}$ : $μ = 100$
$H_{1}$ : $μ > 100$

12 / 21

Quiz: 1. Defining the hypotheses

Q - Identify the null and alternative hypothesis in the following research questions.

A researcher wants to test if vitamin C has the ability to prevent the flu in children. The flu infection rate in the US children population is 20%.

13 / 21

Quiz: 1. Defining the hypotheses

Q - Identify the null and alternative hypothesis in the following research questions.

A researcher wants to test if vitamin C has the ability to prevent the flu in children. The flu infection rate in the US children population is 20%.

$H_{0}$ : The true infection rate of the flu among children with sufficient vitamin C is equal to the infection rate among all US children.
$H_{1}$ : The true infection rate of the flu among children with sufficient vitamin C is lower than the infection rate among all US children.

With $p$ being the true infection rate of the flu among children with sufficient vitamin C,

$H_{0}$ : $p = 0.2$
$H_{1}$ : $p < 0.2$

13 / 21

Quiz: 3. Assessing the observed evidence

Q - What is p-value?

14 / 21

Quiz: 3. Assessing the observed evidence

Q - What is p-value?

Conditional probability
Given $H_{0}$ is true, what is the probability of observing $\hat{p}$ (our statistic) or something more extreme against the null hypothesis?
We compute this probability by simulating a null distribution for $\hat{p}$ .

14 / 21

Quiz: 3. Assessing the observed evidence

Q - What is p-value?

Conditional probability
Given $H_{0}$ is true, what is the probability of observing $\hat{p}$ (our statistic) or something more extreme against the null hypothesis?
We compute this probability by simulating a null distribution for $\hat{p}$ .

Q - What is the null distribution?

14 / 21

Quiz: 3. Assessing the observed evidence

Q - What is p-value?

Conditional probability
Given $H_{0}$ is true, what is the probability of observing $\hat{p}$ (our statistic) or something more extreme against the null hypothesis?
We compute this probability by simulating a null distribution for $\hat{p}$ .

Q - What is the null distribution?

Distribution of the observed statistics given the null hypothesis is true ("under the null hypothesis")

14 / 21

Quiz: 3. Assessing the observed evidence

Q - What is p-value?

Conditional probability
Given $H_{0}$ is true, what is the probability of observing $\hat{p}$ (our statistic) or something more extreme against the null hypothesis?
We compute this probability by simulating a null distribution for $\hat{p}$ .

Q - What is the null distribution?

Distribution of the observed statistics given the null hypothesis is true ("under the null hypothesis")

Q - We have only one sample. How can we possibly get a distribution?

14 / 21

Quiz: 3. Assessing the observed evidence

Q - What is p-value?

Conditional probability
Given $H_{0}$ is true, what is the probability of observing $\hat{p}$ (our statistic) or something more extreme against the null hypothesis?
We compute this probability by simulating a null distribution for $\hat{p}$ .

Q - What is the null distribution?

Distribution of the observed statistics given the null hypothesis is true ("under the null hypothesis")

Q - We have only one sample. How can we possibly get a distribution? Bootstrapping!

14 / 21

Quiz: 4. Making a conclusion

Q - What are the conclusions we can make from a hypothesis test?

15 / 21

Quiz: 4. Making a conclusion

Q - What are the conclusions we can make from a hypothesis test?

Reject $H_{0}$ in favor of $H_{1}$
Fail to reject $H_{0}$
- Could be because $H_{0}$ is true
- or because we happened to get a sample that didn't give us significant evidence to support $H_{0}$ was false
- We never know which one occurred through hypothesis testing
- We never say we "accept" the null

15 / 21

Quiz: 4. Making a conclusion

Q - We make a conclusion by comparing the p-value to a predetermined numeric cutoff. What is it called?

16 / 21

Quiz: 4. Making a conclusion

Q - We make a conclusion by comparing the p-value to a predetermined numeric cutoff. What is it called?

Significance level
Denoted by $α$
Depends on the context, but usually set at $α = 0.05$

16 / 21

Quiz: 4. Making a conclusion

Q - What does it mean that $α = 0.05$ ?

17 / 21

Quiz: 4. Making a conclusion

Q - What does it mean that $α = 0.05$ ?

We would expect to incorrectly reject $H_{0}$ when $H_{0}$ is true for 5% of the time.
P(reject $H_{0}$ | $H_{0}$ is true) = 0.05

17 / 21

Quiz: 4. Making a conclusion

Q - What does it mean that $α = 0.05$ ?

We would expect to incorrectly reject $H_{0}$ when $H_{0}$ is true for 5% of the time.
P(reject $H_{0}$ | $H_{0}$ is true) = 0.05

Q - State a conclusion to make when the p-value $< α$ .

17 / 21

Quiz: 4. Making a conclusion

Q - What does it mean that $α = 0.05$ ?

We would expect to incorrectly reject $H_{0}$ when $H_{0}$ is true for 5% of the time.
P(reject $H_{0}$ | $H_{0}$ is true) = 0.05

Q - State a conclusion to make when the p-value $< α$ .

The results are statistically significant.
There is sufficient evidence at $α = 0.05$ to reject the null hypothesis in favor of $H_{1}$ .
The data provide convincing evidence for the alternative hypothesis.

17 / 21

Quiz: 4. Making a conclusion

Q - State a conclusion to make when the p-value $\geq α$ .

18 / 21

Quiz: 4. Making a conclusion

Q - State a conclusion to make when the p-value $\geq α$ .

The results are not statistically significant.
We fail to reject the null hypothesis.
There is insufficient evidence at $α = 0.05$ to reject the null hypothesis.
The data do not provide convincing evidence for the alternative hypothesis.

18 / 21

Quiz: 4. Making a conclusion

Q - State a conclusion to make when the p-value $\geq α$ .

The results are not statistically significant.
We fail to reject the null hypothesis.
There is insufficient evidence at $α = 0.05$ to reject the null hypothesis.
The data do not provide convincing evidence for the alternative hypothesis.

Q - What are the two types of errors we can make?

18 / 21

Quiz: 4. Making a conclusion

Q - State a conclusion to make when the p-value $\geq α$ .

The results are not statistically significant.
We fail to reject the null hypothesis.
There is insufficient evidence at $α = 0.05$ to reject the null hypothesis.
The data do not provide convincing evidence for the alternative hypothesis.

Q - What are the two types of errors we can make?

P(Type I error) = $α$ = P(reject $H_{0}$ | $H_{0}$ is true)
P(Type II error) = $β$ = P(fail to reject $H_{0}$ | $H_{0}$ is false)

18 / 21

Quiz: 4. Making a conclusion

Q - State a conclusion to make when the p-value $\geq α$ .

The results are not statistically significant.
We fail to reject the null hypothesis.
There is insufficient evidence at $α = 0.05$ to reject the null hypothesis.
The data do not provide convincing evidence for the alternative hypothesis.

Q - What are the two types of errors we can make?

P(Type I error) = $α$ = P(reject $H_{0}$ | $H_{0}$ is true)
P(Type II error) = $β$ = P(fail to reject $H_{0}$ | $H_{0}$ is false)

Q - How do we assess the capability of a test for detecting "something interesting"?

18 / 21

Quiz: 4. Making a conclusion

Q - State a conclusion to make when the p-value $\geq α$ .

The results are not statistically significant.
We fail to reject the null hypothesis.
There is insufficient evidence at $α = 0.05$ to reject the null hypothesis.
The data do not provide convincing evidence for the alternative hypothesis.

Q - What are the two types of errors we can make?

P(Type I error) = $α$ = P(reject $H_{0}$ | $H_{0}$ is true)
P(Type II error) = $β$ = P(fail to reject $H_{0}$ | $H_{0}$ is false)

Q - How do we assess the capability of a test for detecting "something interesting"?

The power of a test: $1 - β$ = P(reject $H_{0}$ | $H_{0}$ is false)

18 / 21

Questions?

19 / 21

Let's Practice Together!

Go to AE 15: Hypothesis Testing 1

20 / 21

Bulletin

Mid-course evaluation due Friday, June 3 at 11:59pm
Project proposal due Friday, June 3 at 11:59pm
HW03 due Wednesday, June 8 at 11:59pm
Submit ae15 (Coin flips)

21 / 21

↑, ←, Pg Up, k	Go to previous slide
↓, →, Pg Dn, Space, j	Go to next slide
Home	Go to first slide
End	Go to last slide
Number + Return	Go to specific slide
b / m / f	Toggle blackout / mirrored / fullscreen mode
c	Clone slideshow
p	Toggle presenter mode
t	Restart the presentation timer
?, h	Toggle this help

Hypothesis Testing 1

Bora Jin

Material

Today's Goal

Statistical Inference

Statistical Inference

Hypothesis testing framework

Quiz: 1. Defining the hypotheses

Quiz: 1. Defining the hypotheses

Quiz: 1. Defining the hypotheses

Quiz: 1. Defining the hypotheses

Types of Alternative Hypotheses

Quiz: 1. Defining the hypotheses

Quiz: 1. Defining the hypotheses

Quiz: 1. Defining the hypotheses

Quiz: 1. Defining the hypotheses

Quiz: 1. Defining the hypotheses

Quiz: 1. Defining the hypotheses

Quiz: 3. Assessing the observed evidence

Quiz: 3. Assessing the observed evidence

Quiz: 3. Assessing the observed evidence

Quiz: 3. Assessing the observed evidence

Quiz: 3. Assessing the observed evidence

Quiz: 3. Assessing the observed evidence

Quiz: 4. Making a conclusion

Quiz: 4. Making a conclusion

Quiz: 4. Making a conclusion

Quiz: 4. Making a conclusion

Quiz: 4. Making a conclusion

Quiz: 4. Making a conclusion

Quiz: 4. Making a conclusion

Quiz: 4. Making a conclusion

Quiz: 4. Making a conclusion

Quiz: 4. Making a conclusion

Quiz: 4. Making a conclusion

Quiz: 4. Making a conclusion

Quiz: 4. Making a conclusion

Quiz: 4. Making a conclusion

Questions?

Let's Practice Together!

Bulletin

Material

Help