Central Limit Theorem 2Bora Jin1 / 20

Material

🎥 Watch Inference Using Central Limit Theorem

Slides

Optional: 📖 Read

2 / 20

Today's Goal

Use Central Limit Theorem (CLT) to conduct inference on a population mean
Conduct CLT-based inference step-by-step and using the infer package
Understand t-distribution vs. standard normal, N(0,1) distribution

3 / 20

Quiz

Q - State the central limit theorem.

For a population with a well-defined mean $μ$ and standard deviation $σ$ , these three properties hold for the distribution of sample average $\bar{X}$ , assuming certain conditions hold:

4 / 20

Quiz

Q - State the central limit theorem.

For a population with a well-defined mean $μ$ and standard deviation $σ$ , these three properties hold for the distribution of sample average $\bar{X}$ , assuming certain conditions hold:

✅ The distribution of the sample statistic is

4 / 20

Quiz

Q - State the central limit theorem.

For a population with a well-defined mean $μ$ and standard deviation $σ$ , these three properties hold for the distribution of sample average $\bar{X}$ , assuming certain conditions hold:

✅ The distribution of the sample statistic is approximately normal

4 / 20

Quiz

Q - State the central limit theorem.

For a population with a well-defined mean $μ$ and standard deviation $σ$ , these three properties hold for the distribution of sample average $\bar{X}$ , assuming certain conditions hold:

✅ The distribution of the sample statistic is approximately normal

✅ The distribution is centered at

4 / 20

Quiz

Q - State the central limit theorem.

For a population with a well-defined mean $μ$ and standard deviation $σ$ , these three properties hold for the distribution of sample average $\bar{X}$ , assuming certain conditions hold:

✅ The distribution of the sample statistic is approximately normal

✅ The distribution is centered at the population parameter (often interest of inference)

4 / 20

Quiz

Q - State the central limit theorem.

For a population with a well-defined mean $μ$ and standard deviation $σ$ , these three properties hold for the distribution of sample average $\bar{X}$ , assuming certain conditions hold:

✅ The distribution of the sample statistic is approximately normal

✅ The distribution is centered at the population parameter (often interest of inference)

✅ The variability of the distribution is inversely proportional to the square root of

4 / 20

Quiz

Q - State the central limit theorem.

For a population with a well-defined mean $μ$ and standard deviation $σ$ , these three properties hold for the distribution of sample average $\bar{X}$ , assuming certain conditions hold:

✅ The distribution of the sample statistic is approximately normal

✅ The distribution is centered at the population parameter (often interest of inference)

✅ The variability of the distribution is inversely proportional to the square root of the sample size

4 / 20

Quiz

Q - Why do we care about the distribution of sample mean?

5 / 20

Quiz

Q - Why do we care about the distribution of sample mean?

We can estimate / test for a population mean.

We can construct a confidence interval or conduct a hypothesis test for the population mean using the CLT-based distribution in place of a simulation-based distribution of sample mean.

5 / 20

Quiz

Q - What is the distribution of sample mean by CLT?

When the population mean $μ$ and the population standard deviation $σ$ are known,

6 / 20

Quiz

Q - What is the distribution of sample mean by CLT?

When the population mean $μ$ and the population standard deviation $σ$ are known,

$\bar{X} \sim N (μ, σ / \sqrt{n}) \Leftrightarrow Z = \frac{\bar{X} - μ}{σ / \sqrt{n}} \sim N (0, 1)$

approximately, for a large enough $n$ .

6 / 20

Quiz

Q - What is the distribution of sample mean by CLT?

When the population mean $μ$ and the population standard deviation $σ$ are known,

$\bar{X} \sim N (μ, σ / \sqrt{n}) \Leftrightarrow Z = \frac{\bar{X} - μ}{σ / \sqrt{n}} \sim N (0, 1)$

approximately, for a large enough $n$ .

N(0,1) is standard normal distribution.
Often, a random variable following the standard normal distribution is denoted by $Z$ .

6 / 20

Quiz

Q - What if $σ$ is unknown?

7 / 20

Quiz

Q - What if $σ$ is unknown?

We approximate $σ$ with the sample standard deviation.

7 / 20

Quiz

Q - What if $σ$ is unknown?

We approximate $σ$ with the sample standard deviation.

$Z = \frac{\bar{X} - μ}{σ / \sqrt{n}} \to T = \frac{\bar{X} - μ}{S / \sqrt{n}}$ where $S^{2} = \sum_{i = 1}^{n} (X_{i} - \bar{X})^{2} / (n - 1)$

7 / 20

Quiz

Q - What if $σ$ is unknown?

We approximate $σ$ with the sample standard deviation.

$Z = \frac{\bar{X} - μ}{σ / \sqrt{n}} \to T = \frac{\bar{X} - μ}{S / \sqrt{n}}$ where $S^{2} = \sum_{i = 1}^{n} (X_{i} - \bar{X})^{2} / (n - 1)$

$σ$ replaced by $S$ ! (The realized value of $S$ from a sample is $s$ .)

7 / 20

Quiz

Q - What if $σ$ is unknown?

We approximate $σ$ with the sample standard deviation.

$Z = \frac{\bar{X} - μ}{σ / \sqrt{n}} \to T = \frac{\bar{X} - μ}{S / \sqrt{n}}$ where $S^{2} = \sum_{i = 1}^{n} (X_{i} - \bar{X})^{2} / (n - 1)$

$σ$ replaced by $S$ ! (The realized value of $S$ from a sample is $s$ .)
This change renders the random variable $T$ follow another distribution than the standard normal distribution, i.e., $T \sim t_{n - 1}$ where $t_{n - 1}$ is a t-distribution with $n - 1$ degrees of freedom.

7 / 20

Quiz

Q - List properties of the t-distribution.

Its shape is

8 / 20

Quiz

Q - List properties of the t-distribution.

Its shape is unimodal, symmetric, centered at 0 similarly to N(0,1).

8 / 20

Quiz

Q - List properties of the t-distribution.

Its shape is unimodal, symmetric, centered at 0 similarly to N(0,1).
Its tails are

8 / 20

Quiz

Q - List properties of the t-distribution.

Its shape is unimodal, symmetric, centered at 0 similarly to N(0,1).
Its tails are thicker than N(0,1).

8 / 20

Quiz

Q - List properties of the t-distribution.

Its shape is unimodal, symmetric, centered at 0 similarly to N(0,1).
Its tails are thicker than N(0,1).
It is fully defined by

8 / 20

Quiz

Q - List properties of the t-distribution.

Its shape is unimodal, symmetric, centered at 0 similarly to N(0,1).
Its tails are thicker than N(0,1).
It is fully defined by the degrees of freedom.

8 / 20

Quiz

Q - Black solid line is N(0,1). What is the t-distribution with df = 1, 3, 10, and 30?

9 / 20

Quiz

Q - Black solid line is N(0,1). What is the t-distribution with df = 1, 3, 10, and 30?

thicker tails
As the degrees of freedom increases, the t-distribution becomes more like N(0,1)

10 / 20

Quiz

Q - What is an appropriate code to calculate $P (T < 1.2)$ where $T \sim t_{5}$ ?

11 / 20

Quiz

Q - What is an appropriate code to calculate $P (T < 1.2)$ where $T \sim t_{5}$ ?

pt(1.2, df = 5)

## [1] 0.8580545

11 / 20

Quiz

Q - What is an appropriate code to calculate $P (- 2 < T < 3)$ where $T \sim t_{10}$ ?

12 / 20

Quiz

Q - What is an appropriate code to calculate $P (- 2 < T < 3)$ where $T \sim t_{10}$ ?

pt(3, df = 10) - pt(-2, df = 10)

## [1] 0.9566342

12 / 20

Quiz

Q - What is an appropriate code to find q s.t. $P (T > q) = 0.25$ where $X \sim t_{7}$ ?

13 / 20

Quiz

Q - What is an appropriate code to find q s.t. $P (T > q) = 0.25$ where $X \sim t_{7}$ ?

qt(0.25, df = 7, lower.tail = FALSE)

## [1] 0.7111418

13 / 20

Quiz: HT

Let's conduct a hypothesis test for $H_{0} : μ = 5$ vs. $H_{1} : μ \neq 5$ . We don't know the population standard deviation. We have a random sample of size 100. The CLT conditions are checked.

Q - What is the test statistic and its null distribution by CLT?

14 / 20

Quiz: HT

Let's conduct a hypothesis test for $H_{0} : μ = 5$ vs. $H_{1} : μ \neq 5$ . We don't know the population standard deviation. We have a random sample of size 100. The CLT conditions are checked.

Q - What is the test statistic and its null distribution by CLT?

The test statistic is calculated by $t = \frac{\bar{x} - 5}{s / 10}$ .

14 / 20

Quiz: HT

Let's conduct a hypothesis test for $H_{0} : μ = 5$ vs. $H_{1} : μ \neq 5$ . We don't know the population standard deviation. We have a random sample of size 100. The CLT conditions are checked.

Q - What is the test statistic and its null distribution by CLT?

The test statistic is calculated by $t = \frac{\bar{x} - 5}{s / 10}$ .
Under the null, $T = \frac{\bar{X} - 5}{S / 10} \sim t_{99}$
Capital letters for random variables and lowercase letters for observed values

14 / 20

Quiz: HT

Q - What does it mean that the test statistic is 3.5?

15 / 20

Quiz: HT

Q - What does it mean that the test statistic is 3.5?

The observed sample mean $\bar{x}$ is 3.5 standard errors above the hypothesized population mean, 5.

15 / 20

Quiz: CI

Q - What is the formula to obtain a $1 - α$ confidence interval for $μ$ ?

16 / 20

Quiz: CI

Q - What is the formula to obtain a $1 - α$ confidence interval for $μ$ ?

$\bar{x} \pm t_{n - 1}^{*} \times \frac{s}{\sqrt{n}}$

where $t_{n - 1}^{*}$ is a critical value that satisfies $P (T > t_{n - 1}^{*}) = α / 2$ for $T \sim t_{n - 1}$ .

16 / 20

Quiz: CI

Q - What is the formula to obtain a $1 - α$ confidence interval for $μ$ ?

$\bar{x} \pm t_{n - 1}^{*} \times \frac{s}{\sqrt{n}}$

where $t_{n - 1}^{*}$ is a critical value that satisfies $P (T > t_{n - 1}^{*}) = α / 2$ for $T \sim t_{n - 1}$ .

Q - What is the R function to calculate $t_{n - 1}^{*}$ ?

16 / 20

Quiz: CI

Q - What is the formula to obtain a $1 - α$ confidence interval for $μ$ ?

$\bar{x} \pm t_{n - 1}^{*} \times \frac{s}{\sqrt{n}}$

where $t_{n - 1}^{*}$ is a critical value that satisfies $P (T > t_{n - 1}^{*}) = α / 2$ for $T \sim t_{n - 1}$ .

Q - What is the R function to calculate $t_{n - 1}^{*}$ ?

qt(alpha/2, df = n-1, lower.tail = FALSE)

16 / 20

Quiz

Q - What is the function in the infer package to use for CLT-based inference when $σ$ is unknown?

17 / 20

Quiz

Q - What is the function in the infer package to use for CLT-based inference when $σ$ is unknown?

t_test()

17 / 20

Questions?

18 / 20

Let's Practice Together!

Go to AE 18: Central Limit Theorem 2

19 / 20

Bulletin

Tomorrow is Ask-for-Help day. Bring your questions.
Lab06 due tonight at 11:59pm
HW03 due Wednesday, June 8 at 11:59pm
Tomorrow (June 8) is the last day to withdraw with W
Submit ae18

20 / 20

↑, ←, Pg Up, k	Go to previous slide
↓, →, Pg Dn, Space, j	Go to next slide
Home	Go to first slide
End	Go to last slide
Number + Return	Go to specific slide
b / m / f	Toggle blackout / mirrored / fullscreen mode
c	Clone slideshow
p	Toggle presenter mode
t	Restart the presentation timer
?, h	Toggle this help

Central Limit Theorem 2

Bora Jin

Material

Today's Goal

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz

Quiz: HT

Quiz: HT

Quiz: HT

Quiz: HT

Quiz: HT

Quiz: CI

Quiz: CI

Quiz: CI

Quiz: CI

Quiz

Quiz

Questions?

Let's Practice Together!

Bulletin

Material

Help