+ - 0:00:00
Notes for current slide
Notes for next slide

Central Limit Theorem 1

Bora Jin

1 / 17

Today's Goal

  • Use Central Limit Theorem to define distribution of sample means
  • Calculate probabilities from the normal distribution
3 / 17

Quantifying Variability

We can quantify the variability of sample statistics using different approaches:

  • Simulation: via bootstrapping or "resampling" techniques

or

  • Theory: via the Central Limit Theorem
4 / 17

Quantifying Variability

We can quantify the variability of sample statistics using different approaches:

  • Simulation: via bootstrapping or "resampling" techniques

or

  • Theory: via the Central Limit Theorem

Today we will focus on Theory.

4 / 17

Quiz

Q - What is a sampling distribution of the sample mean?

5 / 17

Quiz

Q - What is a sampling distribution of the sample mean?

From one random sample of size n, calculate the sample mean X¯1

5 / 17

Quiz

Q - What is a sampling distribution of the sample mean?

From one random sample of size n, calculate the sample mean X¯1

From a second random sample of size n, calculate the sample mean X¯2

5 / 17

Quiz

Q - What is a sampling distribution of the sample mean?

From one random sample of size n, calculate the sample mean X¯1

From a second random sample of size n, calculate the sample mean X¯2

Repeat this many times.

5 / 17

Quiz

Q - What is a sampling distribution of the sample mean?

From one random sample of size n, calculate the sample mean X¯1

From a second random sample of size n, calculate the sample mean X¯2

Repeat this many times.

We call the distribution of X¯ the sampling distribution.

5 / 17

Quiz

Q - Apply the central limit theorem (CLT) on sample means.

Let a random variable X have a mean μ and standard deviation σ. Then the sampling distribution of the sample mean X¯=X1+X2++Xnn

6 / 17

Quiz

Q - Apply the central limit theorem (CLT) on sample means.

Let a random variable X have a mean μ and standard deviation σ. Then the sampling distribution of the sample mean X¯=X1+X2++Xnn

  • Has the mean μ
6 / 17

Quiz

Q - Apply the central limit theorem (CLT) on sample means.

Let a random variable X have a mean μ and standard deviation σ. Then the sampling distribution of the sample mean X¯=X1+X2++Xnn

  • Has the mean μ

  • Has the standard error σ/n

standard error of a sample mean = standard deviation of its sampling distribution or an estimate of that standard deviation

6 / 17

Quiz

Q - Apply the central limit theorem (CLT) on sample means.

Let a random variable X have a mean μ and standard deviation σ. Then the sampling distribution of the sample mean X¯=X1+X2++Xnn

  • Has the mean μ

  • Has the standard error σ/n

standard error of a sample mean = standard deviation of its sampling distribution or an estimate of that standard deviation

  • If the sample size n is large enough, the sampling distribution of X¯ is approximately normally distributed.

As n, X¯ converges in distribution to N(μ,σ/n).

6 / 17

Quiz

Q - Describe density of a normal distribution.

7 / 17

Quiz

Q - Describe density of a normal distribution.

  • unimodal (peak at μ)
  • symmetric around μ
  • bell-shaped

7 / 17

Quiz

Q - The CLT holds only if XN(μ,σ). (T/F)

8 / 17

Quiz

Q - The CLT holds only if XN(μ,σ). (T/F)

F

8 / 17

Quiz

Q - The CLT holds only if XN(μ,σ). (T/F)

F

X can be from any distribution with a mean μ and standard deviation σ.

Let's play with weird looking original distributions - Click!

8 / 17

Quiz

Q - What are the two conditions for CLT to hold?

9 / 17

Quiz

Q - What are the two conditions for CLT to hold?

Independence

  • {X1,,Xn} must be independent to one another
  • One observation's value should not "influence" another observation's value.
  • Rules of thumb to check independence:
    • Completely random sampling
    • If taken without replacement, n should be less than 10% of the population size
9 / 17

Quiz

Q - What are the two conditions for CLT to hold?

10 / 17

Quiz

Q - What are the two conditions for CLT to hold?

Sample size n / distribution

  • If X is numerical, n>30
  • If X is categorical, at least 10 successes and 10 failures
  • If XN(μ,σ), then the distribution of sample means will also be exactly normal, regardless of the sample size.
10 / 17

Recap

  • If certain assumptions are satisfied, regardless of the shape of the population distribution, the sampling distribution of the mean follows an approximately normal distribution.
11 / 17

Recap

  • If certain assumptions are satisfied, regardless of the shape of the population distribution, the sampling distribution of the mean follows an approximately normal distribution.

  • The center of the sampling distribution is at the center of the population distribution.

11 / 17

Recap

  • If certain assumptions are satisfied, regardless of the shape of the population distribution, the sampling distribution of the mean follows an approximately normal distribution.

  • The center of the sampling distribution is at the center of the population distribution.

  • The sampling distribution is less variable than the population distribution by a factor of 1/n.

11 / 17

Recap

  • If certain assumptions are satisfied, regardless of the shape of the population distribution, the sampling distribution of the mean follows an approximately normal distribution.

  • The center of the sampling distribution is at the center of the population distribution.

  • The sampling distribution is less variable than the population distribution by a factor of 1/n.

  • As n increases, the standard error (the spread of the sampling distribution) decreases.

11 / 17

Quiz

Q - What is an appropriate code to calculate P(Z<1.2) where ZN(0,1)?

12 / 17

Quiz

Q - What is an appropriate code to calculate P(Z<1.2) where ZN(0,1)?

pnorm(1.2)
## [1] 0.8849303
12 / 17

Quiz

Q - What is an appropriate code to calculate P(2<X<7) where XN(1,3)?

13 / 17

Quiz

Q - What is an appropriate code to calculate P(2<X<7) where XN(1,3)?

pnorm(7, mean = 1, sd = 3) - pnorm(-2, mean = 1, sd = 3)
## [1] 0.8185946
13 / 17

Quiz

Q - What is an appropriate code to find q s.t. P(X>q)=0.05 where XN(1,2)?

14 / 17

Quiz

Q - What is an appropriate code to find q s.t. P(X>q)=0.05 where XN(1,2)?

qnorm(0.05, mean = -1, sd = 2, lower.tail = FALSE)
## [1] 2.289707
14 / 17

Questions?

15 / 17

Let's Practice Together!

Go to AE 17: Central Limit Theorem

16 / 17

Bulletin

  • Watch videos for Prepare: June 7

  • Project proposal feedback released

  • Lab06 due Tuesday, June 7 at 11:59pm

  • HW03 due Wednesday, June 8 at 11:59pm

  • Submit ae17

17 / 17
Paused

Help

Keyboard shortcuts

, , Pg Up, k Go to previous slide
, , Pg Dn, Space, j Go to next slide
Home Go to first slide
End Go to last slide
Number + Return Go to specific slide
b / m / f Toggle blackout / mirrored / fullscreen mode
c Clone slideshow
p Toggle presenter mode
t Restart the presentation timer
?, h Toggle this help
Esc Back to slideshow