Final Project Proposal

Bora Jin


Project Proposal Steps

  1. Find a dataset that satisfies the final project guidelines

  2. Write about:

    • the source of data
    • when and how it was originally collected (by the curator, not necessarily how you found the data)
    • a brief description of the observations and variables you intend to explore
  3. Choose 1-2 research questions

  4. glimpse the data


Ex: Introduction / data

Dataset #1: NC Courage Homefield Advantage

Our first dataset comes from the National Women's Soccer League (NWSL) GitHub and was sourced from nwslsoccer.com.

The dataset contains 78 observations (soccer games) played by the NC Courage over three seasons: 2017, 2018, and 2019. There are 10 variables in this dataset. The variables we care about most are home_team, away_team, and result (of the game).


Ex: Research question(s):

  1. Does NC Courage have a home-field advantage? We hypothesize that NC Courage is more likely to win on their home field than on another team's field.

    • To answer this question we will use information about the home_team and the result of the game.
  2. Does winning propagate winning? When NC Courage wins a game, does that increase the probability of winning the very next game?

    • To answer this question we will use information about the result of the game and the game_number.
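
As a rough sketch of how these two questions might be explored, the code below rebuilds only the first six games shown in the glimpse output of this deck (not the full 78-game dataset) and computes win rates by location and by the previous game's result. The names courage_sample, location, prev_result, home_adv, and streak are illustrative and not part of the original data. The sketch assumes result is recorded from NC Courage's perspective and that game_number orders the games.

```r
library(dplyr)

# First six games, copied from the glimpse() output in this deck.
# This is a small sample for illustration, not the full dataset.
courage_sample <- tibble(
  game_number = 1:6,
  home_team   = c("WAS", "NC", "NC", "BOS", "ORL", "NC"),
  away_team   = c("NC", "POR", "ORL", "NC", "NC", "CHI"),
  result      = c("win", "win", "win", "win", "loss", "loss")
)

# Question 1: compare the win rate at home with the win rate away.
home_adv <- courage_sample %>%
  mutate(location = if_else(home_team == "NC", "home", "away")) %>%
  group_by(location) %>%
  summarise(games = n(), win_rate = mean(result == "win"))

# Question 2: compare the win rate after a win with the win rate after a loss.
# lag() looks at the previous row, so rows are sorted by game_number first.
# Assumption: consecutive game numbers are consecutive games; with the full
# data you may want to avoid carrying a result across a season boundary.
streak <- courage_sample %>%
  arrange(game_number) %>%
  mutate(prev_result = lag(result)) %>%
  filter(!is.na(prev_result)) %>%
  group_by(prev_result) %>%
  summarise(games = n(), win_rate = mean(result == "win"))

home_adv
streak
```

Six games are far too few to draw any conclusion; the point is only to show which variables each question needs.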

Ex: Glimpse

glimpse(courage)
## Rows: 78
## Columns: 10
## $ game_id <chr> "washington-spirit-vs-north-carolina-courag…
## $ game_date <chr> "4/15/2017", "4/22/2017", "4/29/2017", "5/7…
## $ game_number <dbl> 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, …
## $ home_team <chr> "WAS", "NC", "NC", "BOS", "ORL", "NC", "NC"…
## $ away_team <chr> "NC", "POR", "ORL", "NC", "NC", "CHI", "NJ"…
## $ opponent <chr> "WAS", "POR", "ORL", "BOS", "ORL", "CHI", "…
## $ home_pts <dbl> 0, 1, 3, 0, 3, 1, 2, 3, 2, 3, 0, 0, 2, 1, 1…
## $ away_pts <dbl> 1, 0, 1, 1, 1, 3, 0, 2, 0, 1, 1, 1, 0, 0, 0…
## $ result <chr> "win", "win", "win", "win", "loss", "loss",…
## $ season <dbl> 2017, 2017, 2017, 2017, 2017, 2017, 2017, 2…
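
The glimpse output shows game_date stored as character (<chr>) in month/day/year form. If a proposal needs to sort or filter by date, one possible approach is to convert it with base R's as.Date(). This is a minimal sketch using only the first four dates visible above; the names dates_sample and game_date_parsed are hypothetical.

```r
library(dplyr)

# First four dates, copied from the glimpse() output above
dates_sample <- tibble(
  game_date = c("4/15/2017", "4/22/2017", "4/29/2017", "5/7/2017")
)

# Parse month/day/year strings into Date values
dates_sample <- dates_sample %>%
  mutate(game_date_parsed = as.Date(game_date, format = "%m/%d/%Y"))

glimpse(dates_sample)
```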
