lab02.Rmd
to open the template R Markdown file.In addition, code should not exceed the 80 character limit. To help police this, add a vertical line at 80 characters by clicking “Tools” \(\rightarrow\) “Global Options” \(\rightarrow\) “Code” \(\rightarrow\) “Display”, then set “Margin Column” to 80 and click “Apply”.
Finally, all plots should follow best visualization practices; plots should include:
We need the tidyverse and fivethirtyeight for this lab.
library(tidyverse)
library(fivethirtyeight) # for data
The Avengers data were originally collected for a FiveThirtyEight article.
We will use a dataset called avengers
inside the fivethirtyeight
package.
This dataset includes information about characters across the entire Marvel Cinematic Universe (MCU), so some of the names will be familiar if you are a fan of the films or comics. Don’t worry if you aren’t a Marvel fan; no background knowledge is needed to successfully complete this lab!
Check the documentation of the data by typing ?avengers
in your console.
active_avengers
using a single pipeline: Use filter
to only include Avengers 1) who were not given probationary status, 2) who are still active, and 3) whose introduction year is not 1900 as “1900” indicates the value of full_reserve_avengers_intro
is missing. Confirm that once you have filtered, you are left with a data frame with 67 observations.🧶 ✅ ⬆️ Knit, commit, and push your final changes to GitHub with a meaningful commit message.
active_avengers
, create a new variable called years_served
that represents the number of years served as of 2022 (Hint: you can use either the year
variable or years_since_joining
variables to do this).years_served
, select the name_alias
and years_served
, and display the first five rows.
years_served
) as your x variable versus the number of comic books that the character appeared in (appearances
) as your y variable with points colored and shaped by gender. Please label your axes and legend and give the plot a title. You might find scale_color_viridis_d(option = "inferno", begin = 0.2, end = 0.8)
useful.
geom_smooth
with the argument se = FALSE
to add a smooth curve fit to the data without confidence bands around the line. Which plot do you prefer - this plot or the plot in Exercise 4? Briefly explain your choice.🧶 ✅ ⬆️ Knit, commit, and push your final changes to GitHub with a meaningful commit message.
active_avengers
differs from the percentage of females among all Avengers. What do you conclude based on these results?🧶 ✅ ⬆️ Knit, commit, and push your final changes to GitHub with a meaningful commit message.
Sort the active_avengers
dataset in descending order of appearances and display only the columns name_alias
, appearances
, death1
, and return1
for the top five observations. What do you observe about these Avengers in terms of deaths and returns?
Use the active_avengers
dataset to examine the mean and median number of appearances for Avengers who have died at least once compared to those who have not died.
Confirm your guesses about the distributions of appearances in Exercise 8 by 1) side-by-side boxplots and 2) density plots. Use appearances
as your x variable and map death1
to appropriate aesthetic arguments. Fill the density plots with different colors by the value of death1
and set alpha = 0.5
.
🧶 ✅ ⬆️ Knit, commit, and push your final changes to GitHub with a meaningful commit message.
Once you are fully satisfied with your lab, Knit to .pdf to create a PDF document.
Follow the instructions in previous labs to submit your PDF to Gradescope.
Be sure to identify which problems are on each page using Gradescope.
Once you are finished with the lab, you will submit the PDF document produced from your final knit, commit, and push to Gradescope.
Before you wrap up the assignment, make sure all documents are updated on your GitHub repo. We will be checking these to make sure you have been practicing how to commit and push changes. Remember – you must turn in a .pdf file to the Gradescope page by the submission deadline to be considered “on time”.
To submit your assignment:
Component | Points |
---|---|
Ex 1 | 4 |
Ex 2 | 4 |
Ex 3 | 7 |
Ex 4 | 6 |
Ex 5 | 4 |
Ex 6 | 4 |
Ex 7 | 4 |
Ex 8 | 5 |
Ex 9 | 5 |
Workflow & formatting | 7 |
Grading notes: