Data Visualization

Code for quiz 9

  1. Load the R package we will use.
  1. Quiz questions Replace all the instances of ‘SEE QUIZ’. These are inputs from your moodle quiz. Replace all the instances of ‘???’. These are answers on your moodle quiz. Run all the individual code chunks to make sure the answers in this file correspond with your quiz answers

Question: e_charts-1

spend_time contains 10 years of data on how many hours Americans spend each day on 5 activities

read it into spend_time

spend_time  <- read_csv("https://estanny.com/static/week8/spend_time.csv")

e_charts-1

Start with spend_time

spend_time %>%
  group_by(year)  %>%
  e_charts(x = activity , timeline = TRUE) %>%
  e_timeline_opts (autoPlay = TRUE)  %>%
  e_bar(serie = avg_hours)  %>%
  e_title (text ='Average hours Americans spend per day on each activity')  %>%
  e_legend(show = FALSE)

Question: echarts-2

Create a line chart for the activities that American spend time on.

Start with spend_time

spend_time  %>%
  mutate(year = paste(year, "12","31", sep = "-"))  %>%
  mutate(year = lubridate::ymd(year))  %>%
  group_by(activity)  %>%
  e_charts(x= year)  %>%
  e_line(serie = avg_hours)  %>%
  e_tooltip()  %>%
  e_title(text = 'Average hours Americans spend per day on each activity')  %>%
  e_legend(top = 40)

Question - modify slide 82

Create a plot with the spend_time data

filter on activity == “leisure/sports”

description is “Americans spend the most time on leisure/sport”

ggplot(spend_time , aes(x = year, y = avg_hours , color = activity)) +
geom_point() +
geom_mark_ellipse(aes(filter = activity == "leisure/sports",
 description= "Americans spend on average more time each day on leisure/sports than 
the other activities"))

Question: tidyquant

Modify the tidyquant example in the video

df  <- tq_get("FB", get = "stock.prices",
          from = "2019-08-01", to = "2020-07-28" )

Create a plot with the df data

ggplot(df, aes(x = date, y = close)) +
  geom_line() +
  geom_mark_ellipse(aes(
    filter = date == "2020-01-08",
    description = "The CDC issued its first public alert about the coronavirus"
  ), fill = "yellow") +
  geom_mark_ellipse(aes(
   filter  = date == "2020-03-23",
    description = "WHO describes pandemic as `accelerating`"
  ), color = "red", ) +
  labs(
    title = "Facebook",
    x = NULL,
    y = "Closing price per share",
   caption = "Source: https://en.wikipedia.org/wiki/Timeline_of_the_COVID-19_pandemic_in_the_United_States"
  )

Save the previous plot to preview.png and add to the yaml chunk at the top

ggsave(filename = "preview.png",
       path = here::here("_posts", "2021-04-13-data-visualization"))