3. Wrangling

Data wrangling skills are among the most important to hone.

Author
Published

September 9, 2024

Artwork by @allison_horst.

Agenda

September 9 + 11, 2024

  1. Tidy data
  2. Data verbs

September 16, 2024

  1. Relational data (_join)
  2. pivoting

September 18, 2024

  1. mapping

Readings

Reflection questions

  • How and why is |> used? And how is it different from the layering symbol + used in ggplot()?

  • What are the main data wrangling verbs?

  • How do you distinguish the different _join functions? Are the _join keys formatted in the same way across the two datasets? Are the data recorded in the same way (e.g., is age birthday or age at recording?) ?

  • What are some of the ways to distinguish a data verb from a typical function?

Ethics considerations

  • What is Jan 31 plus one month? And why does it matter that every analysis we do is a series of decisions? Keeping in mind that each of us might make a different decision, and all decisions have consequences.

Slides

Additional Resources

:::