1. Start with R + Git

The importance of reproducibility. Ideas of computational statistics, data science, and machine learning. Some resources for starting with R + RStudio + Git + GitHub.

Jo Hardin https://m154-comp-stats.netlify.app/
08-31-2021
Judge monster confirming that the RStudio monster has reproducible work.

Figure 1: Artwork by @allison_horst.

Agenda

August 31, 2021

  1. Questionnaire
  2. Syllabus & Course Outline
  3. Stitch Fix Algorithm
  4. College Rankings
  5. Can Twitter predict election results?

Before next Thursday, listen to the full conversation of Not So Standard Deviations - Compromised Shoe Situation.

September 2, 2021

  1. Reproducibility & GitHub
  2. Design Challenge (Not So Standard Deviations)

Before next Tuesday, read: Tufte. 1997. Visual and Statistical Thinking: Displays of Evidence for Making Decisions. (Use Google to find it.)

Readings

Reflection questions

Ethics considerations

Slides

Additional Resources

Corrections

If you see mistakes or want to suggest changes, please create an issue on the source repository.

Reuse

Text and figures are licensed under Creative Commons Attribution CC BY 4.0. Source code is available at https://github.com/hardin47/m154-comp-stats, unless otherwise noted. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".