If you find any typos, errors, or places where the text may be improved, please let us know by providing feedback either in the feedback survey (given during class), by using GitLab, or directly in this document with hypothes.is annotations.
Reproducible Research in R
An introductory workshop on modern data analyses and workflows
Updated: September 13 2020
The course is designed as a series of participatory live-coding lessons, where the instructor and learner code together, has hands-on exercises interspersed throughout the course, and has a final group assignment of doing a simple data analysis project. This website contains all of the material for the course, from reading material to exercises, to images. It is structured as a book, with “chapters” as lessons, given in order of appearance. We make heavy use of the website throughout the course where code-along sessions follow the material on the website nearly exactly (with slight modifications for time or more detailed explanations).
The course material was created using rmarkdown to write the lessons,
bookdown to create the book format, GitLab to host the Git
repository of the material, and GitLab CI with Netlify to create the website.
The original source material for this course is found on the
r-cubed GitLab repository.
- hypothes.is, a web annotating tool. Go to “Get Started” to give feedback and add comments directly on this website.
- GitLab and creating a new Issue to make comments and give feedback for the material.
1.1 Re-use and licensing
The course material is licensed under the Creative Commons Attribution 4.0 International License and the course code is licensed under a MIT License, so the material can be used, re-used, and modified, as long as there is attribution to this source.
1.2 Previous versions
There have been two previous “formal” versions of the course given so far:
Contributors for version 1 were:
- Luke Johnston: Brainstormed most of the course material and structure; set up the website; organized and coordinated the course; wrote, prepared, and taught most of the code-along sessions, as well as several lectures.
- Daniel Witte: Participated in brainstorming on the course structure and laytout; prepared and taught most of the lectures.
- João Santiago: Taught and edited the R Markdown code-along session material.
- Anna Schritz: Taught and edited the Data Management and Wrangling code-along session material.
- Omar Silverman (helper) proofread and provided feedback on the material for several sessions.
Contributors for version 2 were:
- Luke Johnston: Same as with version 1.
- Daniel Witte: Prepared and taught most of the lectures.
- Helene Baek Juel: Edited and taught the Data Visualization code-along session material, as well as reviewed multiple other sections.
- Bettina Lengger: Edited and taught the Data Management and Wrangling code-along session material, as well as reviewed multiple other sections.
Contributors for version 2.1 were:
TODO: Add this once done.
Many pieces from the (in development) course Merely Useful were modified and used in several places throughout the r-cubed course. Other parts were taken and modified from:
- The Aarhus University Community of Research Software Users sessions on project management, data management and wrangling, intermediate dplyr and tidyr usage, data visualization, and R Markdown.
- A Git session, given for the UofTCoders.
- The UofTCoders Intro to R course for Ecology students.
The Danish Diabetes Academy hosted, organized, and sponsored versions 1 and 2 of the course. A huge thanks to them for their involvement, support, and sponsorship! Both Steno Diabetes Center Aarhus and Aarhus University were employers of Luke Johnston and Daniel Witte (the initial founders and creators).