Keele R Workshop

Dorothy Hodgkin 0.31, Keele University.


In this workshop we’ll cover the motivation behind engaging in reproducible research methods, the basics of R, data wrangling, and data visualisation. In the afternoon we’ll look at some standard statistical methods using the general linear model.

We will finish off by looking at how you can use Binder and GitHub to engage in fully reproducible research.

Before the Workshop - Installing R and RStudio

You will probably want to bring a laptop for this course. Beforehand, you should install R (the language) and RStudio (the interface that helps us interact with R) - each is available for OSX, Windows, and various flavours of Unix. You can install R from here:

And RStudio from here:

The Course Material

You can download all the slides, R scripts, R Markdown scripts, and datasets that I will use in the workshop by clicking on the “Clone or download” buton on my GitHub repo and then selecting “Download ZIP”.

Here you’ll find separate Morning and Afternoon folders. The Morning and Afternoon folders contain the slides, R and R Markdown scripts, and the data I use in the slides. Additionally, you’ll also find a Cheat Sheets and Handy Guides folder which contains a number of helpful reference resources.


You can view the first worksheet here:

And the second worksheet here:

Online R Resources

Below are some helpful R resources - it would be useful to look at the first one before the workshop.

Online introductory guide to R, RStudio, and R Markdown. This is a very clear and focused introduction to R, RStudio, and R Markdown. You probably want to read the first four chapters sooner rather than later…

R for Data Science online book - Garrett Grolemund and Hadley Wickham This is the online interactive version of the book of the same name. It focuses more on the data science side of things than on statistics per se, and is very useful (especially in terms of data wrangling).

R Graphics Cookbook The following cookbook contains lots of useful examples of graphing using the ggplot2 package in R.

If you have any questions in advance of the course, feel free to drop me an email (

Andrew Stewart
Fellow of the Software Sustainability Institute and Senior Lecturer

My interests include open research and reproducibility, data science, experimental psychology, and psycholinguistics. I am a fellow of the Software Sustainability Institute.