The Centre for Investigative Journalism

Investigating Data with R

In the first class, R-1, you’ll learn the basics of R and RStudio, import data, and learn some functions for getting to grips with your dataset, including sorting and filtering. This class assumes no prior experience with R.

In R-2 you’ll get down to some data wrangling and learn how to join datasets and carry out calculations in R that will allow you to identify trends in the data for storytelling. You’ll also learn statistical functions in R and how to use ggplot2 for basic visual analysis.

In the third and final class, R-3, you’ll use R to scrape, clean and structure data from webpages and APIs. You’ll also learn how to use R to convert, join and split difficult data files.

Technical Requirements

If you are a complete beginner, these sessions will work best if you attend all three classes, as we will be building on knowledge and datasets from class to class. However, if you have experience in R, you are free to join only classes 2 and/or 3.

5 July 2019 – #CIJSummer 2019/DAY 2

R 1: Introduction to R
R 2: Data Wrangling and Statistics in R
R 3: Scraping and APIs in R

Caelainn Barr

Caelainn Barr is Editor of Data Projects at the Guardian. She has worked at The Wall Street Journal, The Irish Times, Bloomberg and the Bureau of Investigative Journalism. She is a graduate of the Stabile Center for Investigative Journalism at Columbia Journalism School in New York.

Niamh McIntyre

Niamh McIntyre is a journalist on the Guardian’s data projects team, working on data-driven investigations and news stories. Recent stories include an investigation into schools using crowdfunding websites and coverage of the UK’s gender pay gap.
Date: 5 July 2019, 09.00–12.30
Location: Room 105 - PSH Building - Goldsmiths, University of London
All levels