The Centre for Investigative Journalism
The Centre for Investigative Journalism
Menu

Web Scraping Without Code

How to use the import functions for scraping data into Google Sheets, building a basic scraper with websraper.io and OCR 3 ways.

Technical Requirements

The class takes place at a Goldsmiths computer lab. Please create a Google Drive account, if you do not have one already. No laptops required, but you can use your own if you like.

4 July 2019 – #CIJSummer 2019/DAY 1

15:15–16:15
Web Scraping Without Code

Niamh McIntyre

Niamh McIntyre is a journalist on the Guardian’s data projects team, working on data-driven investigations and news stories. Recent stories include an investigation into schools using crowdfunding websites and coverage of the UK’s gender pay gap.

Pamela Duncan

Pamela Duncan is the editor of the Guardian’s Data Project team, an occasional award-winning journalist (#humblebrag) and a self-confessed data nerd. She can usually be found at her desk poring over spreadsheets and using her coding skills - usually a combination of scraping, regex and pandas/Python - to build and analyse datasets to produce high quality and exclusive data stories.
  • 4 July 2019 15.15–16.15
Location: Room 104 - PSH Building - Goldsmiths, University of London
Course
Beginner
Data