Digital Scraping

Introduction to Digital Scraping: One of the most time-consuming parts of any data analysis is getting the data in the first place. Often a straightforward, well-structured database doesn't exist, which means you need to build one yourself from scratch. That's where scraping comes in: you can write a program to automate the collection for you, saving countless hours of tedious and error-prone data entry. In this one-day class, you'll learn how to decide on a structure for your data, pick the right scraping approach, create a scraper, and systematize your data collection. The class will introduce the basic concepts and strategies behind scraping and focus on extracting data from both websites and offline documents (such as PDFs).

Instructor

Date

Friday, July 20, 2018 - 09:00 to 17:00

Location

K214

Pre-requisites

Basic familiarity with R, Python or JavaScript.