M10 - Data Scraping

ENV 872 - EDA   |   Spring 2024   |   Instructors: Luana Lima, John Fay

Learning Goals

  • Examine what data scraping is, when it is useful, and when it is appropriate.
  • Review the code behind web pages (HTML) and how data are embedded in it.
  • Install and use the SelectorGadget extension in Chrome to identify page elements.
  • Use the rvest package to extract elements from web pages into the coding environment.
  • Design R functions to scrape data from a single page, then apply those functions across many pages (web crawling).
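
As a preview of the last two goals, here is a minimal sketch of the scrape-then-repeat pattern using rvest. It is an illustration only, not the lesson's own code: the two pages are built in memory with minimal_html() so the example runs without a network connection, and the CSS selectors (.site, .value), the sample values, and the function name scrape_page are all hypothetical. With a real site, each page would instead come from read_html() on a URL, and the selectors would be the ones identified with SelectorGadget.

```r
library(rvest)

# Hypothetical stand-ins for two downloaded pages. In practice each page
# would be fetched with read_html() on a real URL.
pages <- list(
  minimal_html('<h1 class="site">Site A</h1><span class="value">12.5</span>'),
  minimal_html('<h1 class="site">Site B</h1><span class="value">7.25</span>')
)

# Scrape one page: select elements by CSS selector, then pull out their text.
# The selectors ".site" and ".value" are assumptions made for this example.
scrape_page <- function(page) {
  site  <- page %>% html_element(".site")  %>% html_text2()
  value <- page %>% html_element(".value") %>% html_text2() %>% as.numeric()
  data.frame(site = site, value = value)
}

# Apply the same function to every page and stack the rows into one table.
results <- do.call(rbind, lapply(pages, scrape_page))
print(results)
```

Wrapping the per-page logic in one function keeps the selectors in a single place, so scraping more pages only means extending the list that lapply() iterates over.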

Recordings

Lesson Documents

Lab Document

Assignment

Additional Resources