M10 - Data Scraping
Learning Goals
- Examine what data scraping is, when it is useful, and when it’s appropriate.
- Review the code behind web pages (HTML) and how data are embedded in it.
- Install and use the SelectorGadget extension in Chrome to identify page elements.
- Use the rvest package to extract elements from web pages into the coding environment.
- Design R functions that scrape data from a single page, then apply these functions across many pages (web crawling).
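
As a preview of the last two goals, here is a minimal sketch of what extracting elements with rvest and reusing a scraping function across pages might look like. It is illustrative only: the two pages are built in memory with `minimal_html()` so the code runs offline, the `.title` and `.price` selectors are made up for this example, and `scrape_page()` is a hypothetical helper, not part of rvest.

```r
library(rvest)

# Hypothetical in-memory pages standing in for real URLs (assumption for this sketch).
pages <- list(
  minimal_html('<h1 class="title">Page A</h1><p class="price">10</p>'),
  minimal_html('<h1 class="title">Page B</h1><p class="price">25</p>')
)

# Illustrative helper: pull one title and one price out of a parsed page.
# The CSS selectors are the kind of thing SelectorGadget helps you find.
scrape_page <- function(page) {
  data.frame(
    title = html_text2(html_element(page, ".title")),
    price = as.numeric(html_text2(html_element(page, ".price")))
  )
}

# Apply the same function to every page and stack the rows.
results <- do.call(rbind, lapply(pages, scrape_page))
print(results)
```

With real sites, each `minimal_html()` call would typically be replaced by `read_html()` on a URL, and the selectors would need to match that site's markup.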