M2: Gathering Data for Analysis¶
This second module provides an introduction to different data formats you may encounter and ways to gather data from the web.
Pre-recorded Lectures¶
There are no pre-recorded lectures for module #3.
Supplementary Resources¶
The slides/code presented in lecture are accessible in the upstream repository under the
m2/resources
directory.- Other Resources
Lab¶
Labs provide additional practice problems for topics covered in a module. You may work on them individually or with your peers.
BeautifulSoup Lab (You find distribution code in the upstream repository in the
/labs/bs4
directory)
Installing BeautifulSoup¶
To install BeautifulSoup on your class CS Linux Server, run:
pip3 install --upgrade beautifulsoup4
pip3 install --upgrade html5lib
Programming Assignment¶
Programming Assignment #2, due Saturday, January 29th at 4:30pm CDT