Dataset: City

The original data from Chicago’s data portal contains detailed information for each reported crime and each call to 311. We split the city into regions using a simple grid and aggregated the data by region.

Each city data file contains, on a per-region basis, the number of calls to 311 for several types of complaints and the total number of crimes. The first row in the file contains column labels, for example, GRAFFITI or POT_HOLES. Each subsequent row contains the data for one region of the city, and each column contains the values of one variable across all regions. For example, the column with index 1 (the second column) contains the number of calls about potholes for each region. In addition to the complaint columns, the file has one column, CRIME_TOTALS, that contains the total number of crimes in each region.
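
The layout described above can be read with a few lines of Python. This is a minimal sketch, not the handout's own loader: the function name load_table and the sample rows are made up for illustration, and the real file names under data/city are not given here.

```python
import csv
import io

import numpy as np

# Made-up sample in the layout described above: a header row of column
# labels, then one row per region. The numbers are illustrative only.
SAMPLE = """GRAFFITI,POT_HOLES,CRIME_TOTALS
12,30,150
5,44,210
20,9,75
"""


def load_table(handle):
    """Return (labels, data), where data has one row per region."""
    reader = csv.reader(handle)
    labels = next(reader)  # the first row holds the column labels
    data = np.array([[float(x) for x in row] for row in reader if row])
    return labels, data


labels, data = load_table(io.StringIO(SAMPLE))
# Select a whole column by label; POT_HOLES is the column with index 1.
pot_holes = data[:, labels.index("POT_HOLES")]
```

To read a real file, pass an open file handle instead of the in-memory sample.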

File paths:
data/city

Parameters:
{"name": "City",
 "predictor_vars": [0, 1, 2, 3, 4, 5, 6],
 "dependent_var": 7,
 "training_fraction": 0.55,
 "seed": 22992}

City Task 1a:

CRIME_TOTALS ~ 575.687669 + 0.678349 * GRAFFITI
R2: 0.14027491610313492

CRIME_TOTALS ~ -22.208880 + 5.375417 * POT_HOLES
R2: 0.6229070858532731

CRIME_TOTALS ~ 227.414583 + 7.711958 * RODENTS
R2: 0.5575360783921093

CRIME_TOTALS ~ 11.553128 + 18.892669 * GARBAGE
R2: 0.7831498392992615

CRIME_TOTALS ~ -65.954319 + 13.447459 * STREET_LIGHTS
R2: 0.7198560514392484

CRIME_TOTALS ~ 297.222082 + 10.324616 * TREE_DEBRIS
R2: 0.32659079486818354

CRIME_TOTALS ~ 308.489056 + 10.338500 * ABANDONED_BUILDINGS
R2: 0.6897288976957778
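
Each Task 1a line pairs a one-variable linear model with its R2. A minimal sketch of how such a pair could be computed with ordinary least squares follows. The helper names fit_linear and r_squared are hypothetical, the data is made up, and the printed format only imitates the listing above.

```python
import numpy as np


def fit_linear(X, y):
    """Least-squares coefficients; beta[0] is the intercept."""
    A = np.column_stack([np.ones(len(X)), X])  # prepend a column of ones
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    return beta


def r_squared(X, y, beta):
    """1 - (residual sum of squares) / (total sum of squares)."""
    pred = np.column_stack([np.ones(len(X)), X]) @ beta
    return 1.0 - np.sum((y - pred) ** 2) / np.sum((y - y.mean()) ** 2)


# Made-up data lying exactly on y = 2 + 3x, so the fit is perfect.
x = np.array([0.0, 1.0, 2.0, 3.0]).reshape(-1, 1)
y = 2.0 + 3.0 * x[:, 0]

beta = fit_linear(x, y)
print(f"Y ~ {beta[0]:.6f} + {beta[1]:.6f} * X")
print("R2:", r_squared(x, y, beta))
```

The same two helpers work unchanged for several predictors: pass a matrix with one column per predictor, as in the multi-variable model of Task 1b.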

City Task 1b:

CRIME_TOTALS ~ -35.784745 + -0.347343 * GRAFFITI + 3.596555 * POT_HOLES + -0.143517 * RODENTS + 4.214673 * GARBAGE + 2.446765 * STREET_LIGHTS + -4.148366 * TREE_DEBRIS + 5.724136 * ABANDONED_BUILDINGS
R2: 0.8909173620789893

City Task 2:

CRIME_TOTALS ~ -36.151629 + 3.300180 * POT_HOLES + 7.129337 * ABANDONED_BUILDINGS
R2: 0.8580580940940485
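
The Task 2 model uses two predictors. The text here does not say how the pair was chosen. If the intent is the pair with the highest R2, an exhaustive search over all pairs is one way to find it. The sketch below works under that assumption; r2_for and best_pair are hypothetical names and the data is synthetic.

```python
from itertools import combinations

import numpy as np


def r2_for(X, y, cols):
    """R2 of the least-squares model that uses only the given columns."""
    A = np.column_stack([np.ones(len(X)), X[:, list(cols)]])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ beta
    return 1.0 - resid @ resid / np.sum((y - y.mean()) ** 2)


def best_pair(X, y):
    """Try every pair of columns and keep the one with the highest R2."""
    pairs = combinations(range(X.shape[1]), 2)
    return max(pairs, key=lambda cols: r2_for(X, y, cols))


# Synthetic data in which y depends only on columns 1 and 3.
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 4))
y = 1.0 + 2.0 * X[:, 1] - 3.0 * X[:, 3]

pair = best_pair(X, y)
```

With seven predictors there are only 21 pairs, so exhaustive search is cheap at this size.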

City Task 3:

CRIME_TOTALS ~ 308.489056 + 10.338500 * ABANDONED_BUILDINGS
R2: 0.6897288976957778

CRIME_TOTALS ~ -36.151629 + 3.300180 * POT_HOLES + 7.129337 * ABANDONED_BUILDINGS
R2: 0.8580580940940485

CRIME_TOTALS ~ -53.303574 + -0.213704 * GRAFFITI + 3.948901 * POT_HOLES + 6.769038 * ABANDONED_BUILDINGS
R2: 0.8650034618337505

CRIME_TOTALS ~ -29.057833 + -0.386986 * GRAFFITI + 5.057974 * POT_HOLES + -3.424232 * TREE_DEBRIS + 7.226820 * ABANDONED_BUILDINGS
R2: 0.8799155180187794

CRIME_TOTALS ~ -22.991702 + -0.337971 * GRAFFITI + 3.900442 * POT_HOLES + 5.033985 * GARBAGE + -3.433079 * TREE_DEBRIS + 6.078619 * ABANDONED_BUILDINGS
R2: 0.8877056719368024

CRIME_TOTALS ~ -35.457501 + -0.348926 * GRAFFITI + 3.532662 * POT_HOLES + 4.058232 * GARBAGE + 2.554864 * STREET_LIGHTS + -4.135113 * TREE_DEBRIS + 5.688046 * ABANDONED_BUILDINGS
R2: 0.8908748485824084

CRIME_TOTALS ~ -35.784745 + -0.347343 * GRAFFITI + 3.596555 * POT_HOLES + -0.143517 * RODENTS + 4.214673 * GARBAGE + 2.446765 * STREET_LIGHTS + -4.148366 * TREE_DEBRIS + 5.724136 * ABANDONED_BUILDINGS
R2: 0.8909173620789893
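
The seven Task 3 models are nested: each adds one variable to the previous model, and the last is the full model from Task 1b. The text does not say how they were chosen. The one-variable model is not the best single predictor from Task 1a (GARBAGE has a higher R2), so the sequence does not look like forward selection. One procedure that yields a nested sequence like this is backward elimination: start with all predictors and repeatedly drop the one whose removal leaves the highest R2. The sketch below assumes that reading; the function names are hypothetical and the data is synthetic.

```python
import numpy as np


def r2_for(X, y, cols):
    """R2 of the least-squares model that uses only the given columns."""
    A = np.column_stack([np.ones(len(X)), X[:, list(cols)]])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ beta
    return 1.0 - resid @ resid / np.sum((y - y.mean()) ** 2)


def backward_elimination(X, y):
    """Return the column sets visited, smallest model first."""
    cols = list(range(X.shape[1]))
    path = [tuple(cols)]
    while len(cols) > 1:
        # Drop the column whose removal leaves the highest R2.
        drop = max(cols, key=lambda c: r2_for(X, y, [k for k in cols if k != c]))
        cols.remove(drop)
        path.append(tuple(cols))
    return path[::-1]


# Synthetic data: column 0 matters most, column 2 a little, 1 and 3 not at all.
rng = np.random.default_rng(0)
X = rng.normal(size=(60, 4))
y = 10.0 * X[:, 0] + 1.0 * X[:, 2]

path = backward_elimination(X, y)
```

Here path[0] is the one-variable model and path[-1] is the full model, which matches the order of the listing above.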

City Task 4:

CRIME_TOTALS ~ -35.457501 + -0.348926 * GRAFFITI + 3.532662 * POT_HOLES + 4.058232 * GARBAGE + 2.554864 * STREET_LIGHTS + -4.135113 * TREE_DEBRIS + 5.688046 * ABANDONED_BUILDINGS
R2: 0.8908748485824084
Adjusted R2: 0.8875512399097913
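
Adjusted R2 penalizes R2 for the number of predictors. A minimal sketch using the standard definition, 1 - (1 - R2) * (N - 1) / (N - K - 1), where N is the number of rows used to fit the model and K is the number of predictors. The function name is hypothetical, and N = 204 is an inference, not a figure given in the text.

```python
def adjusted_r2(r2, n_rows, n_predictors):
    """Standard adjusted R2 for a model with K predictors fit on N rows."""
    return 1.0 - (1.0 - r2) * (n_rows - 1) / (n_rows - n_predictors - 1)


# R2 from Task 4 with K = 6 predictors; N = 204 is inferred, not given.
value = adjusted_r2(0.8908748485824084, n_rows=204, n_predictors=6)
```

With these inputs the result agrees with the reported Adjusted R2 to about nine decimal places, which suggests the listing uses this definition with 204 training rows.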

City Task 5:

CRIME_TOTALS ~ -35.457501 + -0.348926 * GRAFFITI + 3.532662 * POT_HOLES + 4.058232 * GARBAGE + 2.554864 * STREET_LIGHTS + -4.135113 * TREE_DEBRIS + 5.688046 * ABANDONED_BUILDINGS
Training R2: 0.8908748485824084
Testing R2: 0.8084939761877112
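
Task 5 reports R2 for the same model on two sets of rows. A minimal sketch of that evaluation: fit the coefficients on the training rows only, then compute R2 on each set with those coefficients. The splitting procedure shown (shuffle with the seed, cut at the training fraction) is an assumption, so it is not expected to reproduce the numbers above. The data is synthetic and the helper names are hypothetical.

```python
import numpy as np

TRAINING_FRACTION = 0.55  # "training_fraction" in the Parameters block
SEED = 22992              # "seed" in the Parameters block


def design(X):
    """Prepend a column of ones for the intercept."""
    return np.column_stack([np.ones(len(X)), X])


def r_squared(X, y, beta):
    resid = y - design(X) @ beta
    return 1.0 - resid @ resid / np.sum((y - y.mean()) ** 2)


# Synthetic stand-in for the city table: 200 rows, 3 predictors, some noise.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 3))
y = 4.0 + X @ np.array([1.0, -2.0, 0.5]) + rng.normal(scale=0.5, size=200)

# Assumed split: shuffle the row indices with the seed, then cut.
order = np.random.default_rng(SEED).permutation(len(X))
n_train = int(round(TRAINING_FRACTION * len(X)))
train, test = order[:n_train], order[n_train:]

# Fit on the training rows only, then score both sets with the same beta.
beta, *_ = np.linalg.lstsq(design(X[train]), y[train], rcond=None)
train_r2 = r_squared(X[train], y[train], beta)
test_r2 = r_squared(X[test], y[test], beta)
```

Testing R2 is usually somewhat lower than training R2, as in the Task 5 output, because the coefficients were tuned to the training rows.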