Dataset: City

The original data from Chicago’s data portal contains detailed information for each crime and call to 311. We have split the city up into regions using a simple grid and have aggregated this data by region.

Each city data file contains data for different types of complaints (that is, calls to 311) and the total amount of crimes on a per-region basis. The first row in the file contains column labels, for example, GRAFFITI or POT_HOLES. Subsequent rows contain data for different regions of the city. A column contains data for a given variable across all the rows. For example, the column with index 1 (the second column) contains the number of calls about pot holes for each region. In addition to information about specific types of complaints, the file also has one column that contains the total number of crimes in each region.

File paths
data/city
Parameters
{"name": "City",
 "feature_vars": [0, 1, 2, 3, 4, 5, 6],
 "target_var": 7,
 "training_fraction": 0.55,
 "seed": 22992}

Results without feature standardization

Task 2a

CRIME_TOTALS ~ 575.687669 + 0.678349 * GRAFFITI
R2: 0.1402749161031347
CRIME_TOTALS ~ -22.208880 + 5.375417 * POT_HOLES
R2: 0.6229070858532733
CRIME_TOTALS ~ 227.414583 + 7.711958 * RODENTS
R2: 0.5575360783921093
CRIME_TOTALS ~ 11.553128 + 18.892669 * GARBAGE
R2: 0.7831498392992613
CRIME_TOTALS ~ -65.954319 + 13.447459 * STREET_LIGHTS
R2: 0.7198560514392485
CRIME_TOTALS ~ 297.222082 + 10.324616 * TREE_DEBRIS
R2: 0.3265907948681832
CRIME_TOTALS ~ 308.489056 + 10.338500 * ABANDONED_BUILDINGS
R2: 0.6897288976957777

Task 2b

CRIME_TOTALS ~ -35.784745 + -0.347343 * GRAFFITI + 3.596555 * POT_HOLES + -0.143517 * RODENTS + 4.214673 * GARBAGE + 2.446765 * STREET_LIGHTS + -4.148366 * TREE_DEBRIS + 5.724136 * ABANDONED_BUILDINGS
R2: 0.8909173620789894

Task 3

CRIME_TOTALS ~ -36.151629 + 3.300180 * POT_HOLES + 7.129337 * ABANDONED_BUILDINGS
R2: 0.8580580940940485

Task 4

CRIME_TOTALS ~ 11.553128 + 18.892669 * GARBAGE
R2: 0.7831498392992613
CRIME_TOTALS ~ 45.293237 + 12.805205 * GARBAGE + 4.703679 * ABANDONED_BUILDINGS
R2: 0.8446128237564032
CRIME_TOTALS ~ -33.819789 + 5.772113 * GARBAGE + 5.715802 * ABANDONED_BUILDINGS + 2.141799 * POT_HOLES
R2: 0.868789563474036
CRIME_TOTALS ~ -12.197132 + 6.237088 * GARBAGE + 6.023250 * ABANDONED_BUILDINGS + 2.376052 * POT_HOLES + -1.923758 * TREE_DEBRIS
R2: 0.8748678501334366
CRIME_TOTALS ~ -22.991702 + 5.033985 * GARBAGE + 6.078619 * ABANDONED_BUILDINGS + 3.900442 * POT_HOLES + -3.433079 * TREE_DEBRIS + -0.337971 * GRAFFITI
R2: 0.8877056719368024
CRIME_TOTALS ~ -35.457501 + 4.058232 * GARBAGE + 5.688046 * ABANDONED_BUILDINGS + 3.532662 * POT_HOLES + -4.135113 * TREE_DEBRIS + -0.348926 * GRAFFITI + 2.554864 * STREET_LIGHTS
R2: 0.8908748485824085
CRIME_TOTALS ~ -35.784745 + 4.214673 * GARBAGE + 5.724136 * ABANDONED_BUILDINGS + 3.596555 * POT_HOLES + -4.148366 * TREE_DEBRIS + -0.347343 * GRAFFITI + 2.446765 * STREET_LIGHTS + -0.143517 * RODENTS
R2: 0.8909173620789894

Task 5

CRIME_TOTALS ~ 11.553128 + 18.892669 * GARBAGE
Training R2: 0.7831498392992613
Testing R2: 0.6434115580911255
CRIME_TOTALS ~ 45.293237 + 12.805205 * GARBAGE + 4.703679 * ABANDONED_BUILDINGS
Training R2: 0.8446128237564032
Testing R2: 0.6438660402149637
CRIME_TOTALS ~ -33.819789 + 5.772113 * GARBAGE + 5.715802 * ABANDONED_BUILDINGS + 2.141799 * POT_HOLES
Training R2: 0.868789563474036
Testing R2: 0.7529302078607409
CRIME_TOTALS ~ -12.197132 + 6.237088 * GARBAGE + 6.023250 * ABANDONED_BUILDINGS + 2.376052 * POT_HOLES + -1.923758 * TREE_DEBRIS
Training R2: 0.8748678501334366
Testing R2: 0.7856436543592813
CRIME_TOTALS ~ -22.991702 + 5.033985 * GARBAGE + 6.078619 * ABANDONED_BUILDINGS + 3.900442 * POT_HOLES + -3.433079 * TREE_DEBRIS + -0.337971 * GRAFFITI
Training R2: 0.8877056719368024
Testing R2: 0.798759490917067
CRIME_TOTALS ~ -35.457501 + 4.058232 * GARBAGE + 5.688046 * ABANDONED_BUILDINGS + 3.532662 * POT_HOLES + -4.135113 * TREE_DEBRIS + -0.348926 * GRAFFITI + 2.554864 * STREET_LIGHTS
Training R2: 0.8908748485824085
Testing R2: 0.8084939761877116
CRIME_TOTALS ~ -35.784745 + 4.214673 * GARBAGE + 5.724136 * ABANDONED_BUILDINGS + 3.596555 * POT_HOLES + -4.148366 * TREE_DEBRIS + -0.347343 * GRAFFITI + 2.446765 * STREET_LIGHTS + -0.143517 * RODENTS
Training R2: 0.8909173620789894
Testing R2: 0.8089304745948849

Results with feature standardization

Task 2a

CRIME_TOTALS ~ -0.038230 + 0.368050 * GRAFFITI
R2: 0.1402749161031347
CRIME_TOTALS ~ 0.005853 + 0.819536 * POT_HOLES
R2: 0.6229070858532733
CRIME_TOTALS ~ -0.010892 + 0.775836 * RODENTS
R2: 0.5575360783921093
CRIME_TOTALS ~ -0.009588 + 0.899974 * GARBAGE
R2: 0.7831498392992614
CRIME_TOTALS ~ -0.025762 + 0.817677 * STREET_LIGHTS
R2: 0.7198560514392485
CRIME_TOTALS ~ -0.032871 + 0.580104 * TREE_DEBRIS
R2: 0.3265907948681832
CRIME_TOTALS ~ -0.070962 + 0.755882 * ABANDONED_BUILDINGS
R2: 0.6897288976957778

Task 2b

CRIME_TOTALS ~ -0.023183 + -0.188457 * GRAFFITI + 0.548331 * POT_HOLES + -0.014438 * RODENTS + 0.200771 * GARBAGE + 0.148776 * STREET_LIGHTS + -0.233082 * TREE_DEBRIS + 0.418511 * ABANDONED_BUILDINGS
R2: 0.8909173620789893

Task 3

CRIME_TOTALS ~ -0.031424 + 0.503146 * POT_HOLES + 0.521250 * ABANDONED_BUILDINGS
R2: 0.8580580940940485

Task 4

CRIME_TOTALS ~ -0.009588 + 0.899974 * GARBAGE
R2: 0.7831498392992614
CRIME_TOTALS ~ -0.032697 + 0.609991 * GARBAGE + 0.343902 * ABANDONED_BUILDINGS
R2: 0.8446128237564032
CRIME_TOTALS ~ -0.028054 + 0.274961 * GARBAGE + 0.417901 * ABANDONED_BUILDINGS + 0.326539 * POT_HOLES
R2: 0.868789563474036
CRIME_TOTALS ~ -0.028073 + 0.297111 * GARBAGE + 0.440380 * ABANDONED_BUILDINGS + 0.362253 * POT_HOLES + -0.108089 * TREE_DEBRIS
R2: 0.8748678501334366
CRIME_TOTALS ~ -0.021549 + 0.239800 * GARBAGE + 0.444428 * ABANDONED_BUILDINGS + 0.594661 * POT_HOLES + -0.192893 * TREE_DEBRIS + -0.183372 * GRAFFITI
R2: 0.8877056719368024
CRIME_TOTALS ~ -0.023199 + 0.193319 * GARBAGE + 0.415872 * ABANDONED_BUILDINGS + 0.538590 * POT_HOLES + -0.232338 * TREE_DEBRIS + -0.189316 * GRAFFITI + 0.155349 * STREET_LIGHTS
R2: 0.8908748485824085
CRIME_TOTALS ~ -0.023183 + 0.200771 * GARBAGE + 0.418511 * ABANDONED_BUILDINGS + 0.548331 * POT_HOLES + -0.233082 * TREE_DEBRIS + -0.188457 * GRAFFITI + 0.148776 * STREET_LIGHTS + -0.014438 * RODENTS
R2: 0.8909173620789894

Task 5

CRIME_TOTALS ~ -0.009588 + 0.899974 * GARBAGE
Training R2: 0.7831498392992614
Testing R2: 0.6434115580911255
CRIME_TOTALS ~ -0.032697 + 0.609991 * GARBAGE + 0.343902 * ABANDONED_BUILDINGS
Training R2: 0.8446128237564032
Testing R2: 0.6438660402149637
CRIME_TOTALS ~ -0.028054 + 0.274961 * GARBAGE + 0.417901 * ABANDONED_BUILDINGS + 0.326539 * POT_HOLES
Training R2: 0.868789563474036
Testing R2: 0.752930207860741
CRIME_TOTALS ~ -0.028073 + 0.297111 * GARBAGE + 0.440380 * ABANDONED_BUILDINGS + 0.362253 * POT_HOLES + -0.108089 * TREE_DEBRIS
Training R2: 0.8748678501334366
Testing R2: 0.7856436543592813
CRIME_TOTALS ~ -0.021549 + 0.239800 * GARBAGE + 0.444428 * ABANDONED_BUILDINGS + 0.594661 * POT_HOLES + -0.192893 * TREE_DEBRIS + -0.183372 * GRAFFITI
Training R2: 0.8877056719368024
Testing R2: 0.7987594909170666
CRIME_TOTALS ~ -0.023199 + 0.193319 * GARBAGE + 0.415872 * ABANDONED_BUILDINGS + 0.538590 * POT_HOLES + -0.232338 * TREE_DEBRIS + -0.189316 * GRAFFITI + 0.155349 * STREET_LIGHTS
Training R2: 0.8908748485824085
Testing R2: 0.8084939761877113
CRIME_TOTALS ~ -0.023183 + 0.200771 * GARBAGE + 0.418511 * ABANDONED_BUILDINGS + 0.548331 * POT_HOLES + -0.233082 * TREE_DEBRIS + -0.188457 * GRAFFITI + 0.148776 * STREET_LIGHTS + -0.014438 * RODENTS
Training R2: 0.8909173620789894
Testing R2: 0.808930474594885