Dataset: City¶
The original data from Chicago’s data portal contains detailed information for each crime and call to 311. We have split the city up into regions using a simple grid and have aggregated this data by region.
Each city data file contains data for different types of complaints
(that is, calls to 311) and the total amount of crimes on a per-region
basis. The first row in the file contains column labels, for example,
GRAFFITI
or POT_HOLES
. Subsequent rows contain data for
different regions of the city. A column contains data for a given
variable across all the rows. For example, the column with index 1 (the
second column) contains the number of calls about pot holes for each
region. In addition to information about specific types of
complaints, the file also has one column that contains the total
number of crimes in each region.
- File paths
data/city
- Parameters
{"name": "City", "feature_vars": [0, 1, 2, 3, 4, 5, 6], "target_var": 7, "training_fraction": 0.55, "seed": 22992}
Results without feature standardization¶
- Task 2a
CRIME_TOTALS ~ 575.687669 + 0.678349 * GRAFFITI
R2: 0.1402749161031347
CRIME_TOTALS ~ -22.208880 + 5.375417 * POT_HOLES
R2: 0.6229070858532733
CRIME_TOTALS ~ 227.414583 + 7.711958 * RODENTS
R2: 0.5575360783921093
CRIME_TOTALS ~ 11.553128 + 18.892669 * GARBAGE
R2: 0.7831498392992613
CRIME_TOTALS ~ -65.954319 + 13.447459 * STREET_LIGHTS
R2: 0.7198560514392485
CRIME_TOTALS ~ 297.222082 + 10.324616 * TREE_DEBRIS
R2: 0.3265907948681832
CRIME_TOTALS ~ 308.489056 + 10.338500 * ABANDONED_BUILDINGS
R2: 0.6897288976957777
- Task 2b
CRIME_TOTALS ~ -35.784745 + -0.347343 * GRAFFITI + 3.596555 * POT_HOLES + -0.143517 * RODENTS + 4.214673 * GARBAGE + 2.446765 * STREET_LIGHTS + -4.148366 * TREE_DEBRIS + 5.724136 * ABANDONED_BUILDINGS
R2: 0.8909173620789894
- Task 3
CRIME_TOTALS ~ -36.151629 + 3.300180 * POT_HOLES + 7.129337 * ABANDONED_BUILDINGS
R2: 0.8580580940940485
- Task 4
CRIME_TOTALS ~ 11.553128 + 18.892669 * GARBAGE
R2: 0.7831498392992613
CRIME_TOTALS ~ 45.293237 + 12.805205 * GARBAGE + 4.703679 * ABANDONED_BUILDINGS
R2: 0.8446128237564032
CRIME_TOTALS ~ -33.819789 + 5.772113 * GARBAGE + 5.715802 * ABANDONED_BUILDINGS + 2.141799 * POT_HOLES
R2: 0.868789563474036
CRIME_TOTALS ~ -12.197132 + 6.237088 * GARBAGE + 6.023250 * ABANDONED_BUILDINGS + 2.376052 * POT_HOLES + -1.923758 * TREE_DEBRIS
R2: 0.8748678501334366
CRIME_TOTALS ~ -22.991702 + 5.033985 * GARBAGE + 6.078619 * ABANDONED_BUILDINGS + 3.900442 * POT_HOLES + -3.433079 * TREE_DEBRIS + -0.337971 * GRAFFITI
R2: 0.8877056719368024
CRIME_TOTALS ~ -35.457501 + 4.058232 * GARBAGE + 5.688046 * ABANDONED_BUILDINGS + 3.532662 * POT_HOLES + -4.135113 * TREE_DEBRIS + -0.348926 * GRAFFITI + 2.554864 * STREET_LIGHTS
R2: 0.8908748485824085
CRIME_TOTALS ~ -35.784745 + 4.214673 * GARBAGE + 5.724136 * ABANDONED_BUILDINGS + 3.596555 * POT_HOLES + -4.148366 * TREE_DEBRIS + -0.347343 * GRAFFITI + 2.446765 * STREET_LIGHTS + -0.143517 * RODENTS
R2: 0.8909173620789894
- Task 5
CRIME_TOTALS ~ 11.553128 + 18.892669 * GARBAGE
Training R2: 0.7831498392992613
Testing R2: 0.6434115580911255
CRIME_TOTALS ~ 45.293237 + 12.805205 * GARBAGE + 4.703679 * ABANDONED_BUILDINGS
Training R2: 0.8446128237564032
Testing R2: 0.6438660402149637
CRIME_TOTALS ~ -33.819789 + 5.772113 * GARBAGE + 5.715802 * ABANDONED_BUILDINGS + 2.141799 * POT_HOLES
Training R2: 0.868789563474036
Testing R2: 0.7529302078607409
CRIME_TOTALS ~ -12.197132 + 6.237088 * GARBAGE + 6.023250 * ABANDONED_BUILDINGS + 2.376052 * POT_HOLES + -1.923758 * TREE_DEBRIS
Training R2: 0.8748678501334366
Testing R2: 0.7856436543592813
CRIME_TOTALS ~ -22.991702 + 5.033985 * GARBAGE + 6.078619 * ABANDONED_BUILDINGS + 3.900442 * POT_HOLES + -3.433079 * TREE_DEBRIS + -0.337971 * GRAFFITI
Training R2: 0.8877056719368024
Testing R2: 0.798759490917067
CRIME_TOTALS ~ -35.457501 + 4.058232 * GARBAGE + 5.688046 * ABANDONED_BUILDINGS + 3.532662 * POT_HOLES + -4.135113 * TREE_DEBRIS + -0.348926 * GRAFFITI + 2.554864 * STREET_LIGHTS
Training R2: 0.8908748485824085
Testing R2: 0.8084939761877116
CRIME_TOTALS ~ -35.784745 + 4.214673 * GARBAGE + 5.724136 * ABANDONED_BUILDINGS + 3.596555 * POT_HOLES + -4.148366 * TREE_DEBRIS + -0.347343 * GRAFFITI + 2.446765 * STREET_LIGHTS + -0.143517 * RODENTS
Training R2: 0.8909173620789894
Testing R2: 0.8089304745948849
Results with feature standardization¶
- Task 2a
CRIME_TOTALS ~ -0.038230 + 0.368050 * GRAFFITI
R2: 0.1402749161031347
CRIME_TOTALS ~ 0.005853 + 0.819536 * POT_HOLES
R2: 0.6229070858532733
CRIME_TOTALS ~ -0.010892 + 0.775836 * RODENTS
R2: 0.5575360783921093
CRIME_TOTALS ~ -0.009588 + 0.899974 * GARBAGE
R2: 0.7831498392992614
CRIME_TOTALS ~ -0.025762 + 0.817677 * STREET_LIGHTS
R2: 0.7198560514392485
CRIME_TOTALS ~ -0.032871 + 0.580104 * TREE_DEBRIS
R2: 0.3265907948681832
CRIME_TOTALS ~ -0.070962 + 0.755882 * ABANDONED_BUILDINGS
R2: 0.6897288976957778
- Task 2b
CRIME_TOTALS ~ -0.023183 + -0.188457 * GRAFFITI + 0.548331 * POT_HOLES + -0.014438 * RODENTS + 0.200771 * GARBAGE + 0.148776 * STREET_LIGHTS + -0.233082 * TREE_DEBRIS + 0.418511 * ABANDONED_BUILDINGS
R2: 0.8909173620789893
- Task 3
CRIME_TOTALS ~ -0.031424 + 0.503146 * POT_HOLES + 0.521250 * ABANDONED_BUILDINGS
R2: 0.8580580940940485
- Task 4
CRIME_TOTALS ~ -0.009588 + 0.899974 * GARBAGE
R2: 0.7831498392992614
CRIME_TOTALS ~ -0.032697 + 0.609991 * GARBAGE + 0.343902 * ABANDONED_BUILDINGS
R2: 0.8446128237564032
CRIME_TOTALS ~ -0.028054 + 0.274961 * GARBAGE + 0.417901 * ABANDONED_BUILDINGS + 0.326539 * POT_HOLES
R2: 0.868789563474036
CRIME_TOTALS ~ -0.028073 + 0.297111 * GARBAGE + 0.440380 * ABANDONED_BUILDINGS + 0.362253 * POT_HOLES + -0.108089 * TREE_DEBRIS
R2: 0.8748678501334366
CRIME_TOTALS ~ -0.021549 + 0.239800 * GARBAGE + 0.444428 * ABANDONED_BUILDINGS + 0.594661 * POT_HOLES + -0.192893 * TREE_DEBRIS + -0.183372 * GRAFFITI
R2: 0.8877056719368024
CRIME_TOTALS ~ -0.023199 + 0.193319 * GARBAGE + 0.415872 * ABANDONED_BUILDINGS + 0.538590 * POT_HOLES + -0.232338 * TREE_DEBRIS + -0.189316 * GRAFFITI + 0.155349 * STREET_LIGHTS
R2: 0.8908748485824085
CRIME_TOTALS ~ -0.023183 + 0.200771 * GARBAGE + 0.418511 * ABANDONED_BUILDINGS + 0.548331 * POT_HOLES + -0.233082 * TREE_DEBRIS + -0.188457 * GRAFFITI + 0.148776 * STREET_LIGHTS + -0.014438 * RODENTS
R2: 0.8909173620789894
- Task 5
CRIME_TOTALS ~ -0.009588 + 0.899974 * GARBAGE
Training R2: 0.7831498392992614
Testing R2: 0.6434115580911255
CRIME_TOTALS ~ -0.032697 + 0.609991 * GARBAGE + 0.343902 * ABANDONED_BUILDINGS
Training R2: 0.8446128237564032
Testing R2: 0.6438660402149637
CRIME_TOTALS ~ -0.028054 + 0.274961 * GARBAGE + 0.417901 * ABANDONED_BUILDINGS + 0.326539 * POT_HOLES
Training R2: 0.868789563474036
Testing R2: 0.752930207860741
CRIME_TOTALS ~ -0.028073 + 0.297111 * GARBAGE + 0.440380 * ABANDONED_BUILDINGS + 0.362253 * POT_HOLES + -0.108089 * TREE_DEBRIS
Training R2: 0.8748678501334366
Testing R2: 0.7856436543592813
CRIME_TOTALS ~ -0.021549 + 0.239800 * GARBAGE + 0.444428 * ABANDONED_BUILDINGS + 0.594661 * POT_HOLES + -0.192893 * TREE_DEBRIS + -0.183372 * GRAFFITI
Training R2: 0.8877056719368024
Testing R2: 0.7987594909170666
CRIME_TOTALS ~ -0.023199 + 0.193319 * GARBAGE + 0.415872 * ABANDONED_BUILDINGS + 0.538590 * POT_HOLES + -0.232338 * TREE_DEBRIS + -0.189316 * GRAFFITI + 0.155349 * STREET_LIGHTS
Training R2: 0.8908748485824085
Testing R2: 0.8084939761877113
CRIME_TOTALS ~ -0.023183 + 0.200771 * GARBAGE + 0.418511 * ABANDONED_BUILDINGS + 0.548331 * POT_HOLES + -0.233082 * TREE_DEBRIS + -0.188457 * GRAFFITI + 0.148776 * STREET_LIGHTS + -0.014438 * RODENTS
Training R2: 0.8909173620789894
Testing R2: 0.808930474594885