Dataset: CityΒΆ

The original data from Chicago’s data portal contains detailed information for each crime and call to 311. We have split the city up into regions using a simple grid and have aggregated this data by region.

Each city data file contains data for different types of complaints (that is, calls to 311) and the total amount of crimes on a per region basis. The first row in the file contains column labels, for example, GRAFFITI or POT_HOLES. Subsequent rows contain data for different regions of the city. A column contains data for a given variable across all the rows. For example, column with index 1 (the second column) contains the number of calls about pot holes for each region. In addition to information about specific types of complaints, the file also has one column that contains the total number crimes in each region.

Dependent variable:
 

Total number of crimes in a region. The column name is TOTAL_CRIMES and its column index is defined in CRIME_TOTAL_COL.

Predictor variables:
 

Complaint variables. The first 7 columns, with indices defined in COMPLAINT_COLS.

File paths:
data/city/training.csv
data/city/testing.csv
Task 1 expected output:
 

CITY Task 1a:
GRAFFITI: 0.11
POT_HOLES: 0.52
RODENTS: 0.43
GARBAGE: 0.65
STREET_LIGHTS: 0.62
TREE_DEBRIS: 0.14
ABANDONED_BUILDINGS: 0.52


CITY Task 1b:
GRAFFITI, POT HOLES, RODENTS, GARBAGE, STREET LIGHTS, TREE DEBRIS, ABANDONED BUILDINGS R2:0.83

Task 2 expected output:
 

CITY Task 2:

POT HOLES, ABANDONED BUILDINGS R2:0.77

Task 3 expected output:
 

CITY Task 3a:
GARBAGE R2:0.65
GARBAGE, ABANDONED BUILDINGS R2:0.70
GARBAGE, ABANDONED BUILDINGS, POT HOLES R2:0.78
GARBAGE, ABANDONED BUILDINGS, POT HOLES, TREE DEBRIS R2:0.81
GARBAGE, ABANDONED BUILDINGS, POT HOLES, TREE DEBRIS, GRAFFITI R2:0.82
GARBAGE, ABANDONED BUILDINGS, POT HOLES, TREE DEBRIS, GRAFFITI, STREET LIGHTS R2:0.83
GARBAGE, ABANDONED BUILDINGS, POT HOLES, TREE DEBRIS, GRAFFITI, STREET LIGHTS, RODENTS R2:0.83


CITY Task 3b:
Threshold 0.1: GARBAGE R2:0.65
Threshold 0.01: GARBAGE, ABANDONED BUILDINGS, POT HOLES, TREE DEBRIS, GRAFFITI R2:0.82

Task 4 expected output:
 

CITY Task 4:
GARBAGE 0.48
GARBAGE, ABANDONED BUILDINGS 0.39
GARBAGE, ABANDONED BUILDINGS, POT HOLES 0.66
GARBAGE, ABANDONED BUILDINGS, POT HOLES, TREE DEBRIS 0.78
GARBAGE, ABANDONED BUILDINGS, POT HOLES, TREE DEBRIS, GRAFFITI 0.78
GARBAGE, ABANDONED BUILDINGS, POT HOLES, TREE DEBRIS, GRAFFITI, STREET LIGHTS 0.74
GARBAGE, ABANDONED BUILDINGS, POT HOLES, TREE DEBRIS, GRAFFITI, STREET LIGHTS, RODENTS 0.74