MPCS 53017: Revised project proposal
Due on Monday, February 10, 2014
Please list the names and CS usernames of everyone on your team.
You will need to designate one of your phoenixforge repositories as the
primary one. Please email the instructor your choice as soon as possible,
along with the usernames of all team members, so we can adjust the
repository access permissions. Everyone on your team will be able to
commit to the designated repository.
Your revised proposal should discuss several new issues in addition to
the points covered in your draft proposal, and should address any
comments and questions raised in the feedback on your draft submission.
- Project name
- An overview of the proposed project.
- What are the key question(s) that you would like to answer by
building the proposed data warehouse? Please be as specific as
possible, and list up to 10 questions. For example, analyzing crime
in Chicago is too general, while exploring the relationship between
weather patterns (precipitation, temperature, wind, etc.) and
crime is right on point.
- Who would be interested in the answers to the questions above?
You can list organizations, companies, types of consumers, etc. For
example, both the Chicago Police Department and the people who live and work
in Chicago would be interested in knowing the relationships between
weather and crime in the city.
- What executives and analysts would you like to interview in order
to refine your key questions? For example, you may want to interview
both the CEO of Netflix and their analysts if your project aims to help
Netflix provide better service for its customers.
- What are the data sources that you would like to use? Please be
as exhaustive as possible, but prioritize the data sources, since you
will likely end up using only a few of your top choices. Please
include specific datasets such as City of Chicago: Crimes - 2001 to present.
- For each data source, list some details such as whether it is
available via an API or as a flat file, its size in terms of number of
tuples and disk volume, and any limitations. For example, the crime
dataset contains 5M+ records, is about 1.7GB, can be downloaded as a
CSV file, and is also available via an API.
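
As a rough illustration of how the size details above might be gathered for a flat-file source, the following Python sketch counts the records and reports the on-disk size of a downloaded CSV file. The function name profile_csv is hypothetical, and the sketch assumes the file is UTF-8 encoded and has a single header row.

```python
import csv
import os


def profile_csv(path):
    """Return (record_count, size_in_bytes) for a CSV file.

    Assumes the first row is a header and is not counted as a record.
    """
    size_bytes = os.path.getsize(path)
    # newline="" lets the csv module handle quoted fields that span lines.
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        next(reader, None)  # skip the assumed header row
        records = sum(1 for _ in reader)
    return records, size_bytes
```

For instance, calling profile_csv("crimes.csv") on a local download (the file name is a placeholder) would return the two numbers to report for that source. The file is streamed row by row, so a multi-gigabyte file does not need to fit in memory.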
Please avoid any data sources that require screen-scraping
(extracting the data from the HTML pages of a web site), and only list
data sources that are publicly available.
You will submit your proposal via phoenixforge. Please create a
new subdirectory named reports in your designated repository
and add your revised proposal to this subdirectory. Don't forget to
commit both the directory and the proposal.