Proposal

A key feature of this course is applying learning to a real-world dataset. The project takes you through all the steps of a typical data analysis project over the course of the semester. There are three deliverables for this assignment spread over the term. This description refers to the first deliverable.

Detailedinstructions for the first deliverable are listed below:

Detailed Instructions

During the live session in Module 1, I provided guidance on selecting a good dataset for your project. Data is plentiful these days, and a quick online search will yield many options. You can also use data from work or life. If you feel passionate about the data, you will have a great project.

Your project proposal should address three questions:

1.  describe the dataset you would like to use, including the source of the data, how it was collected (if known), what the columns represent, and the size of the dataset

2.  outline the question you would like to answer using the data

3.  provide a basic diagnosis of data problems, using the tools you learned in Module 1. In the project deliverable #2 (Midterm), you will fix these issues. There is no need to fix your data for this assignment.

The proposal should be 1 to 3 pages of text, plus an Appendix that includes any JMP output you created for question #3. The write-up should be in PDF/Word/slides format. You should also upload a copy of the dataset, and any supporting materials such as a data dictionary.

** Important: here are formatting guidelines you should follow, which should be standard in a workplace.

1. You must include your name on the first page of your write-up, as well as in the name of the file you upload.

2. Include page numbers

3. Make sure you explain any acronyms or industry jargon. Remember that your instructor/facilitator may not know much about the context of your dataset.

4. If you are including images of data, make sure the font size is humanly readable.

Assessment

The course project is an individual assignment. The project proposal will be marked complete or incomplete. See the grading rubric below.

This step is to make sure that you receive permission from the instructor or facilitator before you proceed with the other project-related assignments. We provide constructive feedback to help smooth your path through the project. You must submit to this assignment page by the due date and time listed on the top of this page.

Rubric

APANK5200 Proposal Rubric (1)

APANK5200 Proposal Rubric (1) /
Criteria / Ratings / Pts /
Sought permission before advancing with the project. / 5.0pts
Met instructions for submission. / 5.0pts
Total Points:10.0

PreviousNext

It happened that I came across these pages that potentially can have the datasets that you can leverage for the course project.

https://vincentarelbundock.github.io/Rdatasets/datasets.html

http://bigdata-madesimple.com/70-websites-to-get-large-data-repositories-for-free/