diff --git a/proposal.md b/proposal.md index 2586c0c..b299894 100644 --- a/proposal.md +++ b/proposal.md @@ -4,36 +4,45 @@ This planning document will also form the introduction of your argument. ## Overarching Question +What are the least popular birth dates in the United States? ### What central question are you interested in exploring? Why are you interested in exploring this question? -*This should be the big picture question that you ask; use at least 5 -sentences to describe why you are interested in it.* +Friday the 13th is always seen as an unlucky day. Many parents may be hesitant to +schedule a C-section or induction on this date due to the superstition of it. +I was born on Friday the 13th and it has always been an interesting fact about myself. +I never thought about the fact that this may be an uncommon birth date until I came across +this data set on fivethirtyeight ### What specific research questions will you investigate? -*List 2-4 specific research questions. Each should be answerable -using your data set.* +How does the 13th of the month impact birth rates? +How does the date of Friday the 13th impact birth rates? ## Data source +This is a data set from Five Thirty Eight. + ### What data set will you use to answer your overarching question? -*Give the title of your data set and provide a link to your data.* +US_births_2000-2014_SSA.csv +(https://github.com/fivethirtyeight/data/blob/master/births/README.md) ### Where is this data from? -*Describe the source of the data set--not just where you downloaded it, but -the person or organization who gathered the data. Explain why you trust them.* +This data was collected by the Social Security Administration. This is very reliable +data as it is coming straight from government records. ### What is this data about? -*Describe the nature of the data in the dataset, including the number of rows -and some of the columns which will be important to you.* +There are 5480 rows of the data. The columns that will be important to me are +date_of_the_month, day_of_the_week, and births. ## Methods ### How will you use your data set to answer your quantitative questions? -*For each research question, explain what you will do with the data set -to answer the question, and how you will present your answer (e.g. a chart or a table).* +First I will compare the average of the 13th of each month to each other day of the month. +I will show this in a graph. +Then I will compare the day of the week that falls on the 13th with each other. I will +show this on a graph as well. \ No newline at end of file