diff --git a/proposal.md b/proposal.md
index 2586c0c..a919c55 100644
--- a/proposal.md
+++ b/proposal.md
@@ -7,27 +7,50 @@ argument.

 ### What central question are you interested in exploring? Why are you interested in exploring this question?

+What is the deadliest coast in America to drive on?
+
+I am interested in this because I live on the East Coast and we have a lot of car
+accidents. People drive very fast here. The roads are not always properly paved and maintained. I want to know whether it is just bad luck when people get into accidents or whether it is their own fault.
+
 *This should be the big picture question that you ask; use at least 5
 sentences to describe why you are interested in it.*

 ### What specific research questions will you investigate?

-*List 2-4 specific research questions. Each should be answerable
-using your data set.*
+1. Is drinking and driving the biggest cause of fatal collisions?
+
+2. In what state are you most likely to get in a fatal collision for your first accident?
+
+3. What state is the unluckiest for fatal collisions?
+
+4. Is there a connection between speed or road conditions and fatal collisions that would make Car Insurance Premiums more expensive?

 ## Data source

 ### What data set will you use to answer your overarching question?

+The data set is called Bad Drivers.
+
+https://github.com/fivethirtyeight/data/tree/master/bad-drivers
+
 *Give the title of your data set and provide a link to your data.*

 ### Where is this data from?

+I got the data set from FiveThirtyEight. It was used for an article called
+"Dear Mona, Which state has the worst drivers?" in October 2014. The person who wrote the article is Mona Chalabi, who is a data editor at the Guardian US,
+a columnist at New York Magazine, and a lead news writer for FiveThirtyEight.
+
+
 *Describe the source of the data set--not just where you downloaded it,
 but the person or organization who gathered the data.
 Explain why you trust them.*

 ### What is this data about?

+The data is about fatal collisions in each state. There are 7 columns of data for each state; some of the columns
+are "Percentage Of Drivers Involved In Fatal Collisions Who Were Alcohol-Impaired" and "Percentage Of Drivers Involved In Fatal Collisions Who Were Not Distracted".
+
+
 *Describe the nature of the data in the dataset, including the number of rows
 and some of the columns which will be important to you.*
@@ -35,5 +58,21 @@ and some of the columns which will be important to you.*

 ### How will you use your data set to answer your quantitative questions?

+1. Is drinking and driving the biggest cause of fatal collisions?
+ I would get the mean of each of the 5 different ways fatal collisions happen in
+ the U.S. and then compare them to see which one happens most often.
+
+2. In what state are you most likely to get in a fatal collision for your first accident?
+ I would focus on the column "Percentage Of Drivers Involved In Fatal Collisions Who Had Not Been Involved In Any Previous Accidents" and compare it across states.
+
+3. What state is the unluckiest for fatal collisions?
+ I would compare the "Percentage Of Drivers Involved In Fatal Collisions Who Were Not Distracted" in every state.
+
+4. Is there a connection between speed or road conditions and fatal collisions that would make Car Insurance Premiums more expensive?
+ I would gather the speed and per-mile fatal collision figures for every state, sort the states from lowest to highest insurance premium, and see if they align.
+
+I will present the data in a chart format.
+
+
 *For each research question, explain what you will do with the data set to
 answer the question, and how you will present your answer (e.g. a chart or a table).*
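
The analysis steps proposed for questions 1 and 4 (averaging each cause column, then sorting states by insurance premium to see whether the orderings line up) could be sketched roughly as below. This is only a minimal illustration using the Python standard library, not the proposal author's actual method. The shortened column names (`Speeding`, `Alcohol-Impaired`), the state labels `A`, `B`, `C`, and every number in `SAMPLE_CSV` are hypothetical placeholders, not values from the real Bad Drivers file.

```python
import csv
import io
from statistics import mean

# Hypothetical placeholder rows: these numbers are invented for illustration
# and are NOT values from the real Bad Drivers data set.
SAMPLE_CSV = """State,Speeding,Alcohol-Impaired,Car Insurance Premiums ($)
A,40,30,800
B,20,25,700
C,30,32,900
"""


def load_rows(text):
    """Parse CSV text into a list of dicts, converting numeric fields to float."""
    rows = []
    for raw in csv.DictReader(io.StringIO(text)):
        row = {"State": raw["State"]}
        for key, value in raw.items():
            if key != "State":
                row[key] = float(value)
        rows.append(row)
    return rows


def column_means(rows, columns):
    """Return the mean across all states for each requested column."""
    return {col: mean(row[col] for row in rows) for col in columns}


def states_sorted_by(rows, column):
    """Return state names ordered from the lowest to the highest value of a column."""
    return [row["State"] for row in sorted(rows, key=lambda r: r[column])]


rows = load_rows(SAMPLE_CSV)

# Question 1: compare the mean of each cause column.
means = column_means(rows, ["Speeding", "Alcohol-Impaired"])
largest_cause = max(means, key=means.get)

# Question 4: see whether the premium ordering matches the speeding ordering.
by_premium = states_sorted_by(rows, "Car Insurance Premiums ($)")
by_speeding = states_sorted_by(rows, "Speeding")

print(means)
print(largest_cause)
print(by_premium, by_speeding)
```

With the real file, `SAMPLE_CSV` would be replaced by the contents of the downloaded CSV and the full column headers would be used in place of the shortened ones. The two sorted lists could then be placed side by side in a table or chart to judge how closely they align.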