argument.ipynb proposal.md brfss_2020_cleaned.csv

This commit is contained in:
njmason2
2025-11-01 11:11:29 -04:00
parent 62cc21b7be
commit ae946d525f
5 changed files with 332988 additions and 100 deletions

View File

@@ -1,51 +1,69 @@
# Project proposal
This planning document will also form the introduction of your
argument.
Nelson Mason - Date: 10/29/2025
## Overarching Question
### What central question are you interested in exploring? Why are you interested in exploring this question?
*This should be the big picture question that you ask; use at least 5
sentences to describe why you are interested in it.*
I want to know about what relationship exists, if any, between an adult (18 +)
person's age and their weight (I'll use metric).
I'm trying to find out at what age, on average, do people experience a dramatic
weight gain or loss, if at all?
I'm curious to find out if such a dramatic increase or decrease in weight can
be captured in a one-time snapshot database, where individuals are NOT tracked
over a period of time, but ONLY once.
### What specific research questions will you investigate?
*List 2-4 specific research questions. Each should be answerable
using your data set.*
What number or percentage can be used to accurately indicate a
dramatic change in weight by age? How do I determine what "dramatic" is?
## Data source
### What data set will you use to answer your overarching question?
*Give the title of your data set and provide a link to your data.*
### Where is this data from?
*Describe the source of the data set--not just where you downloaded it, but
the person or organization who gathered the data. Explain why you trust them.*
### What is this data about?
*Describe the nature of the data in the dataset, including the number of rows
and some of the columns which will be important to you.*
## Methods
### How will you use your data set to answer your quantitative questions?
*For each research question, explain what you will do with the data set
to answer the question, and how you will present your answer (e.g. a chart or a table).*
# Project proposal
Argument Project
Nelson Mason - Date: 10/29/2025
## Overarching Question
### What central question are you interested in exploring? Why are you interested in exploring this question?
*This should be the big picture question that you ask; use at least 5
sentences to describe why you are interested in it.*
I want to know about what relationship exists, if any, between an adult (18 +)
person's age and their weight (I'll use metric).
I'm trying to find out at what age, on average, do people experience a dramatic
weight gain or loss, if at all?
I'm curious to find out if such a dramatic increase or decrease in weight can
be captured in a one-time snapshot database, where individuals are NOT tracked
over a period of time, but ONLY once.
### What specific research questions will you investigate?
*List 2-4 specific research questions. Each should be answerable
using your data set.*
What number or percentage can be used to accurately indicate a
dramatic change in weight by age? How do I determine what "dramatic" is?
## Data source
https://www.cdc.gov/brfss/annual_data/annual_2020.htm
### What data set will you use to answer your overarching question?
brfss_2020_cleaned.csv
### Where is this data from?
BRFSS 2020
This lab uses a simplified subset of the BRFSS 2020 dataset, brfss_2020.csv. This notebook explains the variables included as well as the process used to produce this file. Read more about BRFSS at https://www.cdc.gov/brfss/annual_data/annual_2020.htm
l“The link brfss/annual_data/annual_2020.htm directs to the 2020 Behavioral Risk Factor Surveillance System (BRFSS) annual survey data from the Centers for Disease Control and Prevention (CDC). This dataset includes data from 50 states, the District of Columbia, Guam, and Puerto Rico, collected through a combination of landline and cell phone interviews. The 2020 data reflect changes in the weighting methodology and the inclusion of cell phone respondents that began in 2011, making it non-comparable to data from before that year.
What the 2020 BRFSS data includes:
• Survey data: Includes approximately 401,958 records and 279 variables.
• Data files: Available in ASCII and SAS Transport formats.
• Geographic scope: Data collected from all 50 states, the District of Columbia, Guam, and Puerto Rico.
Methodology: A combination of landline and cell phone data, using updated weighting methods.
• Documentation: Includes a codebook, survey description, and information on data collection and processing.
Key differences from prior years:
• The 2020 data is not directly comparable to BRFSS data from before 2011 due to the inclusion of cell-phone-only respondents and a revised weighting methodology known as "raking".
How to use the data:
• Users can access the 2020 survey data and accompanying documentation through the CDC's BRFSS website.
• Researchers can use this public data for various studies on health-related behaviors and chronic conditions, as shown in the example research that analyzed the association between sleep, exercise, and coronary heart disease in the 2020 BRFSS data.” Googe Search
### What is this data about?
2 columns: Age and Weight (metric)
166,426 rows
## Methods
I will use quantitative analysis methods.
### How will you use your data set to answer your quantitative questions?
*For each research question, explain what you will do with the data set
to answer the question, and how you will present your answer (e.g. a chart or a table).*
I will create several charts.