generated from mwc/project_argument
1199 lines
52 KiB
Plaintext
1199 lines
52 KiB
Plaintext
{
|
||
"cells": [
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "worldwide-blood",
|
||
"metadata": {},
|
||
"source": [
|
||
"# Introduction"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "understanding-numbers",
|
||
"metadata": {},
|
||
"source": [
|
||
"Star Wars is a franchise that has spawned multiple films, in addition to entries through other media. The first six films were divided into two trilogies, with one starting with \"The Phantom Menace\" and the other starting with \"A New Hope.\""
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "greater-circular",
|
||
"metadata": {},
|
||
"source": [
|
||
"## Overarching Question: Are \"The Phantom Menace\" and \"A New Hope\" good \"first\" movies to build interest in Star Wars?"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "appreciated-testimony",
|
||
"metadata": {},
|
||
"source": [
|
||
"Star Wars continues to be popular, as seen through continued exploration of its universe through various media since its introduction in 1977. Even among those who are not fans of the series, many are aware of different characters, debates, lines, or situations from the series, e.g. the line \"No, I am your father\" or it's common misquote, \"Luke, I am your father.\" As \"The Phantom Menace\" and \"A New Hope\" serve as potential entry points into the franchise, being the first film of their respective trilogies, I am interested in seeing whether they helped build interest in the film franchise."
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "permanent-pollution",
|
||
"metadata": {},
|
||
"source": [
|
||
"# Data"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "code",
|
||
"execution_count": 201,
|
||
"id": "technical-evans",
|
||
"metadata": {
|
||
"tags": []
|
||
},
|
||
"outputs": [],
|
||
"source": [
|
||
"#Include any import statements you will need\n",
|
||
"import pandas as pd"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "code",
|
||
"execution_count": 202,
|
||
"id": "overhead-sigma",
|
||
"metadata": {
|
||
"tags": []
|
||
},
|
||
"outputs": [
|
||
{
|
||
"data": {
|
||
"text/html": [
|
||
"<div>\n",
|
||
"<style scoped>\n",
|
||
" .dataframe tbody tr th:only-of-type {\n",
|
||
" vertical-align: middle;\n",
|
||
" }\n",
|
||
"\n",
|
||
" .dataframe tbody tr th {\n",
|
||
" vertical-align: top;\n",
|
||
" }\n",
|
||
"\n",
|
||
" .dataframe thead th {\n",
|
||
" text-align: right;\n",
|
||
" }\n",
|
||
"</style>\n",
|
||
"<table border=\"1\" class=\"dataframe\">\n",
|
||
" <thead>\n",
|
||
" <tr style=\"text-align: right;\">\n",
|
||
" <th></th>\n",
|
||
" <th>RespondentID</th>\n",
|
||
" <th>Have you seen any of the 6 films in the Star Wars franchise?</th>\n",
|
||
" <th>Do you consider yourself to be a fan of the Star Wars film franchise?</th>\n",
|
||
" <th>Which of the following Star Wars films have you seen? Please select all that apply.</th>\n",
|
||
" <th>Unnamed: 4</th>\n",
|
||
" <th>Unnamed: 5</th>\n",
|
||
" <th>Unnamed: 6</th>\n",
|
||
" <th>Unnamed: 7</th>\n",
|
||
" <th>Unnamed: 8</th>\n",
|
||
" <th>Please rank the Star Wars films in order of preference with 1 being your favorite film in the franchise and 6 being your least favorite film.</th>\n",
|
||
" <th>...</th>\n",
|
||
" <th>Unnamed: 28</th>\n",
|
||
" <th>Which character shot first?</th>\n",
|
||
" <th>Are you familiar with the Expanded Universe?</th>\n",
|
||
" <th>Do you consider yourself to be a fan of the Expanded Universe?åÊ</th>\n",
|
||
" <th>Do you consider yourself to be a fan of the Star Trek franchise?</th>\n",
|
||
" <th>Gender</th>\n",
|
||
" <th>Age</th>\n",
|
||
" <th>Household Income</th>\n",
|
||
" <th>Education</th>\n",
|
||
" <th>Location (Census Region)</th>\n",
|
||
" </tr>\n",
|
||
" </thead>\n",
|
||
" <tbody>\n",
|
||
" <tr>\n",
|
||
" <th>0</th>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>Response</td>\n",
|
||
" <td>Response</td>\n",
|
||
" <td>Star Wars: Episode I The Phantom Menace</td>\n",
|
||
" <td>Star Wars: Episode II Attack of the Clones</td>\n",
|
||
" <td>Star Wars: Episode III Revenge of the Sith</td>\n",
|
||
" <td>Star Wars: Episode IV A New Hope</td>\n",
|
||
" <td>Star Wars: Episode V The Empire Strikes Back</td>\n",
|
||
" <td>Star Wars: Episode VI Return of the Jedi</td>\n",
|
||
" <td>Star Wars: Episode I The Phantom Menace</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>Yoda</td>\n",
|
||
" <td>Response</td>\n",
|
||
" <td>Response</td>\n",
|
||
" <td>Response</td>\n",
|
||
" <td>Response</td>\n",
|
||
" <td>Response</td>\n",
|
||
" <td>Response</td>\n",
|
||
" <td>Response</td>\n",
|
||
" <td>Response</td>\n",
|
||
" <td>Response</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>1</th>\n",
|
||
" <td>3.292880e+09</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>Star Wars: Episode I The Phantom Menace</td>\n",
|
||
" <td>Star Wars: Episode II Attack of the Clones</td>\n",
|
||
" <td>Star Wars: Episode III Revenge of the Sith</td>\n",
|
||
" <td>Star Wars: Episode IV A New Hope</td>\n",
|
||
" <td>Star Wars: Episode V The Empire Strikes Back</td>\n",
|
||
" <td>Star Wars: Episode VI Return of the Jedi</td>\n",
|
||
" <td>3</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>Very favorably</td>\n",
|
||
" <td>I don't understand this question</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>No</td>\n",
|
||
" <td>No</td>\n",
|
||
" <td>Male</td>\n",
|
||
" <td>18-29</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>High school degree</td>\n",
|
||
" <td>South Atlantic</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>2</th>\n",
|
||
" <td>3.292880e+09</td>\n",
|
||
" <td>No</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>Male</td>\n",
|
||
" <td>18-29</td>\n",
|
||
" <td>$0 - $24,999</td>\n",
|
||
" <td>Bachelor degree</td>\n",
|
||
" <td>West South Central</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>3</th>\n",
|
||
" <td>3.292765e+09</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>No</td>\n",
|
||
" <td>Star Wars: Episode I The Phantom Menace</td>\n",
|
||
" <td>Star Wars: Episode II Attack of the Clones</td>\n",
|
||
" <td>Star Wars: Episode III Revenge of the Sith</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>Unfamiliar (N/A)</td>\n",
|
||
" <td>I don't understand this question</td>\n",
|
||
" <td>No</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>No</td>\n",
|
||
" <td>Male</td>\n",
|
||
" <td>18-29</td>\n",
|
||
" <td>$0 - $24,999</td>\n",
|
||
" <td>High school degree</td>\n",
|
||
" <td>West North Central</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>4</th>\n",
|
||
" <td>3.292763e+09</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>Star Wars: Episode I The Phantom Menace</td>\n",
|
||
" <td>Star Wars: Episode II Attack of the Clones</td>\n",
|
||
" <td>Star Wars: Episode III Revenge of the Sith</td>\n",
|
||
" <td>Star Wars: Episode IV A New Hope</td>\n",
|
||
" <td>Star Wars: Episode V The Empire Strikes Back</td>\n",
|
||
" <td>Star Wars: Episode VI Return of the Jedi</td>\n",
|
||
" <td>5</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>Very favorably</td>\n",
|
||
" <td>I don't understand this question</td>\n",
|
||
" <td>No</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>Male</td>\n",
|
||
" <td>18-29</td>\n",
|
||
" <td>$100,000 - $149,999</td>\n",
|
||
" <td>Some college or Associate degree</td>\n",
|
||
" <td>West North Central</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>...</th>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>1182</th>\n",
|
||
" <td>3.288389e+09</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>Star Wars: Episode I The Phantom Menace</td>\n",
|
||
" <td>Star Wars: Episode II Attack of the Clones</td>\n",
|
||
" <td>Star Wars: Episode III Revenge of the Sith</td>\n",
|
||
" <td>Star Wars: Episode IV A New Hope</td>\n",
|
||
" <td>Star Wars: Episode V The Empire Strikes Back</td>\n",
|
||
" <td>Star Wars: Episode VI Return of the Jedi</td>\n",
|
||
" <td>5</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>Very favorably</td>\n",
|
||
" <td>Han</td>\n",
|
||
" <td>No</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>Female</td>\n",
|
||
" <td>18-29</td>\n",
|
||
" <td>$0 - $24,999</td>\n",
|
||
" <td>Some college or Associate degree</td>\n",
|
||
" <td>East North Central</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>1183</th>\n",
|
||
" <td>3.288379e+09</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>Star Wars: Episode I The Phantom Menace</td>\n",
|
||
" <td>Star Wars: Episode II Attack of the Clones</td>\n",
|
||
" <td>Star Wars: Episode III Revenge of the Sith</td>\n",
|
||
" <td>Star Wars: Episode IV A New Hope</td>\n",
|
||
" <td>Star Wars: Episode V The Empire Strikes Back</td>\n",
|
||
" <td>Star Wars: Episode VI Return of the Jedi</td>\n",
|
||
" <td>4</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>Very favorably</td>\n",
|
||
" <td>I don't understand this question</td>\n",
|
||
" <td>No</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>Female</td>\n",
|
||
" <td>30-44</td>\n",
|
||
" <td>$50,000 - $99,999</td>\n",
|
||
" <td>Bachelor degree</td>\n",
|
||
" <td>Mountain</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>1184</th>\n",
|
||
" <td>3.288375e+09</td>\n",
|
||
" <td>No</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>No</td>\n",
|
||
" <td>Female</td>\n",
|
||
" <td>30-44</td>\n",
|
||
" <td>$50,000 - $99,999</td>\n",
|
||
" <td>Bachelor degree</td>\n",
|
||
" <td>Middle Atlantic</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>1185</th>\n",
|
||
" <td>3.288373e+09</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>Star Wars: Episode I The Phantom Menace</td>\n",
|
||
" <td>Star Wars: Episode II Attack of the Clones</td>\n",
|
||
" <td>Star Wars: Episode III Revenge of the Sith</td>\n",
|
||
" <td>Star Wars: Episode IV A New Hope</td>\n",
|
||
" <td>Star Wars: Episode V The Empire Strikes Back</td>\n",
|
||
" <td>Star Wars: Episode VI Return of the Jedi</td>\n",
|
||
" <td>4</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>Very favorably</td>\n",
|
||
" <td>Han</td>\n",
|
||
" <td>No</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>Female</td>\n",
|
||
" <td>45-60</td>\n",
|
||
" <td>$100,000 - $149,999</td>\n",
|
||
" <td>Some college or Associate degree</td>\n",
|
||
" <td>East North Central</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>1186</th>\n",
|
||
" <td>3.288373e+09</td>\n",
|
||
" <td>Yes</td>\n",
|
||
" <td>No</td>\n",
|
||
" <td>Star Wars: Episode I The Phantom Menace</td>\n",
|
||
" <td>Star Wars: Episode II Attack of the Clones</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>Star Wars: Episode V The Empire Strikes Back</td>\n",
|
||
" <td>Star Wars: Episode VI Return of the Jedi</td>\n",
|
||
" <td>6</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>Very unfavorably</td>\n",
|
||
" <td>I don't understand this question</td>\n",
|
||
" <td>No</td>\n",
|
||
" <td>NaN</td>\n",
|
||
" <td>No</td>\n",
|
||
" <td>Female</td>\n",
|
||
" <td>> 60</td>\n",
|
||
" <td>$50,000 - $99,999</td>\n",
|
||
" <td>Graduate degree</td>\n",
|
||
" <td>Pacific</td>\n",
|
||
" </tr>\n",
|
||
" </tbody>\n",
|
||
"</table>\n",
|
||
"<p>1187 rows × 38 columns</p>\n",
|
||
"</div>"
|
||
],
|
||
"text/plain": [
|
||
" RespondentID \\\n",
|
||
"0 NaN \n",
|
||
"1 3.292880e+09 \n",
|
||
"2 3.292880e+09 \n",
|
||
"3 3.292765e+09 \n",
|
||
"4 3.292763e+09 \n",
|
||
"... ... \n",
|
||
"1182 3.288389e+09 \n",
|
||
"1183 3.288379e+09 \n",
|
||
"1184 3.288375e+09 \n",
|
||
"1185 3.288373e+09 \n",
|
||
"1186 3.288373e+09 \n",
|
||
"\n",
|
||
" Have you seen any of the 6 films in the Star Wars franchise? \\\n",
|
||
"0 Response \n",
|
||
"1 Yes \n",
|
||
"2 No \n",
|
||
"3 Yes \n",
|
||
"4 Yes \n",
|
||
"... ... \n",
|
||
"1182 Yes \n",
|
||
"1183 Yes \n",
|
||
"1184 No \n",
|
||
"1185 Yes \n",
|
||
"1186 Yes \n",
|
||
"\n",
|
||
" Do you consider yourself to be a fan of the Star Wars film franchise? \\\n",
|
||
"0 Response \n",
|
||
"1 Yes \n",
|
||
"2 NaN \n",
|
||
"3 No \n",
|
||
"4 Yes \n",
|
||
"... ... \n",
|
||
"1182 Yes \n",
|
||
"1183 Yes \n",
|
||
"1184 NaN \n",
|
||
"1185 Yes \n",
|
||
"1186 No \n",
|
||
"\n",
|
||
" Which of the following Star Wars films have you seen? Please select all that apply. \\\n",
|
||
"0 Star Wars: Episode I The Phantom Menace \n",
|
||
"1 Star Wars: Episode I The Phantom Menace \n",
|
||
"2 NaN \n",
|
||
"3 Star Wars: Episode I The Phantom Menace \n",
|
||
"4 Star Wars: Episode I The Phantom Menace \n",
|
||
"... ... \n",
|
||
"1182 Star Wars: Episode I The Phantom Menace \n",
|
||
"1183 Star Wars: Episode I The Phantom Menace \n",
|
||
"1184 NaN \n",
|
||
"1185 Star Wars: Episode I The Phantom Menace \n",
|
||
"1186 Star Wars: Episode I The Phantom Menace \n",
|
||
"\n",
|
||
" Unnamed: 4 \\\n",
|
||
"0 Star Wars: Episode II Attack of the Clones \n",
|
||
"1 Star Wars: Episode II Attack of the Clones \n",
|
||
"2 NaN \n",
|
||
"3 Star Wars: Episode II Attack of the Clones \n",
|
||
"4 Star Wars: Episode II Attack of the Clones \n",
|
||
"... ... \n",
|
||
"1182 Star Wars: Episode II Attack of the Clones \n",
|
||
"1183 Star Wars: Episode II Attack of the Clones \n",
|
||
"1184 NaN \n",
|
||
"1185 Star Wars: Episode II Attack of the Clones \n",
|
||
"1186 Star Wars: Episode II Attack of the Clones \n",
|
||
"\n",
|
||
" Unnamed: 5 \\\n",
|
||
"0 Star Wars: Episode III Revenge of the Sith \n",
|
||
"1 Star Wars: Episode III Revenge of the Sith \n",
|
||
"2 NaN \n",
|
||
"3 Star Wars: Episode III Revenge of the Sith \n",
|
||
"4 Star Wars: Episode III Revenge of the Sith \n",
|
||
"... ... \n",
|
||
"1182 Star Wars: Episode III Revenge of the Sith \n",
|
||
"1183 Star Wars: Episode III Revenge of the Sith \n",
|
||
"1184 NaN \n",
|
||
"1185 Star Wars: Episode III Revenge of the Sith \n",
|
||
"1186 NaN \n",
|
||
"\n",
|
||
" Unnamed: 6 \\\n",
|
||
"0 Star Wars: Episode IV A New Hope \n",
|
||
"1 Star Wars: Episode IV A New Hope \n",
|
||
"2 NaN \n",
|
||
"3 NaN \n",
|
||
"4 Star Wars: Episode IV A New Hope \n",
|
||
"... ... \n",
|
||
"1182 Star Wars: Episode IV A New Hope \n",
|
||
"1183 Star Wars: Episode IV A New Hope \n",
|
||
"1184 NaN \n",
|
||
"1185 Star Wars: Episode IV A New Hope \n",
|
||
"1186 NaN \n",
|
||
"\n",
|
||
" Unnamed: 7 \\\n",
|
||
"0 Star Wars: Episode V The Empire Strikes Back \n",
|
||
"1 Star Wars: Episode V The Empire Strikes Back \n",
|
||
"2 NaN \n",
|
||
"3 NaN \n",
|
||
"4 Star Wars: Episode V The Empire Strikes Back \n",
|
||
"... ... \n",
|
||
"1182 Star Wars: Episode V The Empire Strikes Back \n",
|
||
"1183 Star Wars: Episode V The Empire Strikes Back \n",
|
||
"1184 NaN \n",
|
||
"1185 Star Wars: Episode V The Empire Strikes Back \n",
|
||
"1186 Star Wars: Episode V The Empire Strikes Back \n",
|
||
"\n",
|
||
" Unnamed: 8 \\\n",
|
||
"0 Star Wars: Episode VI Return of the Jedi \n",
|
||
"1 Star Wars: Episode VI Return of the Jedi \n",
|
||
"2 NaN \n",
|
||
"3 NaN \n",
|
||
"4 Star Wars: Episode VI Return of the Jedi \n",
|
||
"... ... \n",
|
||
"1182 Star Wars: Episode VI Return of the Jedi \n",
|
||
"1183 Star Wars: Episode VI Return of the Jedi \n",
|
||
"1184 NaN \n",
|
||
"1185 Star Wars: Episode VI Return of the Jedi \n",
|
||
"1186 Star Wars: Episode VI Return of the Jedi \n",
|
||
"\n",
|
||
" Please rank the Star Wars films in order of preference with 1 being your favorite film in the franchise and 6 being your least favorite film. \\\n",
|
||
"0 Star Wars: Episode I The Phantom Menace \n",
|
||
"1 3 \n",
|
||
"2 NaN \n",
|
||
"3 1 \n",
|
||
"4 5 \n",
|
||
"... ... \n",
|
||
"1182 5 \n",
|
||
"1183 4 \n",
|
||
"1184 NaN \n",
|
||
"1185 4 \n",
|
||
"1186 6 \n",
|
||
"\n",
|
||
" ... Unnamed: 28 Which character shot first? \\\n",
|
||
"0 ... Yoda Response \n",
|
||
"1 ... Very favorably I don't understand this question \n",
|
||
"2 ... NaN NaN \n",
|
||
"3 ... Unfamiliar (N/A) I don't understand this question \n",
|
||
"4 ... Very favorably I don't understand this question \n",
|
||
"... ... ... ... \n",
|
||
"1182 ... Very favorably Han \n",
|
||
"1183 ... Very favorably I don't understand this question \n",
|
||
"1184 ... NaN NaN \n",
|
||
"1185 ... Very favorably Han \n",
|
||
"1186 ... Very unfavorably I don't understand this question \n",
|
||
"\n",
|
||
" Are you familiar with the Expanded Universe? \\\n",
|
||
"0 Response \n",
|
||
"1 Yes \n",
|
||
"2 NaN \n",
|
||
"3 No \n",
|
||
"4 No \n",
|
||
"... ... \n",
|
||
"1182 No \n",
|
||
"1183 No \n",
|
||
"1184 NaN \n",
|
||
"1185 No \n",
|
||
"1186 No \n",
|
||
"\n",
|
||
" Do you consider yourself to be a fan of the Expanded Universe?åÊ \\\n",
|
||
"0 Response \n",
|
||
"1 No \n",
|
||
"2 NaN \n",
|
||
"3 NaN \n",
|
||
"4 NaN \n",
|
||
"... ... \n",
|
||
"1182 NaN \n",
|
||
"1183 NaN \n",
|
||
"1184 NaN \n",
|
||
"1185 NaN \n",
|
||
"1186 NaN \n",
|
||
"\n",
|
||
" Do you consider yourself to be a fan of the Star Trek franchise? \\\n",
|
||
"0 Response \n",
|
||
"1 No \n",
|
||
"2 Yes \n",
|
||
"3 No \n",
|
||
"4 Yes \n",
|
||
"... ... \n",
|
||
"1182 Yes \n",
|
||
"1183 Yes \n",
|
||
"1184 No \n",
|
||
"1185 Yes \n",
|
||
"1186 No \n",
|
||
"\n",
|
||
" Gender Age Household Income \\\n",
|
||
"0 Response Response Response \n",
|
||
"1 Male 18-29 NaN \n",
|
||
"2 Male 18-29 $0 - $24,999 \n",
|
||
"3 Male 18-29 $0 - $24,999 \n",
|
||
"4 Male 18-29 $100,000 - $149,999 \n",
|
||
"... ... ... ... \n",
|
||
"1182 Female 18-29 $0 - $24,999 \n",
|
||
"1183 Female 30-44 $50,000 - $99,999 \n",
|
||
"1184 Female 30-44 $50,000 - $99,999 \n",
|
||
"1185 Female 45-60 $100,000 - $149,999 \n",
|
||
"1186 Female > 60 $50,000 - $99,999 \n",
|
||
"\n",
|
||
" Education Location (Census Region) \n",
|
||
"0 Response Response \n",
|
||
"1 High school degree South Atlantic \n",
|
||
"2 Bachelor degree West South Central \n",
|
||
"3 High school degree West North Central \n",
|
||
"4 Some college or Associate degree West North Central \n",
|
||
"... ... ... \n",
|
||
"1182 Some college or Associate degree East North Central \n",
|
||
"1183 Bachelor degree Mountain \n",
|
||
"1184 Bachelor degree Middle Atlantic \n",
|
||
"1185 Some college or Associate degree East North Central \n",
|
||
"1186 Graduate degree Pacific \n",
|
||
"\n",
|
||
"[1187 rows x 38 columns]"
|
||
]
|
||
},
|
||
"execution_count": 202,
|
||
"metadata": {},
|
||
"output_type": "execute_result"
|
||
}
|
||
],
|
||
"source": [
|
||
"### 💻 FILL IN YOUR DATASET FILE NAME BELOW 💻 ###\n",
|
||
"\n",
|
||
"file_name = \"StarWars.csv\"\n",
|
||
"dataset_path = \"data/\" + file_name\n",
|
||
"\n",
|
||
"df = pd.read_csv(dataset_path)\n",
|
||
"\n",
|
||
"df"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "continental-franklin",
|
||
"metadata": {},
|
||
"source": [
|
||
"**Data Overview**\n",
|
||
"\n",
|
||
"The data comes from a survey performed by FiveThirtyEight, a website aiming to deliver data-informed journalism, for one of their articles, \"America's Favorite 'Star Wars' Movies (And Least Favorite Characters).\" There are 1188 rows in the dataset. Of these, 1186 represent respondents. Respondents indicated, among other things, whether they were fans and which of the movies from the first two trilogies they'd watched."
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "infinite-instrument",
|
||
"metadata": {},
|
||
"source": [
|
||
"# Methods and Results"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "e35faa85-723c-4e24-84d8-247faef1d609",
|
||
"metadata": {},
|
||
"source": [
|
||
"**Data Cleaning**\n",
|
||
"\n",
|
||
"For convenience, we will clean the data by removing columns that are not of interest and by changing the first two rows, which collectively serve as the header, so that the same information is reflected in one row. We will also rename the column names to conveniently access them. This means, we are interested in the columns indicating whether the respondent has seen at least one film, whether they are a fan, and which of the first six films they have seen.\n",
|
||
"\n",
|
||
"We now have to remove any people who have seen no films. We also have to remove those who have seen neither \"The Phantom Menace\" nor \"A New Hope.\"\n",
|
||
"Note: Some respondents reported viewing at least one film but did not indicate which of the films they viewed. These respondents will be removed from the dataset.\n",
|
||
"\n",
|
||
"Since every variable we have is a binary variable, we can replace each of the values in the data frame with either 0 or 1, where 1 will be yes and 0 will be no. Missing data, indicating a respondent has not seen a particula rmovie, will be replaced with 0 also."
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "code",
|
||
"execution_count": 203,
|
||
"id": "c3468864-d1fc-4957-b94e-e25b94fd1d58",
|
||
"metadata": {
|
||
"tags": []
|
||
},
|
||
"outputs": [
|
||
{
|
||
"data": {
|
||
"text/html": [
|
||
"<div>\n",
|
||
"<style scoped>\n",
|
||
" .dataframe tbody tr th:only-of-type {\n",
|
||
" vertical-align: middle;\n",
|
||
" }\n",
|
||
"\n",
|
||
" .dataframe tbody tr th {\n",
|
||
" vertical-align: top;\n",
|
||
" }\n",
|
||
"\n",
|
||
" .dataframe thead th {\n",
|
||
" text-align: right;\n",
|
||
" }\n",
|
||
"</style>\n",
|
||
"<table border=\"1\" class=\"dataframe\">\n",
|
||
" <thead>\n",
|
||
" <tr style=\"text-align: right;\">\n",
|
||
" <th></th>\n",
|
||
" <th>Fan</th>\n",
|
||
" <th>ThePhantomMenace</th>\n",
|
||
" <th>AttackOfTheClones</th>\n",
|
||
" <th>RevengeOfTheSith</th>\n",
|
||
" <th>ANewHope</th>\n",
|
||
" <th>TheEmpireStrikesBack</th>\n",
|
||
" <th>ReturnOfTheJedi</th>\n",
|
||
" </tr>\n",
|
||
" </thead>\n",
|
||
" <tbody>\n",
|
||
" <tr>\n",
|
||
" <th>1</th>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>3</th>\n",
|
||
" <td>0</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>0</td>\n",
|
||
" <td>0</td>\n",
|
||
" <td>0</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>4</th>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>5</th>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>6</th>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>...</th>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" <td>...</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>1181</th>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>1182</th>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>1183</th>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>1185</th>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>1186</th>\n",
|
||
" <td>0</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>0</td>\n",
|
||
" <td>0</td>\n",
|
||
" <td>1</td>\n",
|
||
" <td>1</td>\n",
|
||
" </tr>\n",
|
||
" </tbody>\n",
|
||
"</table>\n",
|
||
"<p>740 rows × 7 columns</p>\n",
|
||
"</div>"
|
||
],
|
||
"text/plain": [
|
||
" Fan ThePhantomMenace AttackOfTheClones RevengeOfTheSith ANewHope \\\n",
|
||
"1 1 1 1 1 1 \n",
|
||
"3 0 1 1 1 0 \n",
|
||
"4 1 1 1 1 1 \n",
|
||
"5 1 1 1 1 1 \n",
|
||
"6 1 1 1 1 1 \n",
|
||
"... ... ... ... ... ... \n",
|
||
"1181 1 1 1 1 1 \n",
|
||
"1182 1 1 1 1 1 \n",
|
||
"1183 1 1 1 1 1 \n",
|
||
"1185 1 1 1 1 1 \n",
|
||
"1186 0 1 1 0 0 \n",
|
||
"\n",
|
||
" TheEmpireStrikesBack ReturnOfTheJedi \n",
|
||
"1 1 1 \n",
|
||
"3 0 0 \n",
|
||
"4 1 1 \n",
|
||
"5 1 1 \n",
|
||
"6 1 1 \n",
|
||
"... ... ... \n",
|
||
"1181 1 1 \n",
|
||
"1182 1 1 \n",
|
||
"1183 1 1 \n",
|
||
"1185 1 1 \n",
|
||
"1186 1 1 \n",
|
||
"\n",
|
||
"[740 rows x 7 columns]"
|
||
]
|
||
},
|
||
"execution_count": 203,
|
||
"metadata": {},
|
||
"output_type": "execute_result"
|
||
}
|
||
],
|
||
"source": [
|
||
"# Remove columns I am uninterested in.\n",
|
||
"df = df.drop(df.columns[[0,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37]],axis=1)\n",
|
||
"\n",
|
||
"# Respondents don't start until row 3, so remove the first row and rename the column headers to be self-contained.\n",
|
||
"df = df.drop(df.index[0])\n",
|
||
"df.columns = ['Seen','Fan','ThePhantomMenace','AttackOfTheClones','RevengeOfTheSith','ANewHope','TheEmpireStrikesBack','ReturnOfTheJedi']\n",
|
||
"\n",
|
||
"# Remove respondents who have not seen a Star Wars film and who have seen neither The Phantom Menace nor A New Hope.\n",
|
||
"df = df[(df.Seen == \"Yes\") & ((df.ThePhantomMenace == 'Star Wars: Episode I The Phantom Menace') | (df.ANewHope == 'Star Wars: Episode IV A New Hope'))]\n",
|
||
"\n",
|
||
"# Remove the Seen column\n",
|
||
"df = df.drop(df.columns[0],axis=1)\n",
|
||
"\n",
|
||
"# Replace all variables with binary data\n",
|
||
"df=df.fillna('No')\n",
|
||
"df=df.replace('No',0)\n",
|
||
"df=df.replace('Yes',1)\n",
|
||
"df['ThePhantomMenace'] = df['ThePhantomMenace'].replace('Star Wars: Episode I The Phantom Menace',1)\n",
|
||
"df['AttackOfTheClones'] = df['AttackOfTheClones'].replace('Star Wars: Episode II Attack of the Clones',1)\n",
|
||
"df['RevengeOfTheSith'] = df['RevengeOfTheSith'].replace('Star Wars: Episode III Revenge of the Sith',1)\n",
|
||
"df['ANewHope'] = df['ANewHope'].replace('Star Wars: Episode IV A New Hope',1)\n",
|
||
"df['TheEmpireStrikesBack'] = df['TheEmpireStrikesBack'].replace('Star Wars: Episode V The Empire Strikes Back',1)\n",
|
||
"df['ReturnOfTheJedi'] = df['ReturnOfTheJedi'].replace('Star Wars: Episode VI Return of the Jedi',1)\n",
|
||
"\n",
|
||
"df"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "dc173b87-426f-4a31-a5d5-9bac84e4f0aa",
|
||
"metadata": {
|
||
"tags": []
|
||
},
|
||
"source": [
|
||
"## First Research Question: If a person watches either \"The Phantom Menace\" or \"A New Hope,\" how likely is the person to be a fan?\n"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "endless-variation",
|
||
"metadata": {},
|
||
"source": [
|
||
"To answer this question, we focus on the columns where respondents responded whether they were fans, whether they've seen The Phantom Menace, and whether they've seen A New Hope. We can simply count the number of 1s to find the number of people who have watched either movie, and we can filter by viewers of those movies to see how many fans of Star Wars there are. We can use mean() now that the data has been converted to 0s and 1s to determine the fraction of viewers who are fans. These results can be presented visually in a table."
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "portuguese-japan",
|
||
"metadata": {},
|
||
"source": [
|
||
"### Results "
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "code",
|
||
"execution_count": 204,
|
||
"id": "219ad7ad-ef4d-4c0a-a469-c42cd625b259",
|
||
"metadata": {
|
||
"tags": []
|
||
},
|
||
"outputs": [
|
||
{
|
||
"data": {
|
||
"text/html": [
|
||
"<div>\n",
|
||
"<style scoped>\n",
|
||
" .dataframe tbody tr th:only-of-type {\n",
|
||
" vertical-align: middle;\n",
|
||
" }\n",
|
||
"\n",
|
||
" .dataframe tbody tr th {\n",
|
||
" vertical-align: top;\n",
|
||
" }\n",
|
||
"\n",
|
||
" .dataframe thead th {\n",
|
||
" text-align: right;\n",
|
||
" }\n",
|
||
"</style>\n",
|
||
"<table border=\"1\" class=\"dataframe\">\n",
|
||
" <thead>\n",
|
||
" <tr style=\"text-align: right;\">\n",
|
||
" <th></th>\n",
|
||
" <th>FilmViewed</th>\n",
|
||
" <th>NumberOfViewers</th>\n",
|
||
" <th>FansWhoAreViewers</th>\n",
|
||
" <th>FanPercentage (%)</th>\n",
|
||
" </tr>\n",
|
||
" </thead>\n",
|
||
" <tbody>\n",
|
||
" <tr>\n",
|
||
" <th>0</th>\n",
|
||
" <td>The Phantom Menace</td>\n",
|
||
" <td>673</td>\n",
|
||
" <td>500</td>\n",
|
||
" <td>74.294205</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>1</th>\n",
|
||
" <td>A New Hope</td>\n",
|
||
" <td>607</td>\n",
|
||
" <td>483</td>\n",
|
||
" <td>79.571664</td>\n",
|
||
" </tr>\n",
|
||
" </tbody>\n",
|
||
"</table>\n",
|
||
"</div>"
|
||
],
|
||
"text/plain": [
|
||
" FilmViewed NumberOfViewers FansWhoAreViewers FanPercentage (%)\n",
|
||
"0 The Phantom Menace 673 500 74.294205\n",
|
||
"1 A New Hope 607 483 79.571664"
|
||
]
|
||
},
|
||
"execution_count": 204,
|
||
"metadata": {},
|
||
"output_type": "execute_result"
|
||
}
|
||
],
|
||
"source": [
|
||
"viewedPhantom = df.ThePhantomMenace.value_counts()[1]\n",
|
||
"fanPhantom = df[df.ThePhantomMenace == 1].Fan.value_counts()[1]\n",
|
||
"fractionPhantom = df[df.ThePhantomMenace == 1].Fan.mean()\n",
|
||
"viewedHope = df.ANewHope.value_counts()[1]\n",
|
||
"fanHope = df[df.ANewHope == 1].Fan.value_counts()[1]\n",
|
||
"fractionHope = df[df.ANewHope == 1].Fan.mean()\n",
|
||
"\n",
|
||
"df2 = pd.DataFrame({'FilmViewed':[\"The Phantom Menace\",\"A New Hope\"],\n",
|
||
" 'NumberOfViewers':[viewedPhantom,viewedHope],\n",
|
||
" 'FansWhoAreViewers':[fanPhantom,fanHope],\n",
|
||
" 'FanPercentage (%)':[fractionPhantom*100,fractionHope*100]})\n",
|
||
"df2"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "fda63ce1-a99d-44b7-874b-03b744142d5d",
|
||
"metadata": {},
|
||
"source": [
|
||
"## Second Research Question: If a person watches either \"The Phantom Menace\" or \"A New Hope,\" how likely are they to watch the following movie in its trilogy?"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "c1b1bef8-6884-4f13-b229-99f4270fd42f",
|
||
"metadata": {},
|
||
"source": [
|
||
"To answer this question, we look at the columns indicating whether respondents have watched The Phantom Menace or A New Hope. From there, we look at whether they also watched Attack of the Clones or The Empire Strikes back, respectively. As before, we can simply count the 1s to determine the number of people who viewed Attack of the Clones or The Empire Strikes Back and use mean() to determine the fraction of people who watched the second movie in addition to the first movie in a trilogy. We present the results in a table."
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "code",
|
||
"execution_count": 205,
|
||
"id": "f6651522-3fe4-4c57-85e7-71a2bd6aa3cf",
|
||
"metadata": {
|
||
"tags": []
|
||
},
|
||
"outputs": [
|
||
{
|
||
"data": {
|
||
"text/html": [
|
||
"<div>\n",
|
||
"<style scoped>\n",
|
||
" .dataframe tbody tr th:only-of-type {\n",
|
||
" vertical-align: middle;\n",
|
||
" }\n",
|
||
"\n",
|
||
" .dataframe tbody tr th {\n",
|
||
" vertical-align: top;\n",
|
||
" }\n",
|
||
"\n",
|
||
" .dataframe thead th {\n",
|
||
" text-align: right;\n",
|
||
" }\n",
|
||
"</style>\n",
|
||
"<table border=\"1\" class=\"dataframe\">\n",
|
||
" <thead>\n",
|
||
" <tr style=\"text-align: right;\">\n",
|
||
" <th></th>\n",
|
||
" <th>FilmViewed</th>\n",
|
||
" <th>NumberOfViewers</th>\n",
|
||
" <th>ViewedNextMovie</th>\n",
|
||
" <th>PercentageWatchingNextMovie (%)</th>\n",
|
||
" </tr>\n",
|
||
" </thead>\n",
|
||
" <tbody>\n",
|
||
" <tr>\n",
|
||
" <th>0</th>\n",
|
||
" <td>The Phantom Menace</td>\n",
|
||
" <td>673</td>\n",
|
||
" <td>554</td>\n",
|
||
" <td>82.317979</td>\n",
|
||
" </tr>\n",
|
||
" <tr>\n",
|
||
" <th>1</th>\n",
|
||
" <td>A New Hope</td>\n",
|
||
" <td>607</td>\n",
|
||
" <td>597</td>\n",
|
||
" <td>98.352554</td>\n",
|
||
" </tr>\n",
|
||
" </tbody>\n",
|
||
"</table>\n",
|
||
"</div>"
|
||
],
|
||
"text/plain": [
|
||
" FilmViewed NumberOfViewers ViewedNextMovie \\\n",
|
||
"0 The Phantom Menace 673 554 \n",
|
||
"1 A New Hope 607 597 \n",
|
||
"\n",
|
||
" PercentageWatchingNextMovie (%) \n",
|
||
"0 82.317979 \n",
|
||
"1 98.352554 "
|
||
]
|
||
},
|
||
"execution_count": 205,
|
||
"metadata": {},
|
||
"output_type": "execute_result"
|
||
}
|
||
],
|
||
"source": [
|
||
"viewedAttack = df[df.ThePhantomMenace == 1].AttackOfTheClones.value_counts()[1]\n",
|
||
"fractionAttack = df[df.ThePhantomMenace == 1].AttackOfTheClones.mean()\n",
|
||
"viewedEmpire = df[df.ANewHope == 1].TheEmpireStrikesBack.value_counts()[1]\n",
|
||
"fractionEmpire = df[df.ANewHope == 1].TheEmpireStrikesBack.mean()\n",
|
||
"\n",
|
||
"df3 = pd.DataFrame({'FilmViewed':[\"The Phantom Menace\",\"A New Hope\"],\n",
|
||
" 'NumberOfViewers':[viewedPhantom,viewedHope],\n",
|
||
" 'ViewedNextMovie':[viewedAttack,viewedEmpire],\n",
|
||
" 'PercentageWatchingNextMovie (%)':[fractionAttack*100,fractionEmpire*100]})\n",
|
||
"df3"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "infectious-symbol",
|
||
"metadata": {},
|
||
"source": [
|
||
"# Discussion"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "furnished-camping",
|
||
"metadata": {
|
||
"code_folding": []
|
||
},
|
||
"source": [
|
||
"## Considerations"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "bearing-stadium",
|
||
"metadata": {},
|
||
"source": [
|
||
"It's worth noting that this analysis cannot determine if a person is a fan of Star Wars solely because they watched either \"The Phantom Menace\" or \"A New Hope.\" It might be possible that becoming a fan is a result of watching more than one film. \n",
|
||
"\n",
|
||
"This is particularly important for this analysis since a respondent could have seen both \"The Phantom Menace\" and \"A New Hope.\" Although information was recorded on which of the six movies were viewed by respondents, no information was recorded on the order in which films were viewed, so it is not possible to determine a partciular respondent's \"entry point\" into the film franchise or which film(s) in particular, for those who have seen both or more, contributed to becoming a fan.\n",
|
||
"\n",
|
||
"Relatedly, there could be multiple reasons for watching a second movie in a given trilogy beyond simply liking the first. For example, a fan of one trilogy might feel obligated to watch the other, or a person might have watched the films with someone who is a fan.\n",
|
||
"\n",
|
||
"In the dataset, there were some respondents who reported seeing a Star Wars movie but whose responses in the dataset did not indicate which of the films they saw. These respondents were removed from the dataset when the analysis was performed, but the results could have been affected by the missing data."
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "beneficial-invasion",
|
||
"metadata": {},
|
||
"source": [
|
||
"## Summary"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "about-raise",
|
||
"metadata": {},
|
||
"source": [
|
||
"The data would seem to suggest that that \"The Phantom Menace\" and \"A New Hope\" are good \"first\" movies to build interest in Star Wars. This was determined by answering two quantitative questions:\n",
|
||
"\n",
|
||
"1) If a person watches either \"The Phantom Menace\" or \"A New Hope,\" how likely is the person to be a fan?\n",
|
||
"2) If a person watches either \"The Phantom Menace\" or \"A New Hope,\" how likely are they to watch the following movie in its trilogy?\n",
|
||
"\n",
|
||
"For both questions, the vast majority of viewers of either of those movies were likely to be a fan or watch the following movie. More than 74% of people who watched \"The Phantom Menace\" reported being a fan, and more than 79% of people who watched \"A New Hope\" reported being a fan. The results were more convincing for the second question. Of those watching \"The Phantom Menace,\" more than 82% reported also watching \"Attack of the Clones,\" and of those watching \"A New Hope,\" more than 98% reported also watching \"The Empire Strikes back.\"\n",
|
||
"\n",
|
||
"That \"The Phantom Menace\" and \"A New Hope\" are seemingly compelling films to start a trilogy would seem to make sense given Star Wars' continuing popularity 46 years after the release of \"A New Hope.\" Building a franchise requires a significant committment of fiscal, human, legal, and organizational resources. Had \"A New Hope\" been unsuccessful as a film when it first came out, it is unlikely that more would have been invested in the franchise.\n",
|
||
"\n",
|
||
"One thing that was surprising was that \"The Phantom Menace\" was reportedly viewed by more people than \"A New Hope\" while at the same time, those who watched \"A New Hope\" were more likely to identify as fans and to report having watched the second film in its trilogy. Although I've seen all six of the movies of the original and prequel trilogies, I feel as though I perhaps need to rewatch \"A New Hope\" given the dataset I analyzed. I watched the prequel trilogy as the films came out and only watched the original trilogy years later--when I finally got Disney+."
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "code",
|
||
"execution_count": null,
|
||
"id": "de1bca69-b407-4b56-8df7-2284b7ff1746",
|
||
"metadata": {},
|
||
"outputs": [],
|
||
"source": []
|
||
}
|
||
],
|
||
"metadata": {
|
||
"jupytext": {
|
||
"cell_metadata_json": true,
|
||
"text_representation": {
|
||
"extension": ".Rmd",
|
||
"format_name": "rmarkdown",
|
||
"format_version": "1.2",
|
||
"jupytext_version": "1.9.1"
|
||
}
|
||
},
|
||
"kernelspec": {
|
||
"display_name": "Python 3 (ipykernel)",
|
||
"language": "python",
|
||
"name": "python3"
|
||
},
|
||
"language_info": {
|
||
"codemirror_mode": {
|
||
"name": "ipython",
|
||
"version": 3
|
||
},
|
||
"file_extension": ".py",
|
||
"mimetype": "text/x-python",
|
||
"name": "python",
|
||
"nbconvert_exporter": "python",
|
||
"pygments_lexer": "ipython3",
|
||
"version": "3.11.4"
|
||
},
|
||
"toc": {
|
||
"base_numbering": 1,
|
||
"nav_menu": {},
|
||
"number_sections": false,
|
||
"sideBar": true,
|
||
"skip_h1_title": false,
|
||
"title_cell": "Table of Contents",
|
||
"title_sidebar": "Contents",
|
||
"toc_cell": false,
|
||
"toc_position": {},
|
||
"toc_section_display": true,
|
||
"toc_window_display": false
|
||
},
|
||
"varInspector": {
|
||
"cols": {
|
||
"lenName": 16,
|
||
"lenType": 16,
|
||
"lenVar": 40
|
||
},
|
||
"kernels_config": {
|
||
"python": {
|
||
"delete_cmd_postfix": "",
|
||
"delete_cmd_prefix": "del ",
|
||
"library": "var_list.py",
|
||
"varRefreshCmd": "print(var_dic_list())"
|
||
},
|
||
"r": {
|
||
"delete_cmd_postfix": ") ",
|
||
"delete_cmd_prefix": "rm(",
|
||
"library": "var_list.r",
|
||
"varRefreshCmd": "cat(var_dic_list()) "
|
||
}
|
||
},
|
||
"types_to_exclude": [
|
||
"module",
|
||
"function",
|
||
"builtin_function_or_method",
|
||
"instance",
|
||
"_Feature"
|
||
],
|
||
"window_display": false
|
||
}
|
||
},
|
||
"nbformat": 4,
|
||
"nbformat_minor": 5
|
||
}
|