Weed-Out










                           Data Visualization - Project Report 
                          
                                 Instructor: Judd Bradbury







                                                                                                                                                                Group 11
Aravind Vasu Murugan
Meera Joseph
Ritesh Kunisseri Puliyakote
Sriram Murali
Tejasvi Ramadas Sagar
                                      

                                             INTRODUCTION                      

Cannabis which is better known as “Marijuana” is intended to use for psychological or medicinal use and it has been proven to be a safe and effective treatment for the symptoms of many diseases such as AIDS, Arthritis, Alzheimer’s, migraines, hepatitis C, glaucoma, psychological conditions and many more. Marijuana has impacted human lives in a rampant manner. But lately, this has been legalized for recreational use. It falls under the hallucination drug category. Initially Marijuana was used only for medicinal purpose, but few states in the USA have legalized Marijuana for recreational use.

In Project Case study-1, two insights were proposed which are given below
1.     Prospective states where marijuana can be legalized based on sales, quality and average price per ounce.
2.     Legalization of Marijuana has negligible impact on crime rate of Denver county.

In this project report, sales and crimes of Denver county in Colorado with respect to the Marijuana is analyzed. The vote to legalize Marijuana was passed in September 2012, but, it was not until 2014, legalization was official. January 2014 is considered as the start of legalization period.

The project will provide a deep dive into the overall crimes and then drill-down to Marijuana crimes related to sales of Marijuana. The analysis is performed with comparison of pre-legalization and post-legalization period and how the behavior of crime is related. Our target audience are the pro-weed(Marijuana) people who want to legalize Marijuana in more states for business purposes. We would like to showcase that legalizing Marijuana for recreational use does not affect the society in terms of crime.

Data Sources

Below are the datasets used to analyze the sales of marijuana in Denver County before and after legalization, overall crime statistics of Denver county and marijuana related crime statistics of Denver.

Marijuana Sales Dataset:
The primary data set which gives us the overall sales of Marijuana in the US for the period of 2011, 2012 and 2013. This dataset was scraped from PriceOfWeed.com using a Python script where the records of weed sold are maintained and exhibited in GitHub folder for public use. The dataset contains 1.3 million sales transactions from all the states of USA.

Marijuana gross sales for 2014-2016 (Month wise totals):
Post legalization (from 2014), County of Denver maintains record of sale, possession and cultivation of Marijuana and its medical usage. The data is maintained by County of Denver and is available for public use.

Denver overall crime dataset (2012 - 2016):
The secondary dataset includes criminal offenses in the City and County of Denver from 2012 to 2016. The data is based on the National Incident Based Reporting System (NIBRS) which includes all victims of person crimes and all crimes within an incident. The dataset was obtained from County of Denver website where all the crimes are recorded and the period from 2012 to 2016, consist of both pre-legalization and post-legalization Marijuana period.

Denver Marijuana Crime Dataset (2012 - 2016):
The secondary dataset includes criminal offenses related to marijuana in the City and County of Denver from 2012 to 2016. Data in this file are crimes reported to the Denver Police Department which, upon review, were determined to have clear connection or relation to marijuana. The data is based on the National Incident Based Reporting System (NIBRS) which includes all victims of person crimes and all crimes within an incident. The dataset contains 1254 records of reported crime in Denver county.

Understanding the Data

The project involves multiple data sets which gives information about the sales of marijuana and crime statistics of Denver county. It is important to understand the attributes and nature of the data to derive useful insights. Below are some of the key information which will be helpful to understand the data sets.

      The pre-legalization sales data was from priceofWeed.com which is one of the channels which recorded the sale of Marijuana. Post legalization period, County of Denver maintained records of Marijuana sold for both recreational and medical use.
      The crime data set is from the official website from County of Denver. The data set has categorized into different crimes based on the offense type. Based on year/month, we can analyze the trend present in the data.
      As the vote for legalization was made official in 2014, the same will be considered as the start of the post-legalization period.
      “Number of records” is a measure generated by Tableau which is used to project the count of crimes committed.
Below are the important fields in each of our data set and its description

Data set
Column Name
Description
Sales data set
Month
Name of the month
Sales data set
Year
Year the sale was made
Sales data set
Gross sales type
The type of sale – Marijuana sold for medical or recreational use
Sales data set
 Gross sales
The total sales month wise for the post- legalization period.
Crime data set
Reported date
The date on which the crime was reported. To be used in Year/months format too to visualize.
Crime data set
Offense_Catgory_Id
Records a broad category of crimes committed
 Crime data set
 Offense_Type
Reveals the detail about the type of crime committed.
Crime data set
isCrime
Represents 1, if it is a crime, else 0
Crime data set
isTraffic
Represents 1, if it is a traffic accident, else 0
Crime data set
First_occurance_date
The date when the crime first occurred, could differ from the reported date.

Exploratory/Statistical Analysis

Understanding the dataset is more important and therefore, we need to explore the data and perform analysis. Performing exploratory data visualizations helps us to understand different patterns and trends in within the data. This helps to make business decisions based on the insights developed. Following are the exploratory/statistical analysis used in this project:
·       Box plots are created to analyze the distribution of gross sales during the post-legalization period.
·       Trend lines are generated to infer, if there is an increase/decrease in crimes committed.
·       Created calculated fields to implement ratio of crimes committed per $1,000 sales, which gives relationship between sales and crimes related to Marijuana.

·       Small multiples technique is used to perform drill-down analysis about the percentage change in crime committed between 2012-2016.
·       Difference in numbers of drug-marijuana crimes is compared for post legalization period.

Insights

Based on the analysis performed, we have derived 3 insights and used different visual encodings to help audience understand the visualization. Following are the insights and encoding used.


Insight 1: “Traffic violations are fairly constant whereas, overall crimes are increasing”




Observation: From this viz, traffic incidents seems to be fairly constant but, there is a rise in all other crimes from 2012 to 2016. The trend line depicts the same.

Visual encoding: This is a bar chart which helps to analyze the total crimes committed across years. Color encoding depicts different years and classified by traffic violations and crimes

Insight 2: “Sales peak from late Summer to Fall”



Observation: This viz gives the distribution of gross sales of Marijuana used for both Medical and recreational purpose with help of box plot. Across the years, the sales is on the rise  and attains peak sales during late summer to fall. Also, the sales are lower during the beginning of each year.

Visual encoding: Box plot is used to understand the distribution of data and inspect, if there is a pattern in marijuana sales. Upper and Lower whisker values are used to infer the variation in marijuana sales against each month after legalization.







Insight 3: “Crimes related to marijuana have dropped due to legalization”


Observation: As traffic violations are fairly constant, categories of other crimes needs to be analyzed. Applying small multiples, the graph displays a drill-down analysis and percentage difference of each categorical crimes. Different categories of crimes have its own trends, but focus is on drug-alcohol category since marijuana related crimes under drug-alcohol category. As per the visualization, drug-alcohol has been increasing but at decreasing rate.

Visual encoding: Color encoding is used to differentiate year and labelling helps us to understand the percent increase/decrease in a specific category of most recent year. Small multiple helps us to portray all the categories in a single pane.



Observation: Marijuana related crimes are categorized into 3 types - cultivation, possession and sell. There is a declining trend in possession, and there is no sharp increase/decrease in sales & cultivation.

Visual encoding: Bar graph gives a clear picture about the difference of crime committed between previous year and current year. Color is encoded for year. The charts are organized by types of marijuana related crime committed.




Observation: The dashboard is used to compare crimes committed per $1,000 sales during pre-legalization and post-legalization period. During the pre-legalization period, there is a spike in crimes committed and then during the end of the pre-legalization era, its fairly constant. Whereas, there is a decrease in crimes committed, since the begin of legalization, despite increase in sales. This portrays that Marijuana legalization has minimal effect on Marijuana crime committed.

Visual encoding: Dual axis plotting is used to depict the sales and crimes committed as a bar and line graph. Color encoding is used on sales bars to differentiate the legalization period (orange- pre-legalization and green as post legalization). Dual axis is used to compare the two characters - sales and crime in a single frame.

Story

This story tells us how the characters - sales and crimes affect Marijuana legalization in Denver, Colorado. Colorado voted in favor of legalizing Marijuana for recreational use in 2012, but the law was made official in 2014. The story is narrated by analyzing how our characters influence the pre-legalization and post-legalization era. This story uses Logical Rhetoric for the impacts on legalization of Marijuana.  This story is based on the 3-act play model where we have a beginning, middle and an end. This story model has created a tension among the audience and resolves it by stating facts and guiding them towards our insights. Following is the story portrayed.

“Impacts of Marijuana legalization”

The story has analyzed the data related to marijuana, and narrate how marijuana sales and Marijuana crimes impact the decision of legalizing marijuana for recreational use.

Setting
Place: Denver, Colorado
Time: 2012 - 2016
Characters: Sales and Crime

·   The story begins by comparing sales and crimes committed/$1000 of sales during the pre-legalization period. Here we infer that after a sharp spike, the crimes committed is constant till the end of 2013.
·  Marijuana is used for both medical and recreational purpose. The gross sales, during the post-legalization period, are increasing since 2014 - when Marijuana became legal. As observed, the sales are highest during late summer to Fall.

·  It shows that the crimes committed is on the rise since 2012 till 2016. The challenge is to find whether it was a bad decision to legalize Marijuana for recreational purpose.
·  With the help of small multiples, the visualization explains the changes in categories of crimes committed. Since we are focused on Marijuana related crimes, so we observer the drug-alcohol category of crimes committed. Drug-alcohol category is increasing at a reducing rate annually. To further investigate, we now look at Marijuana crimes alone.
·  As we examine the visualization, there are three types of Marijuana related crimes- Marijuana-cultivation, Marijuana-possess and Marijuana-sell. By evaluating the change in Marijuana related crimes, it depicts a decline in the Marijuana-possession and a constant Marijuana-sell & Marijuana-cultivation crime. This shows that Marijuana crime is not on rise even though overall crime is increasing.


Story flow


 



Conclusion:

The comparison of crime rate of pre-legalization and post-legalization in Denver with respect to sales shows that Marijuana related crime is decreasing after legalization of Marijuana, although, the sales are on the rise. Finally, Crime related to marijuana is decreasing when marijuana is legalized.





Comments