Difference between revisions of "IS428 2016-17 Term1 Assign2 Liang Bing"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
Line 51: Line 51:
 
'''Tree Map in Tableau:'''
 
'''Tree Map in Tableau:'''
 
Firstly create create a measure by dragging the "nature of injury" attribute to the size tab, select count as the measure. Drag "nature of injury" to the color and label tab follow by dragging the "occupation lower case" to the label tab and to the Filters Card. Select Tree Map from the Show Me Card, the Tree Map showing each nature of injury in different colors with different sizes of boxes representing number of victims in various occupations is drawn.<br>
 
Firstly create create a measure by dragging the "nature of injury" attribute to the size tab, select count as the measure. Drag "nature of injury" to the color and label tab follow by dragging the "occupation lower case" to the label tab and to the Filters Card. Select Tree Map from the Show Me Card, the Tree Map showing each nature of injury in different colors with different sizes of boxes representing number of victims in various occupations is drawn.<br>
 +
[[File:Wid_lb_ass2_visualization1treemaptab.png||600px||center]]<br>
 +
 +
In this graph, the size of the color boxes clearly reflects the number of victims for each type of injury. The size of boxes in each color boxes represents the share of victim counts for different occupations. The larger the size, the more victims in this job is getting that kind of injury. When hovering the mouse on each box, the nature of injury. the occupation type and total number of victim for the box will be shown. <br>
 
[[File:Wid_lb_ass2_visualization1treemaptab.png||600px||center]]<br>
 
[[File:Wid_lb_ass2_visualization1treemaptab.png||600px||center]]<br>
  

Revision as of 21:34, 24 September 2016

Abstract

In order to put Singapore's workplace safety and health (WSH) performance on par with the world's leading countries, The Workplace Safety and Health (WSH) Institute is established to collaborate with MOH to work on researching, as well as data collecting and analyzing to understand the current and emerging work environment in Singapore, and use the knowledge discovered to develop solutions for improving WSH practices. One of the field of concern is the workplace injury. This report will utilize the workplace injury data provided by WSHI to explore insights which could help find trends and understand the workplace injury situation in 2014.

Theme of Interest

Workplace injury has always been a concern of WSH. Workplace injury can be seen as the accidents happen to workers in their working environment. Some job s are naturally more dangerous than the normal office jobs. Also, certain group of people will have higher chance to get injured during work compare to others. Therefore, finding out the victims' characteristics which has high probability to have injured incidents is very important for WSHI and stakeholders to understand the risk in their workplace and improve their management accordingly. This hence is the main focus of this report.

Questions for Investigation

The following are the questions help to through the data exploration process to determine the characteristics of workplace injury victims that should concern about:

  1. What is the pattern of nature of injury for various occupations.
  2. Work overtime and cause of injury (self/external)
  3. Percentage of manual work and nature of injury.

Understanding the Data

Data Attributes

There are a total of 5650 rows of data with 48 attributes(Columns)per row.
Wid lb ass2 rowscol.png
The figure below shows all the 48 attributes provided in the WID excel file:
Wid lb ass2 48attributes.png

Data Selection & Preparation

Not all of the 48 data attributes are useful in providing information needed by this report. For example, the informant's information and employer's information are not the main concern of this report:
Wid lb ass2 proneanyway.png
After using JMP and Microsoft Excel to examine all the data attributes, irrelevant columns are removed and the rest are the data attributes which could add value to the report analysis.

Below are the data attributes remained:
Wid lb ass2 pruneleft2.png

After pruning the irrelevant data, some of the nominal data should be standardized in terms of the letter cases. For example, for the Occupation Column, the same word "cook" is recorded in both upper case"COOK" and lower case "cook". The standardization process uses JMP's formula function:
1. Create a new column called "Occupation Lower Case", right click on the column header to find the formula tab:
Wid lb ass2 cookupperlower2.png
2. Drag the Occupation attribute from Table Columns into the workspace , and then select Lower Cases function from the Character option, click "ok":
Wid lb ass2 formulalowercase2.png
3. The new column "Occupation Lower Case" is created:
Wid lb ass2 newoccupationlowercase.png

Moreover, there are typos in the Occupation column which resulted in meaningless occupation categories being created. e.g. the "cleaner" typo.
Wid lb ass2 cleanertypo.png
Also, the non-standardized occupation words have resulted in missing count of same kind of occupations. e.g. the "assembler" & "assemblers" means the same occupation obviously.
Wid lb ass2 assembler.png
I have exported the pruned data as excel file from JMP to filter through and clean these data with typo and minor differences. After few rounds of filtering and fixing, the data is read for use.

Data Exploration and Findings

Visualization 1 : The share for the occupations in each nature of injury

There are two attributes involved in this question: the nature of injury and the occupations. To find out the share for 1000+ occupations in each of the nature of injury, both of the Tree Map in Tableau and Mosaic Graph in JMP are suitable for illustrating the share pattern for this huge amount of attribute values.

Tree Map in Tableau: Firstly create create a measure by dragging the "nature of injury" attribute to the size tab, select count as the measure. Drag "nature of injury" to the color and label tab follow by dragging the "occupation lower case" to the label tab and to the Filters Card. Select Tree Map from the Show Me Card, the Tree Map showing each nature of injury in different colors with different sizes of boxes representing number of victims in various occupations is drawn.

Wid lb ass2 visualization1treemaptab.png


In this graph, the size of the color boxes clearly reflects the number of victims for each type of injury. The size of boxes in each color boxes represents the share of victim counts for different occupations. The larger the size, the more victims in this job is getting that kind of injury. When hovering the mouse on each box, the nature of injury. the occupation type and total number of victim for the box will be shown.

Wid lb ass2 visualization1treemaptab.png


Visualization 2 : The injury pattern for various occupations

Visualization 3 : The injury pattern for various occupations

Interactive Data Visualization

Tools

  1. Excel:
  2. JMP:
  3. Tableau:

References

WSH Institute Official Website