Difference between revisions of "IS428 2016-17 Term1 Assign2 Yang Chengzhen"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
Line 37: Line 37:
 
= Data Visualization and Findings =
 
= Data Visualization and Findings =
  
==How does injuries distribute in different industries?==
+
==1.How does injuries distribute in different industries?==
 
===Overview with Major/Minor injury distribution===
 
===Overview with Major/Minor injury distribution===
 
* Industry has many categories. In order to have a intuitive view of the distribution of injury, I used treemap to visualize the data.<br>
 
* Industry has many categories. In order to have a intuitive view of the distribution of injury, I used treemap to visualize the data.<br>
Line 48: Line 48:
 
* '''From the graph, we can tell CUT_BRUISES and CRUSHING are the 2 major nature of the injury cases of all the industry. Noticeably, Marine has very high "Multiple Injuries" rate and Mining has high "CONCUSSION" rate. This may be caused by the unique characters of the industries.'''
 
* '''From the graph, we can tell CUT_BRUISES and CRUSHING are the 2 major nature of the injury cases of all the industry. Noticeably, Marine has very high "Multiple Injuries" rate and Mining has high "CONCUSSION" rate. This may be caused by the unique characters of the industries.'''
 
[[File:Mosaic type-industry.png]]
 
[[File:Mosaic type-industry.png]]
 +
 +
==2.What characteristics do the victim groups have?==
 +
* We create age bin groups to find out the age distribution and add color code to differentiate genders
 +
'''* From the graph, we can tell most of the victims are male and with age 25-30. This may because the "high injury rate" industries mainly hires male employees.'''
 +
[[File:Victim-gender -age.png]]
  
 
= Implementation =
 
= Implementation =

Revision as of 14:20, 25 September 2016

Description

Abstract

Workplace Safety and Health has always been one of the top concern of Ministry of Manpower. To better manage MSH, MOM established Workplace Safety and Health (WSH) Council on 1 Apr 2008. WSH Concil coordinates with MOM to conduct research on workplace safety and health performance in Singapore. The WSH Institute conducts quality applied research and provides evidence-based information to Ministry of Manpower, WSH Council and industry stakeholders to improve WSH practices in Singapore. The data used were collated from incident reports made by employers, occupiers and medical practitioners.
This Assignment will generate analysis utilizing WSHI work place injuries data in year 2014 to gain insights of workplace safety and health situation in 2014.

Theme of Interest

In order to be more efficient in improving workplace safety and health situation, it is essential to find out the 'Vulnerable groups' and characteristics tied with different groups. Therefore MOM can provide customized plans in order to prevent the happens of injuries and react to injuries faster and smarter.
This assignment aims to find out the major vulnerable groups and peak period of workplace injury cases to facilitate MOM's customization of planning and policy-making.

Research Question

Data Preparation

Attribute Selection

There are 48 columns in the raw data. After scanning through, I selected some attributes which are more relevant in identifying victim groups:

  • Accident Type Level 1 Category
  • Accident Type Level 2 Category
  • Accident Date
  • Nature of Injury
  • Major Industry
  • Sub Industry
  • Victim's Age
  • Victim's Gender
  • Informant's No of employees
  • Injured When Working Overtime

Data Preparation

Check Missing Values

Use JMP to find missing data pattern relating to the attributes selected in the previous section, only one missing data is found. I decided to ignore the record since it has no significant impact to the analysis. Cz-Missing-1.png

Check Distribution of Numerical Attribute

Cz-age-distribution.png Cz-noof employees.png Cz-mcdays.png

Data Visualization and Findings

1.How does injuries distribute in different industries?

Overview with Major/Minor injury distribution

  • Industry has many categories. In order to have a intuitive view of the distribution of injury, I used treemap to visualize the data.
  • From the treemap, we can tell "Construction" has the highest injury cases among all the industries, and it has higher proportion of major injury as well. The other 'high injury cases' industries are Metalworking, Accommodation and Wholesale&Retail Trade. The proportion of major injury cases is relatively high as well.

Industry-injury type-explanation.png Major-sub-industry.png

Further Explore:Nature of Injury

  • By Using mosaic plot, we are able to see the distribution of nature of injury by color coding. We can check the detailed frequency percentage by mouse over to the corresponding bars
  • From the graph, we can tell CUT_BRUISES and CRUSHING are the 2 major nature of the injury cases of all the industry. Noticeably, Marine has very high "Multiple Injuries" rate and Mining has high "CONCUSSION" rate. This may be caused by the unique characters of the industries.

Mosaic type-industry.png

2.What characteristics do the victim groups have?

  • We create age bin groups to find out the age distribution and add color code to differentiate genders

* From the graph, we can tell most of the victims are male and with age 25-30. This may because the "high injury rate" industries mainly hires male employees. Victim-gender -age.png

Implementation

Findings

Conclusion

Comments