IS428 2016-17 Term1 Assign2 Chua Shan Yong James

From Visual Analytics for Business Intelligence
Revision as of 10:12, 25 September 2016 by James.chua.2013 (talk | contribs)
Jump to navigation Jump to search

Abstract

Workplace injuries have always been a concern in Singapore as the fatalities and accident rates number are worrying. Safety protocols have been in placed and perhaps adhered to but injuries are still unavoidable. This topic has therefore caught my interest to work on as i hope to find out trends in specific industries, age, body parts injured etc and the relationship between them.

Theme of Interest

The theme of interest for workplace injuries that i will be working on is identifying and analysing the contributing factors to workplace injuries and the relationship between each of these factors.

Questions for investigation

  1. Which industry has the highest rate of workplace injuries?
  2. Which are the body parts that got injured the most number of times?
  3. Which gender suffer more injuries?
    1. Is it justified in each industries?
  4. Are the older workers more careful than the younger workers? Or vice versa?
    1. For each industry, is the above finding justified?
  5. Does experience make a worker more slack and negligent about workplace safety (months worked)?

Identifying appropriate attributes

With reference to the theme of interest and questions for investigation that i have came up with, i needed the following attributes to carry out my analysis :

  1. Body parts injured
  2. Victim's gender
  3. Victim's age
  4. Months worked
  5. Sub industry
  6. Informant's number of employees

Transformations/Rearrangements of dataset

After removing all the redundant attributes using excel, i proceed to use the data in tableau.

To find out which industry has the highest rate of injuries, i created a calculated field - % of injured employees (out of total employees). I created this field as i can't just use the total number of records as it will be unfair since construction has a high number of workers and also more records but what i am more interested is the rate. therefore, i use the number of records / informants total employees and expressed it as a percentage.

Using a treemap, i used this calculated field and sub industry to identify to show the rate of injured workers in each sub industry. The reasons i used a treemap are because there are many sub industries, it is easy to understand and identify immediately which size of the rectangle is the biggest and therefore having the highest rate and it provides an overview and summary against the other sub industries.

Treemap.png

For the distribution of both the body parts injured and gender, i used bar chart as it is simple and easy to understand.

James - Body Parts.png

James - Gender.png

James - Experience.png

James - Age.png

Visualization

Tools Utilized

  1. Excel 2013 for data preparation
  2. Tableau for visualization