IS428 2016-17 Term1 Assign2 Chua Shan Yong James
Contents
Abstract
Workplace injuries have always been a concern in Singapore as the fatalities and accident rates number are worrying. Safety protocols have been in placed and perhaps adhered to but injuries are still unavoidable. This topic has therefore caught my interest to work on as i hope to find out trends in specific industries, age, body parts injured etc and the relationship between them.
Theme of Interest
The theme of interest for workplace injuries that i will be working on is identifying and analysing the contributing factors to workplace injuries and the relationship between each of these factors.
Questions for investigation
- Which industry has the highest rate of workplace injuries?
- Which are the body parts that got injured the most number of times?
- Which gender suffer more injuries?
- Is it justified in each industries?
- Are the older workers more careful than the younger workers? Or vice versa?
- For each industry, is the above finding justified?
- Does experience make a worker more slack and negligent about workplace safety (months worked)?
Identifying appropriate attributes
With reference to the theme of interest and questions for investigation that i have came up with, i needed the following attributes to carry out my analysis :
- Body parts injured
- Victim's gender
- Victim's age
- Months worked
- Sub industry
- Informant's number of employees
Transformations/Rearrangements of dataset
After removing all the redundant attributes using excel, i proceed to use the data in tableau.
To find out which industry has the highest rate of injuries, i created a calculated field - % of injured employees (out of total employees). I created this field as i can't just use the total number of records as it will be unfair since construction has a high number of workers and also more records but what i am more interested is the rate. therefore, i use the number of records / informants total employees and expressed it as a percentage.
Using a treemap, i used this calculated field and sub industry to identify to show the rate of injured workers in each sub industry. The reasons i used a treemap are because there are many sub industries, it is easy to understand and identify immediately which size of the rectangle is the biggest and therefore having the highest rate and it provides an overview and summary against the other sub industries.
For the distribution of both the body parts injured and gender, i used bar chart as it is simple and easy to understand.
Visualization
Tools Utilized
- Excel 2013 for data preparation
- Tableau for visualization