IS428 2016-17 Term1 Assign2 Lim Hui Ting

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search

Theme of Interest

1.Background

Recently, the issue of insufficient medical leave given to injured workers has come under the spotlight again. In May this year, the court has suspended Raffles Hospital orthopaedic surgeon Wong Him Choon, 51, for six months. He had operated on a construction worker’s broken hand, but gave him just two days' medical leave and certified him fit for light duties for a month after that (Ho, 2016).

In August this year, the straits times reported that a shipyard worker has to had his right index finger amputated after it was crushed by some falling metal. However, he was only given one day of medical leave and three months of light duties (Ho, 2016).

Insufficient medical leave given to injured workers is a widespread problem in the construction industry. Employers usually influence doctors to give injured workers shorter medical leave in order to avoid reporting the accident to the Ministry of Manpower(MOM). Employers can also maintain their clean safety record and avoid paying higher insurance premiums in the future. (Foo, 2016)

Since injured employees are entitled to sufficient medical leave on proper clinical grounds, this assignment aims to analyse the distribution of the duration of the medical leave given to injured employees not just in the construction industry but the other industries in Singapore.

2. Questions for Investigation:

1)When do the accidents usually occur?
2)What are the victims’ characteristics?
3)What is the distribution of the number of MC days given to injured employees?

Tools Utilised

1.JMP Pro
2.Tableau

Data Preparation

1. Irrelevant Columns Removed

Columns Removed Reasons
  • Informant’s Company Postal Code(District)
  • Occupier’s Name
  • Org SSIC
  • 3-digit SSIC
  • 2-digit SSIC
Analysis is not part of the objective of this assignment
  • Accident Date
  • Accident Weekday No
  • Accident Month
  • Victim Employment Start Mth
Not required as the same information can be taken from other columns
  • Accident Agency Level 1
  • Accident Agency Level 2
Information is not useful as they are numerical representations of the descriptions set by MoM


2. Reformat All Text Columns

Reformat1.png
Reformat2.png
Reformat3.png

From the raw data, we can see that some columns are in uppercase whereas some columns are in lowercase. Furthermore, as for the Occupation column, some rows are in uppercase,lowercase or titlecase. Thus, it is better to standardise all the text columns to titlecase in order for a neater presentation of findings. This is done by using the Titlecase function in the column’s formula in JMP.

3. Removal of Underscores in Text Columns

Underscores1.png

From the nature of injury column and body parts injured column, underscores are being used. In order for a neater presentation of findings, these underscores are removed using the substitute function in the column’s formula.

4. Recode Occupation Column

Occupation1.png

For the Occupation column, some occupations relate to the same thing but are being recognised as different occupation due to spelling errors, spacing and singular/plural forms. Thus, these are being corrected using the recode function in JMP. For example as seen in the screenshot above, construction worker is being spelt differently such as “construction wokrer” or “constrction worker”. “Cook” and “Cooks” refer to the same occupation just that one is in singular form and another is in plural form.

5. Removal of Missing Records

Occupation1.png

Using the Missing Data Pattern function in JMP, there are a total of 47 rows with missing values in certain columns. As the numbers are not significant, I have deleted these rows instead of imputing a value for them.

6. Reformat Time Column to a Format Readable by Tableau

When the clean dataset is being imported in Tableau, Tableau recognise the Accident Time column as a character column instead of time format. Attempts to change the format directly from Tableau did not work. Thus, a calculated field is being created to change the format of this column and to change the timing format from 12-hour to 24-hour timing. The formula being used is as follows:

IF RIGHT([Accident Time],2)="PM" AND NOT LEFT([Accident Time],2)="12" THEN MAKETIME(INT(LEFT([Accident Time],FIND([Accident Time],":",1)-1))+INT(12),INT(MID([Accident Time],FIND([Accident Time],":",1)+1,2)),INT(00)) ELSEIF RIGHT([Accident Time],2)="AM" AND LEFT([Accident Time],2)="12" THEN MAKETIME(INT(0),INT(MID([Accident Time],FIND([Accident Time],":",1)+1,2)),INT(00)) ELSE MAKETIME(INT(LEFT([Accident Time],FIND([Accident Time],":",1)-1)),INT(MID([Accident Time],FIND([Accident Time],":",1)+1,2)),INT(00)) END

When do the Accidents Usually Occur?

Overview of Workplace Injuries in 2014

Overview1.png
Overview2.png

In 2014, January has the least number of injured workers. The number of injured workers between March to September is relatively the same and dips between November and December. The number of injured workers are relatively the same between Monday to Saturday. Sunday has the least number of injured workers. Monday, Wednesday and Sunday have three peaks on the graph whereas Tuesday, Thursday Friday and Saturday has two peaks on the graph. Employees tend to be injured between 10-11am and 2-4pm.

What are the Victims' Characteristics?

1 Employees' Age

Age1.png

From the histogram, most of the victims are between 25-30 years old.

2 Employees' Gender

Age1.png

From the bar chart, there are more male employees injured than female employees in 2014.

3 Employment Tenure

Age1.png

From the histogram, where the number of bins are set at two months, we can see that most of the injured employees are new comers who have worked at the company for less than 6 months. Thus, this means that less experience employees have a higher tendency to be injured at the workplace.

4 Percentage of Manual Work

Manual1.png

The bar chart above that shows that jobs that requires more manual work has more injured employees.

5 Type of Industry and Occupation

Industry1.png

From the bar graph, it seems like others industry has the most no. of employees injured. However, this graph can be misleading as the others industry are made up of many industries which could be a possible reason why it has the highest no. of employees injured.

Industry2.png
Industry3.png

By further subdividing the major industry into sub industry, we are able to see that construction industry has the most number of accidents followed by the others industry and metalworking industry.

As there are many types of occupations available, those occupations that have less than 10 injured employees are being filtered out from the Word Cloud. As such, we can see that Construction Worker is the most hazardous occupation as it has the highest number of injured workers followed by Cleaner and Driver.

6 Type of Company

Company1.png

As only the number of employees of the informants are given, we need to find out if the informant’s company is usually the same as the employer’s company. By using if-else where 1 indicates that informant company is the same as the employer company and 0 indicates that they are not the same. From the barchart, most of the Informant company is the same as the employer company. Thus, the no. of employees in the informant company will be used as an estimation of the size of the employer company.
Using the number of informant’s employee as the size of the treemap and the number of injured workers as the colour intensity, a treemap is created as shown below.

Company2.png

From the treemap, it seems that for construction industry, the companies that are smaller in size tend to have more injured employees whereas the relationship between company size and the number of injured employees are not that clear for manufacturing and others industry. For the others industry, company EM0205 has 10 injured workers in 2014. For manufacturing industry, EM1412 has 4 injured workers in 2014. For the construction industry, EM0187 has 9 injured workers in 2014.

What is the Distribution of the Number of MC Days Given to Injured Employees?

Final Deliverables

References