Difference between revisions of "ANLY482 AY2017-18T2 Group13 Analysis & Findings"

From Analytics Practicum
Jump to navigation Jump to search
Line 71: Line 71:
 
|-
 
|-
 
| Department || Location || Dec '14, Dec'15, Dec'16, Dec'17
 
| Department || Location || Dec '14, Dec'15, Dec'16, Dec'17
 +
|}
 +
 +
2. Standardization of variables within columns were also made to ensure consistency throughout the years.
 +
{| class="wikitable"
 +
|-
 +
! Issue !! Action taken !! Sheet affected
 +
|-
 +
| Location column missing from sheets and location information was combined in the naming assigned to the departments (e.g. EMOS7000) || Location Group column created to extract the front letters from the department in Excel with a calculated field || Dec'13 & Dec '14
 +
|-
 +
| Department variables inconsistent with other years as it was combined with location. Other sheets used a single descriptive field to specify departments (e.g. EMOS Shared Services) [SNAPSHOT] || Used corresponding descriptive department name based on matching of employee number to other sheets || Dec’13 & Dec'14
 +
|-
 +
| Variables as Job Title was inconsistent with other years and were in the  form of acronyms|| Cross referenced employee number to other years and Training Records for corresponding job title || Dec'13 & Dec ‘14
 +
|-
 +
| Variables in Cost Centre , Citizenship and Serial Number fields were inconsistent over the years (e.g. Missing entire field)  and  had missing records that could not be matched with other years|| As these fields were not within the scope and were advised by the Sponsor to exclude, these fields were excluded from the data analysis entirely || Dec'13, Dec'14, Dec'17
 
|}
 
|}
  

Revision as of 13:56, 26 February 2018

OPlytics Logo.png

Home-icon.png Home

Overview icon.png Project Overview

Idea icon.png Analysis & Findings

Project mgt-icon.png Project Management

Documentation icon.png Documentation

Button 4 rewind.png Main Page


DATA CLEANING & MAPPING


Manipulation and Cleaning of Training Records

Using JMP Pro, missing data pattern analysis was conducted on training records to identify missing data points.

In order to rectify these missing data points, we replaced the values whenever possible as the team hopes to analyze as many data points as possible. When replacement of values are not possible, records are excluded from analysis.

Missing data (Field) Action
Start and End date 1. Attempted to find corresponding dates from exact course titles, unsuccessful

2. Attempt to take 'Create date' however inaccurate as record could have been created after actual start and end date

3. Excluded (Final action)

Category Replaced missing data with corresponding records with exact course title

Manipulation and Cleaning of Staff List

1. As the staff list provided were in separate sheets, data fields were standardized and merged into 1 data set.

Previous field name New field name Sheet affected
Job Code Job Title Dec'13 & Dec '14
Job Group Staff Group Dec'14
Designation Job Title Dec'15, Dec'16, Dec'17
Staff Grade Staff Group Dec'15, Dec'16, Dec'17
Section Department Dec'15, Dec'16, Dec'17
Department Location Dec '14, Dec'15, Dec'16, Dec'17

2. Standardization of variables within columns were also made to ensure consistency throughout the years.

Issue Action taken Sheet affected
Location column missing from sheets and location information was combined in the naming assigned to the departments (e.g. EMOS7000) Location Group column created to extract the front letters from the department in Excel with a calculated field Dec'13 & Dec '14
Department variables inconsistent with other years as it was combined with location. Other sheets used a single descriptive field to specify departments (e.g. EMOS Shared Services) [SNAPSHOT] Used corresponding descriptive department name based on matching of employee number to other sheets Dec’13 & Dec'14
Variables as Job Title was inconsistent with other years and were in the form of acronyms Cross referenced employee number to other years and Training Records for corresponding job title Dec'13 & Dec ‘14
Variables in Cost Centre , Citizenship and Serial Number fields were inconsistent over the years (e.g. Missing entire field) and had missing records that could not be matched with other years As these fields were not within the scope and were advised by the Sponsor to exclude, these fields were excluded from the data analysis entirely Dec'13, Dec'14, Dec'17


INSIGHTS


To be updated


ANALYSIS


To be updated