ANLY482 AY2017-18T2 Group13 Analysis & Findings
Revision as of 13:56, 26 February 2018 by Taffy.cheow.2014
DATA CLEANING & MAPPING
Manipulation and Cleaning of Training Records
To rectify missing data points in the training records, we replaced the values wherever possible, as the team aims to analyze as many data points as it can. Where replacement is not possible, the records are excluded from the analysis.
Missing data (Field) | Action |
---|---|
Start and End date | 1. Attempted to find corresponding dates from records with the exact same course title; unsuccessful. 2. Attempted to use 'Create date'; inaccurate, as a record could have been created after the actual start and end dates. 3. Excluded (final action). |
Category | Replaced missing values with the category from records with the exact same course title |
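The two actions in the table above can be sketched in a few lines of Python. This is only an illustration, not the team's actual workflow: the field names (`course_title`, `category`, `start_date`, `end_date`) and the sample rows are hypothetical. Picking the most frequent category when a title has more than one, and dropping rows whose category cannot be filled, are assumptions based on the general rule stated above.

```python
from collections import Counter, defaultdict

# Hypothetical sample rows; the real export's field names may differ.
records = [
    {"course_title": "Fire Safety", "category": "Safety",
     "start_date": "2016-03-01", "end_date": "2016-03-02"},
    {"course_title": "Fire Safety", "category": None,
     "start_date": "2016-05-10", "end_date": "2016-05-11"},
    {"course_title": "Excel Basics", "category": "IT",
     "start_date": None, "end_date": None},
    {"course_title": "Leadership 101", "category": None,
     "start_date": "2017-01-09", "end_date": "2017-01-10"},
]


def clean_training_records(rows):
    """Fill missing categories by exact course title and drop unusable rows."""
    # Count the known categories for each exact course title.
    known = defaultdict(Counter)
    for row in rows:
        if row["category"]:
            known[row["course_title"]][row["category"]] += 1

    cleaned = []
    for row in rows:
        # Records without a start or end date are excluded (final action).
        if not row["start_date"] or not row["end_date"]:
            continue
        row = dict(row)  # copy so the input rows are left untouched
        if not row["category"]:
            candidates = known.get(row["course_title"])
            if not candidates:
                # Assumption: no replacement available, so exclude the row.
                continue
            # Assumption: use the most frequent category for this title.
            row["category"] = candidates.most_common(1)[0][0]
        cleaned.append(row)
    return cleaned


cleaned = clean_training_records(records)
```

In this sample, the second "Fire Safety" row inherits the category of the first, while the row without dates and the row with no matching title are dropped.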
Manipulation and Cleaning of Staff List
1. As the staff lists provided were in separate sheets, data fields were standardized and merged into one data set.
Previous field name | New field name | Sheet affected |
---|---|---|
Job Code | Job Title | Dec'13 & Dec'14 |
Job Group | Staff Group | Dec'14 |
Designation | Job Title | Dec'15, Dec'16, Dec'17 |
Staff Grade | Staff Group | Dec'15, Dec'16, Dec'17 |
Section | Department | Dec'15, Dec'16, Dec'17 |
Department | Location | Dec'14, Dec'15, Dec'16, Dec'17 |
2. Values within columns were also standardized to ensure consistency across the years.
Issue | Action taken | Sheet affected |
---|---|---|
Location column was missing from these sheets; location information was embedded in the department codes (e.g. EMOS7000) | Created a Location Group column in Excel, using a calculated field to extract the leading letters of the department code | Dec'13 & Dec'14 |
Department values were inconsistent with other years because they were combined with location; other sheets used a single descriptive field to specify departments (e.g. EMOS Shared Services) | Used the corresponding descriptive department name, found by matching employee numbers against other sheets | Dec'13 & Dec'14 |
Job Title values were inconsistent with other years and appeared as acronyms | Cross-referenced employee numbers against other years and the Training Records to find the corresponding job title | Dec'13 & Dec'14 |
Cost Centre, Citizenship and Serial Number fields were inconsistent across the years (e.g. an entire field missing) and had missing records that could not be matched with other years | As these fields were outside the project scope and the Sponsor advised excluding them, they were excluded from the data analysis entirely | Dec'13, Dec'14, Dec'17 |
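The staff list steps above were carried out in Excel; the following is a rough Python equivalent, offered only as a sketch. The rename mapping is taken from the field-name table, but the `Employee No` field name, the sample rows, and the helper names (`standardize`, `location_group`) are hypothetical. Only two sheets are shown for brevity.

```python
import re

# Field renames per sheet, taken from the field-name table above.
RENAMES = {
    "Dec'13": {"Job Code": "Job Title"},
    "Dec'14": {"Job Code": "Job Title", "Job Group": "Staff Group",
               "Department": "Location"},
}
LATER_SHEETS = {"Designation": "Job Title", "Staff Grade": "Staff Group",
                "Section": "Department", "Department": "Location"}
for sheet in ("Dec'15", "Dec'16", "Dec'17"):
    RENAMES[sheet] = LATER_SHEETS


def standardize(sheet, row):
    """Rename fields in a single pass and tag the row with its source sheet."""
    mapping = RENAMES.get(sheet, {})
    # One pass, so Section -> Department and Department -> Location do not clash.
    out = {mapping.get(field, field): value for field, value in row.items()}
    out["Sheet"] = sheet
    return out


def location_group(department_code):
    """Return the leading letters of a code such as 'EMOS7000', or None."""
    match = re.match(r"[A-Za-z]+", department_code or "")
    return match.group(0) if match else None


# Hypothetical sample data; 'Employee No' is an assumed field name.
sheets = {
    "Dec'13": [{"Employee No": "1001", "Job Code": "ENG",
                "Department": "EMOS7000"}],
    "Dec'15": [{"Employee No": "1001", "Designation": "Engineer",
                "Staff Grade": "Executive",
                "Section": "EMOS Shared Services", "Department": "EMOS"}],
}

# Merge all sheets into one data set with standardized field names.
merged = [standardize(sheet, row)
          for sheet, rows in sheets.items() for row in rows]

# Derive Location Group where the Location column is missing.
for row in merged:
    if "Location" not in row:
        row["Location Group"] = location_group(row.get("Department"))

# Cross-reference employee numbers against later sheets to replace
# coded department names and acronym job titles in the early sheets.
early = ("Dec'13", "Dec'14")
reference = {row["Employee No"]: row
             for row in merged if row["Sheet"] not in early}
for row in merged:
    match = reference.get(row["Employee No"])
    if row["Sheet"] in early and match:
        row["Department"] = match["Department"]
        row["Job Title"] = match["Job Title"]
```

The location group is derived before the department name is overwritten, since the cross-reference step replaces the coded value that the prefix is extracted from.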
INSIGHTS
To be updated
ANALYSIS
To be updated