ANLY482 AY2017-18 T1 Group2 Project EZLin Scope
Phase 0: Understanding Data & Supply Chain Context
Upon obtaining the raw data from the sponsor, we will work with the client to understand the data in terms of how the client stores information in its system. This includes mapping the client’s entire supply chain flow and process, and learning the supply chain terminology that the company uses. We will then add or edit variables and categories to give the data better context and deepen our understanding of it.
Phase 1: Data Cleaning & Exploration
Given the raw form of the data, we will need to clean it before running any form of exploration or analysis. This includes, but is not limited to, the following steps:
- Checking for anomalies and outliers
- Deciding on action for missing data
- De-duplicating any fields
- Standardising and normalising data
- Deciding on and documenting any assumptions made
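The cleaning steps above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the field names (`order_id`, `factory`, `cost`), the sample rows, and the 1.5 × IQR outlier rule are all hypothetical, and the sponsor's actual data will dictate the real rules. Here de-duplication is applied to whole records, and outliers are flagged rather than dropped.

```python
from statistics import quantiles

# Hypothetical raw records; field names and values are illustrative only.
raw_rows = [
    {"order_id": "A1", "factory": " Plant A ", "cost": "120.5"},
    {"order_id": "A1", "factory": " Plant A ", "cost": "120.5"},  # exact duplicate
    {"order_id": "A2", "factory": "plant a", "cost": ""},         # missing cost
    {"order_id": "A3", "factory": "PLANT B", "cost": "118.0"},
    {"order_id": "A4", "factory": "Plant B", "cost": "122.0"},
    {"order_id": "A5", "factory": "Plant B", "cost": "119.0"},
    {"order_id": "A6", "factory": "Plant A", "cost": "9000"},     # suspicious value
]


def clean(rows):
    """Return (cleaned_rows, assumptions_log) for a list of raw dict rows."""
    seen, cleaned, assumptions = set(), [], []

    for row in rows:
        # Standardise text fields and convert cost to a number (None if blank).
        record = {
            "order_id": row["order_id"].strip(),
            "factory": row["factory"].strip().title(),
            "cost": float(row["cost"]) if row["cost"].strip() else None,
        }
        # De-duplicate on the full standardised record.
        key = tuple(record.items())
        if key in seen:
            continue
        seen.add(key)
        cleaned.append(record)

    # Missing data: keep the rows, but document the decision.
    missing = [r["order_id"] for r in cleaned if r["cost"] is None]
    if missing:
        assumptions.append(f"Missing cost kept as None for: {missing}")

    # Outliers: flag values outside 1.5 * IQR of the observed costs.
    costs = [r["cost"] for r in cleaned if r["cost"] is not None]
    q1, _, q3 = quantiles(costs, n=4, method="inclusive")
    low, high = q1 - 1.5 * (q3 - q1), q3 + 1.5 * (q3 - q1)
    for r in cleaned:
        r["is_outlier"] = r["cost"] is not None and not (low <= r["cost"] <= high)
    assumptions.append(f"Outliers flagged outside [{low}, {high}] (1.5 * IQR rule)")

    return cleaned, assumptions


cleaned_rows, assumptions_log = clean(raw_rows)
```

Keeping an explicit assumptions log alongside the cleaned rows is one simple way to meet the last step in the list, since every cleaning decision is recorded where it is made.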
Only after cleaning the dataset will we be able to conduct Exploratory Data Analysis (EDA) on it, which includes, but is not limited to:
- Plotting the raw data
- Examining the distribution of the variables
- Studying the relationships between explanatory variables
- Conducting cross-sectional and longitudinal analysis with the different factory locations
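As a rough sketch of the EDA steps above, the Python below summarises a variable per factory (cross-sectional) and per month (longitudinal), and computes a Pearson correlation between two variables. The records, field names (`factory`, `month`, `units`, `cost`) and helper names are assumptions made for illustration; they are not taken from the sponsor's data.

```python
from collections import defaultdict
from math import sqrt
from statistics import mean, median

# Hypothetical cleaned records; field names and values are illustrative only.
records = [
    {"factory": "Plant A", "month": "2017-01", "units": 100, "cost": 1200.0},
    {"factory": "Plant A", "month": "2017-02", "units": 120, "cost": 1400.0},
    {"factory": "Plant B", "month": "2017-01", "units": 80, "cost": 1000.0},
    {"factory": "Plant B", "month": "2017-02", "units": 90, "cost": 1100.0},
]


def summarise_by(rows, key, value):
    """Group rows by `key` and describe the distribution of `value` per group."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row[value])
    return {
        group: {
            "n": len(vals),
            "mean": mean(vals),
            "median": median(vals),
            "min": min(vals),
            "max": max(vals),
        }
        for group, vals in groups.items()
    }


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


# Cross-sectional view: compare factories over the whole period.
cost_by_factory = summarise_by(records, "factory", "cost")
# Longitudinal view: follow the same variable across months.
cost_by_month = summarise_by(records, "month", "cost")
# Relationship between two variables.
units_cost_corr = pearson(
    [r["units"] for r in records], [r["cost"] for r in records]
)
```

The same `summarise_by` helper serves both views because the only difference between them is the grouping key, which keeps the comparison across factories and across time consistent.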
Phase 2: Automating the Data Retrieval Process and Creating an Application
Once we have examined the variables in the dataset and clearly understood the relationships between them, we will proceed to develop a model for our client, first to automate its data retrieval process. Given that the current process is manual, it is important to automate the retrieval and cleaning of data so that the company can conduct its initial analysis. After that, we will seek to create an application that gives the client a visual representation of its supply chain flow and the costs incurred at each point of the supply chain.
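To make the idea of "cost incurred at each point of the supply chain" concrete, the sketch below totals cost per stage and renders a simple text bar per stage. The stage names, cost figures and function names are hypothetical placeholders, and a real application would likely use a proper charting tool; this only illustrates the aggregation that such a view would rest on.

```python
from collections import defaultdict

# Hypothetical supply chain stages, in flow order (illustrative only).
STAGE_ORDER = ["Procurement", "Manufacturing", "Warehousing", "Distribution"]

# Hypothetical cost line items already retrieved and cleaned.
cost_items = [
    {"stage": "Procurement", "cost": 400.0},
    {"stage": "Procurement", "cost": 100.0},
    {"stage": "Manufacturing", "cost": 1000.0},
    {"stage": "Warehousing", "cost": 250.0},
    {"stage": "Distribution", "cost": 250.0},
]


def cost_by_stage(items, stage_order):
    """Total cost per stage, returned in supply chain flow order."""
    totals = defaultdict(float)
    for item in items:
        totals[item["stage"]] += item["cost"]
    # Stages with no line items still appear, with a total of 0.0.
    return [(stage, totals.get(stage, 0.0)) for stage in stage_order]


def render_bars(stage_totals, width=40):
    """Render one text bar per stage, scaled to the largest stage total."""
    largest = max(total for _, total in stage_totals) or 1.0
    lines = []
    for stage, total in stage_totals:
        bar = "#" * int(total / largest * width)
        lines.append(f"{stage:<14} {bar} {total:.2f}")
    return lines


stage_totals = cost_by_stage(cost_items, STAGE_ORDER)
for line in render_bars(stage_totals):
    print(line)
```

Returning the totals in flow order, rather than sorted by size, keeps the output aligned with the supply chain map produced in Phase 0.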