ANLY482 AY2017-18T2 Group11 Project Overview Old
HOME | PROJECT OVERVIEW | ANALYSIS & INSIGHTS | PROJECT MANAGEMENT | DOCUMENTATION | ANLY482 MAIN |
Updated | Previous |
---|
Motivation |
Technology has opened the floodgates for users globally to amass and transmit data, providing millions with unprecedented opportunities and benefits. This has given rise to the field of Data Analytics, which has become an important tool for companies to improve the efficiency in their operations. However, some companies are still lacking data analysis capabilities and often find it time-consuming to visualise, modify and interrogate data – especially when data is arising from more than one source such as machinery sensor, mobile devices, wearables, weblogs etc.
While big data presents opportunities for many companies to leverage on, it requires a certain level of technical skill in order to successfully capitalise on this opportunity. The lack of technical capabilities within the company to derive operational solutions from data is a problem that resonates strongly with the sponsor company, much like many others. With the company slowly gaining foothold in the industry, it is imperative for the company to enhance itself by analysing its data and obtain solutions to counter its high operational costs.
Data Provided |
The data provided is obtained from Company ABC’s database server and this database is updated whenever parcels are being received and delivered. The data given will be 2 years’ worth of delivery data in the Central and Western parts of Singapore and this adds up to approximately 750,000 rows of data points. It will contain details such as the delivery address, quality, date, etc…
Project Objective & Goal |
Based on the problem statement our sponsor have given to us, we have derived 3 main objectives for this project. The 3 objectives are:
- Identify other possible ways to minimise operational costs for the company
- Identify the optimal number of Drivers that Company ABC would require
- Minimise failed delivery by identifying erroneous forms before goods are being dispatched
The objectives and problems listed can be summarised as following :
Methodology |
To provide operational recommendations from the given dataset, we will thoroughly examine the dataset via the following four-step approach:
1. Data Exploratory
As the dataset is provided in Excel format, little data preparation is required by the team. Following which, the team would use methods such as summary statistics, to determine if there are any inconsistencies, missing and invalid values in the dataset.
2. Data Cleaning
As errors such as outliers and invalid values could lead to inaccurate results, the data must be cleaned to ensure that it is suitable for further analysis. Based on the dataset, the two most probable data errors are inconsistency data and missing or invalid values.
3. Data Analysis
After cleaning up the relevant data, an in-depth analysis will be performed on the data to gain meaningful insights. Based on preliminary discussion, we will be looking into these 4 analytical methods in analysing the data:
4. Data Visualisations
Lastly, we will also be looking at creating a dashboard with the following visualisations which will ultimately help the team present its recommendation. Some of the visualisations that will be derived are:
Technology Used |