Hiryuu Methodology

From Analytics Practicum
Revision as of 01:38, 28 December 2016 by Wtchua.2013

Introduction

The main aim of this practicum is to give our sponsor insight into delivery patterns in the countries it manages, focusing on Australia and Japan because these two countries have posed the most problems. To do so, we will first analyse trends in three months' worth of data using four main techniques: exploratory analysis, clustering, time series analysis, and geospatial analysis.

With these analyses done, we hope to give our sponsor a clearer picture of the reasons for failed deliveries, which should help the company avoid similar pitfalls in the future.

Tools Used

We will manually extract the data we need from the raw data sheets provided. We also need to combine the data from both of the company's applications (App1 and App2). We will then use JMP Pro and Power BI to perform the exploratory analysis, clustering, and time series analysis. We consider JMP Pro the more powerful tool, but we are also using Power BI because our sponsor is familiar with it; familiarising ourselves with its display should give us a better idea of how to construct our final web app. QGIS will be our main application for the geospatial analysis.

Eventually, we will present our findings on a single display (most probably built with JavaScript), as requested by the sponsor.
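
The combination step could be sketched as follows. This is only an illustration: it assumes that both applications export rows sharing a common identifier, here given the hypothetical name `shipment_id`, and every field name and value in the sample rows is made up.

```python
def combine_records(app1_rows, app2_rows, key="shipment_id"):
    """Merge rows from two sources on a shared key, keeping unmatched rows too."""
    combined = {}
    for source, rows in (("app1", app1_rows), ("app2", app2_rows)):
        for row in rows:
            merged = combined.setdefault(row[key], {key: row[key], "sources": []})
            merged["sources"].append(source)
            for field, value in row.items():
                # Fill a field only if it is still missing or empty.
                if field != key and merged.get(field) in (None, ""):
                    merged[field] = value
    return list(combined.values())


# Hypothetical sample rows; the real column names are not given in this page.
app1 = [
    {"shipment_id": "S1", "country": "AU", "status": "Delivered"},
    {"shipment_id": "S2", "country": "JP", "status": ""},
]
app2 = [
    {"shipment_id": "S2", "status": "Failed", "postal_code": "100-0001"},
    {"shipment_id": "S3", "country": "JP", "status": "Delivered"},
]
combined = combine_records(app1, app2)
by_id = {row["shipment_id"]: row for row in combined}
```

Recording which application each row came from (the `sources` list) makes it easy to spot shipments that appear in only one of the two systems.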

Analysis

1. Exploratory Analysis

An exploratory analysis will be conducted first to analyse the shipping behaviour of different customers in different countries.

  • Determine the average turnaround time from the first to the last stage.
  • Determine the average time taken to close each status.
  • Identify patterns between destinations and shipment issues.
  • Identify types of shipments with frequent shipment issues.
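
As a rough sketch of the first item above, the average turnaround time can be computed by taking, for each shipment, the gap between its earliest and latest status update. The event layout (a shipment identifier plus an ISO timestamp) and the sample values are assumptions for illustration, not the actual data format.

```python
from collections import defaultdict
from datetime import datetime


def average_turnaround_hours(events):
    """events: iterable of (shipment_id, iso_timestamp) status updates."""
    stamps = defaultdict(list)
    for shipment_id, ts in events:
        stamps[shipment_id].append(datetime.fromisoformat(ts))
    # Turnaround = last update minus first update, per shipment.
    durations = [
        (max(times) - min(times)).total_seconds() / 3600
        for times in stamps.values()
        if len(times) > 1  # a single update gives no measurable duration
    ]
    return sum(durations) / len(durations) if durations else None


# Hypothetical status updates for two shipments.
events = [
    ("S1", "2016-10-01T08:00:00"),
    ("S1", "2016-10-02T12:00:00"),
    ("S1", "2016-10-03T08:00:00"),
    ("S2", "2016-10-02T09:00:00"),
    ("S2", "2016-10-03T09:00:00"),
]
avg_hours = average_turnaround_hours(events)  # (48 h + 24 h) / 2 = 36.0
```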

2. Geospatial Analysis

Shipping patterns and behaviour can be identified using geospatial analysis. The analysis will be carried out at the country, state/city and postal code levels. We will seek to answer the following questions:

  • Where are different customers located on the map, which areas are the more popular, and why?
  • How do location and proximity to the warehouses affect shipment time and procedures?
  • Which destinations have a high probability of shipment issues and should be flagged?
  • How long do different shipping routes take on average, from the start to the final destination?
  • How large are the gaps between shipment status updates, as a measure of each partner's performance in providing data and updates?
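
To illustrate the proximity question, the sketch below computes the great-circle (haversine) distance from a destination to the nearest warehouse. The actual analysis is planned in QGIS; this only shows the underlying calculation. The warehouse names and coordinates are hypothetical placeholders, not the sponsor's real sites.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two lat/lon points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def nearest_warehouse(dest, warehouses):
    """Return (name, distance_km) for the warehouse closest to dest."""
    return min(
        ((name, haversine_km(*dest, *coords)) for name, coords in warehouses.items()),
        key=lambda pair: pair[1],
    )


# Hypothetical warehouse locations (city-centre coordinates).
warehouses = {"Sydney": (-33.8688, 151.2093), "Tokyo": (35.6762, 139.6503)}
melbourne = (-37.8136, 144.9631)
name, distance_km = nearest_warehouse(melbourne, warehouses)
```

Straight-line distance is only a proxy for delivery effort, so any relationship with shipment time would still need to be checked against the actual route data.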

3. Clustering

We plan to cluster our data based on type of customer, shipping history, activity level and any other potential classifications which we may identify in the future. Each customer/vendor will then be assigned a cluster number.
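
The clustering method itself has not been fixed at this stage. Purely as an illustration, the sketch below uses k-means, one common choice, on two made-up features per customer (shipments per month and failure rate). The feature names, the values, and the choice of k-means are all assumptions.

```python
def kmeans(points, k, iterations=20):
    """Plain k-means; returns (labels, centroids)."""
    # Deterministic initialisation: use the first k points as centroids.
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(dim) / len(members) for dim in zip(*members)]
    return labels, centroids


# Hypothetical customers: (shipments per month, failure rate).
customers = ["C1", "C2", "C3", "C4"]
features = [(2, 0.05), (3, 0.10), (40, 0.02), (45, 0.04)]
labels, centroids = kmeans(features, k=2)
cluster_of = dict(zip(customers, labels))  # cluster number per customer
```

In practice the features would need to be put on comparable scales first, otherwise the variable with the largest range dominates the distance calculation.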

4. Time Series Analysis

Because the data can be organised by date, a time series analysis can be conducted. The analysis will be broken down into weekly and monthly periods to identify patterns and trends in the shipment and customer data.

We will also attempt to determine whether there are seasonal trends in shipment patterns across different countries and shipment types.
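
A minimal sketch of the weekly and monthly breakdown is shown below. It counts shipments per ISO week or per calendar month. The sample dates are made up, and using ISO weeks is an assumption about how weeks would be defined.

```python
from collections import Counter
from datetime import date


def counts_by_period(dates, period="week"):
    """Count shipments per ISO week ('week') or calendar month ('month')."""
    counts = Counter()
    for d in dates:
        if period == "week":
            iso = d.isocalendar()  # (ISO year, ISO week, ISO weekday)
            key = f"{iso[0]}-W{iso[1]:02d}"
        elif period == "month":
            key = f"{d.year}-{d.month:02d}"
        else:
            raise ValueError(f"unknown period: {period}")
        counts[key] += 1
    return dict(sorted(counts.items()))


# Hypothetical shipment dates.
shipment_dates = [
    date(2016, 10, 3),
    date(2016, 10, 5),
    date(2016, 10, 10),
    date(2016, 11, 1),
]
weekly = counts_by_period(shipment_dates, "week")
monthly = counts_by_period(shipment_dates, "month")
```

With only three months of data, counts like these can show week-to-week patterns, but they are too short a window to confirm yearly seasonality.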