Difference between revisions of "Hiryuu Analysis"

From Analytics Practicum
Jump to navigation Jump to search
Line 57: Line 57:
 
==<div style="background: #A4A4A4; line-height: 0.3em; font-family:Roboto;  border-left: #848484 solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#ffffff"><strong>Geospatial</strong></font></div></div>==
 
==<div style="background: #A4A4A4; line-height: 0.3em; font-family:Roboto;  border-left: #848484 solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#ffffff"><strong>Geospatial</strong></font></div></div>==
  
<h3>Simple Plot</h3>
+
<h3>Choropleth Plot</h3>
 
We performed a simple choropleth map based on the total number of inbound shipments into each city of each country. Doing so allows us to identify hot spots and detect relationships between the number of inbounds and the geographic location. Comparison can also be done easily at a single glance. We have plotted total Inbound for 2 countries and plan to carry out the rest for the remaining countries. We will also plot the total Outbound number, # Failures and % Failures with respect to each city.
 
We performed a simple choropleth map based on the total number of inbound shipments into each city of each country. Doing so allows us to identify hot spots and detect relationships between the number of inbounds and the geographic location. Comparison can also be done easily at a single glance. We have plotted total Inbound for 2 countries and plan to carry out the rest for the remaining countries. We will also plot the total Outbound number, # Failures and % Failures with respect to each city.
 
[[File:Wiki save country1.JPG|500px|center]]
 
[[File:Wiki save country1.JPG|500px|center]]
  
 
<h3>Neighbouring Polygons Patterns</h3>
 
<h3>Neighbouring Polygons Patterns</h3>
We observed that neighbouring cities around a city with a high number of inbounds tended to have higher inbounds than others as well. So hence we suspect that neigbouring inbound might be affected. Spatial randomness analysis will be relevant here, specifically Moran I and Geary’s C. We suspect that with time when we input failure points into this map and perform the same analysis, we might be able to find some pattern. Such that areas that tend to have high failures have neighbouring cities that also have failures. And possible explanations could be the transport mode, or transport companies that are assigned to handle these areas. These kind of information is rather useful to our sponsor.
+
We observed that when a city had a high Percentage Pass rate of shipments, the neighbouring cities around it tended to have a higher Percentage Passing rate as well. To investisgate this further and determine if there indeed is a spatial correlation between patterns, we utilised Moran I. This spatial measure was more prominent in Australia and the results showed that there indeed was a spatial correlation between cities and their Percentage Passes as the Mora I statistic is above 0.
 +
 
 +
[[File:Au map info.JPG|800px|center]]
 +
<center>Fig. Australia Percentage Pass, spatial correlation</center>
 +
 
 +
 
 +
[[File:Moran au.JPG|300px|center]]
 +
<center>Fig. Moran I measure for Australia</center>
 +
 
 +
Possible explanations for the spatial correlation could be the transport mode, or the couriers assigned to handle these areas. <br><br>
 +
 
 +
We decided to investigate for Japan's side as well. However, the limited time span of the data (3 months) ended up with all of Japan having a 100% passing rate. So the Moran I statistic returned a null value.
 +
 
 +
[[File:Japan map info.JPG|800px|center]]
 +
<center>Fig. Japan Percentage Pass, spatial correlation</center>
 +
 
 +
 
 +
[[File:Moran jpn.JPG|300px|center]]
 +
<center>Fig. Moran I measure for Japan</center>

Revision as of 11:14, 22 April 2017

Current Project

Logo Hiryuu.png


Home

About Us

Project Overview

Findings

Project Management

Documentations

Data Preparation Analysis

Exploratory

  • 1. TAT across different countries

    Distribution.JPG

Although there are some datasets where the 90 percentile of the TAT is less than 3 days, there are some data sets where there were a huge proportion of failures.
One example is the dataset below which has a high value of 13 days for its TAT at the 90th percentile. This is an alarming number and should be flagged out for further in-depth analyis on the factors for failure.

  • 2. Ending day of shipments

We have observed similar trends across various datasets in the failure rates for shipments ending on Monday and Tuesday.
An example of the distribution is shown below: Day of Week-Max(Stage Completed Date)-.JPG
For OB data, the reason for the higher failure rate might be the inavailability of customers over the weekends.
However, for IB data, there is no conclusive reason as of now, and we will be clarifying with our sponsors.

Time-Series

Geospatial

Choropleth Plot

We performed a simple choropleth map based on the total number of inbound shipments into each city of each country. Doing so allows us to identify hot spots and detect relationships between the number of inbounds and the geographic location. Comparison can also be done easily at a single glance. We have plotted total Inbound for 2 countries and plan to carry out the rest for the remaining countries. We will also plot the total Outbound number, # Failures and % Failures with respect to each city.

Wiki save country1.JPG

Neighbouring Polygons Patterns

We observed that when a city had a high Percentage Pass rate of shipments, the neighbouring cities around it tended to have a higher Percentage Passing rate as well. To investisgate this further and determine if there indeed is a spatial correlation between patterns, we utilised Moran I. This spatial measure was more prominent in Australia and the results showed that there indeed was a spatial correlation between cities and their Percentage Passes as the Mora I statistic is above 0.

Au map info.JPG
Fig. Australia Percentage Pass, spatial correlation


Moran au.JPG
Fig. Moran I measure for Australia

Possible explanations for the spatial correlation could be the transport mode, or the couriers assigned to handle these areas.

We decided to investigate for Japan's side as well. However, the limited time span of the data (3 months) ended up with all of Japan having a 100% passing rate. So the Moran I statistic returned a null value.

Japan map info.JPG
Fig. Japan Percentage Pass, spatial correlation


Moran jpn.JPG
Fig. Moran I measure for Japan