Difference between revisions of "ANLY482 AY2017-18T2 Group27 : Project Overview / Methodology"

From Analytics Practicum
Jump to navigation Jump to search
(edit)
 
Line 36: Line 36:
 
JMP is used for part of the data cleaning process too, namely: removing rows with bad data and duplicates, and recoding of data fields.
 
JMP is used for part of the data cleaning process too, namely: removing rows with bad data and duplicates, and recoding of data fields.
  
==== 7.2 Project Methodology ====
+
==== 7.2 Data Cleaning and Preparation ====
 
  
Our brief plan of action includes doing descriptive analytics and predictive analytics. Descriptive analytics will be done via doing data visualisation. We will be combining various descriptive analysis of operational factors into a dashboard which will allow Company X to monitor, track and diagnose. Additionally, a regression model to allow Company X to better forecast shipment behaviour. We will also be doing secondary research from various journal articles to potentially supplement our project.  
+
Since data cleaning was not the focus of Company X, we did basic cleaning. This includes:
 +
 
 +
* Removing Outliers
 +
* Removed Duplicates
 +
* Standardising Format of Data
 +
*Transforming Relevant Variables
  
As of now, we will be focusing on understanding and cleaning the data.
 
 
</div>
 
</div>

Latest revision as of 16:15, 16 April 2018

Homepage

Our Team

Project Overview

Project Findings

Project Management

Documentation

ANLY482 AY2017-18 T2 Projects

Description Data Methodology

7.0 Methodology

7.1 Tools Used

In this project, 3 main tools will be used - Power BI, Excel and JMP.

Power BI is the choice of tool by Company X and data visualisation will be done on this medium.

Excel is used by Company X to store their data, taken from their system. It is also used for part of the data cleaning process, namely: categorizing density of each shipment into their respective freight density ratios and appending new data sets given to us.

JMP is used for part of the data cleaning process too, namely: removing rows with bad data and duplicates, and recoding of data fields.

7.2 Data Cleaning and Preparation

Since data cleaning was not the focus of Company X, we did basic cleaning. This includes:

  • Removing Outliers
  • Removed Duplicates
  • Standardising Format of Data
  • Transforming Relevant Variables