Difference between revisions of "ANLY482 AY2017-18T2 Group27 : Project Overview / Methodology"
Jump to navigation
Jump to search
Soneak.2014 (talk | contribs) |
|||
(2 intermediate revisions by 2 users not shown) | |||
Line 28: | Line 28: | ||
<div style="padding-left:0px; padding-right:0px; text-align: justify; font-size:13px"> | <div style="padding-left:0px; padding-right:0px; text-align: justify; font-size:13px"> | ||
==== 7.1 Tools Used ==== | ==== 7.1 Tools Used ==== | ||
− | In this project, | + | In this project, 3 main tools will be used - Power BI, Excel and JMP. |
− | Power BI is the choice of tool by | + | Power BI is the choice of tool by Company X and data visualisation will be done on this medium. |
− | + | Excel is used by Company X to store their data, taken from their system. It is also used for part of the data cleaning process, namely: categorizing density of each shipment into their respective freight density ratios and appending new data sets given to us. | |
− | + | JMP is used for part of the data cleaning process too, namely: removing rows with bad data and duplicates, and recoding of data fields. | |
− | + | ==== 7.2 Data Cleaning and Preparation ==== | |
+ | |||
+ | Since data cleaning was not the focus of Company X, we did basic cleaning. This includes: | ||
+ | |||
+ | * Removing Outliers | ||
+ | * Removed Duplicates | ||
+ | * Standardising Format of Data | ||
+ | *Transforming Relevant Variables | ||
− | |||
</div> | </div> |
Latest revision as of 16:15, 16 April 2018
Description | Data | Methodology |
7.0 Methodology
7.1 Tools Used
In this project, 3 main tools will be used - Power BI, Excel and JMP.
Power BI is the choice of tool by Company X and data visualisation will be done on this medium.
Excel is used by Company X to store their data, taken from their system. It is also used for part of the data cleaning process, namely: categorizing density of each shipment into their respective freight density ratios and appending new data sets given to us.
JMP is used for part of the data cleaning process too, namely: removing rows with bad data and duplicates, and recoding of data fields.
7.2 Data Cleaning and Preparation
Since data cleaning was not the focus of Company X, we did basic cleaning. This includes:
- Removing Outliers
- Removed Duplicates
- Standardising Format of Data
- Transforming Relevant Variables