ANLY482 AY2016-17 T2 Group12 : Project Overview / Methodology

From Analytics Practicum
Revision as of 16:49, 12 January 2017 by Tingzhi.lim.2013 (talk | contribs)
Jump to navigation Jump to search

Home

About Us

Project Overview

Findings

Project Management

Documentation

Description Methodology


Data

The dataset provided by KST Bikers is a Feedback System which consists of feedback lodged by:

  • SMS
  • Email
  • Feedback Form

Tools Used

  • Microsoft Excel 2016
  • JMP Pro 13
  • Tableau 10.0

Methodology

  • Data Collection

The dataset is from KST Bikers feedback system which is collected from a variety of sources such as email, SMS and feedback form. We will also be using external data such as weather and public holiday data. Having such data allows us to examine external factors which could impact the generation of feedbacks.

  • Data Exploration

Spot missing values, identify outliers and select necessary variables such as categories and subcategories for analysis. We will also figure out the number of feedbacks in each subcategories. This will allow us to figure out which are the top few most important problems that Singaporeans faced.

  • Data Cleaning

Outliers and missing values cause data inaccuracy. Hence we will remove missing values and outliers. However, if there are too many outliers, they will be treated as a separate group for analysis.

  • Data Normalization and Transformation

As the variables in the dataset have different forms of measurements, normalization is conducted to provide equal weightage to each variable. Z-score normalization will be used. If the distribution of the variables is found to be skewed, natural log will be conducted to each involved variable to make the model more normally distributed.

  • Dashboards

Two visual dashboards will be created for KST Bikers to visualize the analysis. The dashboards will provide a summary of the trends in the feedback data and the different external factors which generate these feedbacks.