ANLY482 AY2017-18T2 Group14 Project Methodology

From Analytics Practicum
Jump to navigation Jump to search

HOME

 

ABOUT US

 

PROJECT OVERVIEW

 

PROJECT MANAGEMENT

 

DOCUMENTATION

 

ANLY482 Main Page

 

 
Database Management Currently, the data are stored separately in two different places, one is stored in the mainframe whereas one is stored in newly added system which is still in negotiating process. The separation of the data has no doubts increase the difficulty of database management. The main data we going to use in our project, on-demand delivery data (aka., ODD data) is retrieved from new system. The major tools used for these two data are Microsoft Access and Microsoft Excel. In our project, we will assist the team to build an integrated database where they can store two set of data together in unified format. To achieve this, we will use the open-source relational database management system- MySQL for database management. Noticed that there are differences exist in two set of data, we will firstly extract the raw data from Microsoft Access then perform any needed data cleaning. Next, we will create different tables for different data, and set up proper relations among tables. By doing this, we can provide our client flexibilities to maintain (i.e. CRUD) their operating data. Lastly, imported the cleaned data into the already set up database system.

Exploratory Analysis and Model Building After setting up the database, we will begin analysing the data by performing exploratory data analysis using JMP. Then, we will use some analytical models, such as geospatial analysis, clustering, regression, in order to help us reveal any hidden patterns not known by the client.

Data Visualization and Reporting Microsoft Excel is currently used for routine data analysis and reporting but it only works well when the data is small. However, with the rapidly-growing data set used by the client, Microsoft Excel becomes a constraint that limits the data processing capacity and their work efficiency. Hence, in this project, Power BI, an interactive data visualization BI tools will be used to create an automatic dashboard for data analysis and reporting.