Difference between revisions of "ANLY482 AY2017-18T2 Group19 Data"

From Analytics Practicum
Jump to navigation Jump to search
 
(8 intermediate revisions by the same user not shown)
Line 1: Line 1:
 
<!--Logo-->
 
<!--Logo-->
[[Image:G19_logo.png|center|748x116px]]<br>
+
[[Image:G19_Logo.png|center|800x150px|link=ANLY482 AY2017-18T2 Group19]]<br>
  
 
<!--Navbar Start-->
 
<!--Navbar Start-->
Line 41: Line 41:
  
 
| style="vertical-align:top;width:25%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:14px; border: 0px solid; font-family:helvetica">
 
| style="vertical-align:top;width:25%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:14px; border: 0px solid; font-family:helvetica">
[[ANLY482 AY2017-18T2 Group19 Background| <font face='Century Gothic' color="#FFFFFF"><b>BACKGROUND</b></font>]]
+
[[ANLY482 AY2017-18T2 Group19 Project Overview| <font face='Century Gothic' color="#FFFFFF"><b>BACKGROUND</b></font>]]
  
 
| style="vertical-align:top;width:25%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:14px; border: 0px solid; font-family:helvetica">
 
| style="vertical-align:top;width:25%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:14px; border: 0px solid; font-family:helvetica">
Line 53: Line 53:
  
  
<div style="background: #FFFFFF; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.03em;font-size:16px;id:UT1"><font face='Century Gothic' color=#000000 ><u>DATA DESCRIPTION</u></font></div>
+
We have obtained the following datasets from the client
 +
# Year 2016 - 2-hour transaction data 
 +
# Year 2016 - 3-day transaction data
 +
# Year 2017 - 3-hour transaction data 
 +
# Year 2017 - 3-day transaction data
 +
# Course reserve books master data
  
The data required for us comes in 3 parts.
+
Each row represents one borrowing instance. It was given to us in the following format:
#The demand side where the patients live which is derived from the percentage distribution of elderly in the respective subzones. This data will be created at random to simulate the daily nodes for each patient.
 
#The supply side where the foundation/organisation dispatches nurses to their scheduled patients
 
#The details of services with details such as the mean and standard deviation to provide a more realistic constraint for the algorithm to take into account when routing the nurses to their patients.
 
  
The dataset for the demand side will need to be generated by ourselves which includes:
+
<center><i>Loan Transaction Data</i></center>
*Address: Address of patient
 
*Latitude: Latitude of patient’s address
 
*Longitude: Longitude of patient’s address
 
*Type:
 
*Appointment_Times: Prefered appointment time
 
  
The dataset for the supply side will need to be generated by ourselves which includes:
+
[[Image:G19_MetaData1.png|500px|center]] &nbsp;
*Id: Identification for nurse
 
*Address: Address of origin
 
*Latitude: Latitude of address
 
*Longitude: Longitude of address
 
*Type:  
 
  
The dataset for the details of services includes:
+
The course reserve books master data details the collection of course reserves available for loan. This dataset would include the following fields of information:  
*Discipline:
 
*Type (i.e. revisits, etc):
 
*Request count:
 
*Average: average time required for the treatment type
 
*SD: standard deviation of time required for the treatment type
 
  
 +
<center><i>Course Reserve Books Master Data</i></center>
  
<div style="background: #FFFFFF; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.03em;font-size:16px;id:UT1"><font face='Century Gothic' color=#000000 ><u>DATA COLLECTION</u></font></div>
+
[[Image:G19_MetaData2.png|500px|center]] &nbsp;
 
 
We need to conduct research to better understand the demand side and supply side to generate sufficient amount of data to work with.  
 
 
 
 
 
<div style="background: #FFFFFF; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.03em;font-size:16px;id:UT1"><font face='Century Gothic' color=#000000 ><u>DATA CLEANING AND TRANSFORMATION</u></font></div>
 
 
 
The data will need to be transformed into the required formats in order for us to perform the necessary processing later.
 
 
 
 
 
<div style="background: #FFFFFF; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.03em;font-size:16px;id:UT1"><font face='Century Gothic' color=#000000 ><u>EXPLORATION OF DATA</u></font></div>
 
 
 
At this phase, the data will be explored in order for us to choose a suitable model to be built.
 

Latest revision as of 19:36, 15 April 2018

G19 Logo.png


G19 Home.png   HOME

 

G19 Overview Icon.png   PROJECT OVERVIEW

 

G19 Findings Icon.png   PROJECT FINDINGS

 

G19 Management Icon.png   PROJECT MANAGEMENT

 

G19 Documentation Icon.png   DOCUMENTATION

 

G19 To Main Page icon.png   BACK TO MAIN PAGE


 


We have obtained the following datasets from the client

  1. Year 2016 - 2-hour transaction data
  2. Year 2016 - 3-day transaction data
  3. Year 2017 - 3-hour transaction data
  4. Year 2017 - 3-day transaction data
  5. Course reserve books master data

Each row represents one borrowing instance. It was given to us in the following format:

Loan Transaction Data
G19 MetaData1.png

 

The course reserve books master data details the collection of course reserves available for loan. This dataset would include the following fields of information:

Course Reserve Books Master Data
G19 MetaData2.png