Difference between revisions of "ANLY482 AY2017-18T2 Group08 : Project Overview / Methodology"

From Analytics Practicum
Jump to navigation Jump to search
Line 23: Line 23:
 
| style="padding: 0.25em; font-size: 90%; border-top: 1px solid #cccccc; border-left: 1px solid #cccccc; border-right: 1px solid #cccccc; border-bottom: 1px solid #cccccc; text-align:center; background-color: #404040; width:33.3%" | [[ANLY482 AY2017-18T2 Group08 : Project Overview / Methodology | <font color="#ffffff ">Methodology</font>]]
 
| style="padding: 0.25em; font-size: 90%; border-top: 1px solid #cccccc; border-left: 1px solid #cccccc; border-right: 1px solid #cccccc; border-bottom: 1px solid #cccccc; text-align:center; background-color: #404040; width:33.3%" | [[ANLY482 AY2017-18T2 Group08 : Project Overview / Methodology | <font color="#ffffff ">Methodology</font>]]
 
|}
 
|}
==<div style="background: #404040; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px; font-size: 16px"><font color=#ffffff >Data</font></div>==
 
  
'''<big><font color="#fcb706">Data Sample</font></big>'''<br>
+
==<div style="background: #404040; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px; font-size: 16px"><font color=#ffffff >Methodology</font></div>==
oBike provided us with a sample data in the form of a csv file titled ''‘2018 01 02 oBike LTA ticket cases.csv’''. This file contains information on illegal parking. Put simply, when authorities make a complaint to oBike, the relevant details are recorded in a spreadsheet document. As of now, the dataset includes illegal parking cases from mid-November 2017 to early January 2018. Our team is currently in the midst of requesting for the entire data set in order to perform more in-depth exploratory data analysis. Refer to Figure 1 below for a screenshot of the first ten records in the sample.<br>
 
 
 
 
 
'''<big><font color="#fcb706">Metadata</font></big>'''
 
 
 
Please refer to Table 2 below for a more detailed description of the data provided. Data provided by the client as of now includes illegal parking cases from mid-November 2017 to early January 2018. According to oBike, on average, it incurs approximately 300 tickets per weekday. Data is only available from 0700 hours to 1900 hours on weekdays. Additional data including historical travel routes of the bicycles will also be given at a later date.
 
  
Note: <br>
 
(i) Data Type’ has been standardised based on SAS definitions<br>
 
(ii) ‘Length’ represents the maximum length of each data type
 
  
 +
==<div style="background: #404040; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px; font-size: 16px"><font color=#ffffff >Limitations</font></div>==
 +
oBike is a relatively new company and as such, they do not yet have extensive data collection measures in place. Consequently, we are only provided LTA ticket issuance data for the period of November to December 2017 and often, data from the last quarter of the year tends to differ from the rest as it clashes with the holiday season. This hinders our analysis as it becomes difficult to analyse and forecast annual trends. To help overcome this, our team will put forth a request to obtain data from May 2017 onwards, as well as upcoming months i.e. January/February 2018.
  
{| class="wikitable" style="text-align: center; background: white; margin: 0px auto; width:80%"
+
Nevertheless, given that the bike-sharing industry is relatively new in Singapore, it is highly volatile in nature. Ergo, older data might not be very representative of existing trends. Thus, the use of the most recent data for analysis, forecasting and prescriptive measures might be more suitable for such an industry.  
! style="background: #fcb706; color: #262626 ; font-weight: bold;" | Name
 
! style="background: #fcb706; color: #262626 ; font-weight: bold; " | Data Type
 
! style="background: #fcb706; color: #262626 ; font-weight: bold; " | Length
 
! style="background: #fcb706; color: #262626 ; font-weight: bold; " | Description
 
|-
 
|'''Month'''
 
|Date
 
|3
 
|Month that illegal parking took place. Sample data set contains the months ‘Nov’, ‘Dec’, ‘Jan’
 
|-
 
|'''Date'''
 
|Date
 
|9
 
|Date that illegal parking occurred. Format is DD-MMM-YY e.g. ‘10-Nov-17’
 
|-
 
|'''Authority'''
 
|Character
 
|6
 
|The regulatory authority that picked up on the illegal parking and issued the fines.<br>Authorities include ‘NPARKS’, ‘LTA’ (Land & Transport Authority), ‘TC’ (Town Council) and ‘OTHERS’.
 
|-
 
|'''Code'''
 
|Character
 
|10
 
|Represents a case number and is a unique identifier
 
|-
 
|'''Location'''
 
|Character
 
|250
 
|Description of where the illegal parking has occurred
 
|-
 
|'''Reported Time'''
 
|Numeric
 
|4
 
|Time that the illegal parking was reported. Represented in 24-hour military time
 
|-
 
|'''Completed'''
 
|Numeric
 
|4
 
|Time that oBike has responded to the illegal parking claims. Also represented in 24-hour military time
 
|-
 
|'''Due Time'''
 
|Numeric
 
|4
 
|Time that the illegally-parked bicycles have to be cleared out before the fines become due.<br> It is usually 4 hours from the reported time. Also represented in 24-hour military time
 
|-
 
|'''Duration'''
 
|Character
 
|2
 
|The amount of time oBike took to respond to the illegal parking claims.<br> ‘<4’ implies less than 4 hours while ‘4>’ implies more than 4 hours.
 
|-
 
|'''# of Bikes'''
 
|Numeric
 
|2
 
|Number of bicycles included in each ticket issued.
 
|-
 
|'''Arrange To'''
 
|Character
 
|6
 
|Person/Organisation in-charge of responding to the illegal parking claims
 
|-
 
|'''Status'''
 
|Character
 
|9
 
|Status of the response to illegal parking. Either ‘Ignore’, ‘Arranging’ or ‘Completed’.
 
|-
 
|'''Remarks'''
 
|Character
 
|108
 
|Additional comments regarding each case
 
|-
 
|}
 
  
==<div style="background: #404040; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px; font-size: 16px"><font color=#ffffff >Tools Used</font></div>==
+
Further, upon reviewing the data, we observe that ‘# of Bikes’ column has a significant amount of missing values. oBike has explained that this occurs because LTA does not always inform them of the exact number of bicycles included in each parking ticket. Given that LTA has only just begun stepping up their enforcements efforts, it is of no surprise that there are still variances in their reporting formats. As such, we cannot analyse the exact number of bikes that are illegally parked. However, since one ticket is issued for one or more bikes in the same location at the same time, useful insights can still be obtained with regards to the geographical locations where illegal bike parking problems are the most prevalent.
In_progress
 
 
 
==<div style="background: #404040; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px; font-size: 16px"><font color=#ffffff >Methodology</font></div>==
 
<b>Discovery</b><br/>
 
In_progress
 

Revision as of 16:04, 14 January 2018

Homepage

Our Team

Project Overview

Project Findings

Project Management

Documentation

ANLY482 AY2017-18 T2 Projects

Description Data Methodology

Methodology

Limitations

oBike is a relatively new company and as such, they do not yet have extensive data collection measures in place. Consequently, we are only provided LTA ticket issuance data for the period of November to December 2017 and often, data from the last quarter of the year tends to differ from the rest as it clashes with the holiday season. This hinders our analysis as it becomes difficult to analyse and forecast annual trends. To help overcome this, our team will put forth a request to obtain data from May 2017 onwards, as well as upcoming months i.e. January/February 2018.

Nevertheless, given that the bike-sharing industry is relatively new in Singapore, it is highly volatile in nature. Ergo, older data might not be very representative of existing trends. Thus, the use of the most recent data for analysis, forecasting and prescriptive measures might be more suitable for such an industry.

Further, upon reviewing the data, we observe that ‘# of Bikes’ column has a significant amount of missing values. oBike has explained that this occurs because LTA does not always inform them of the exact number of bicycles included in each parking ticket. Given that LTA has only just begun stepping up their enforcements efforts, it is of no surprise that there are still variances in their reporting formats. As such, we cannot analyse the exact number of bikes that are illegally parked. However, since one ticket is issued for one or more bikes in the same location at the same time, useful insights can still be obtained with regards to the geographical locations where illegal bike parking problems are the most prevalent.