Difference between revisions of "Computational Transportation Science Project Overview"

From Analytics Practicum
Latest revision as of 15:02, 25 February 2015

Project Background

Singapore is a small country, yet it has a complex and comprehensive public transportation network. The network consists of trains (Mass Rapid Transit, hereinafter MRT), buses, light rail (Light Rail Transit, hereinafter LRT), and taxis, and it follows a hub-and-spoke strategy: buses provide transport within a town, and MRT trains are used for long-distance travel.

MRT ridership has increased significantly since 1997, as the MRT serves as a cheaper or faster alternative to a car or taxi for long-distance travel. However, from 2011 to the time of writing, confidence in the MRT system has dropped as it has been plagued by service breakdowns. Some of these breakdowns have been as short as 45 minutes and others as long as a full day. Most Singaporeans attribute the breakdowns to the sudden increase in foreign workers in the country, believing that the MRT infrastructure cannot cope with the resulting surge in ridership.

Responding to public calls to improve the MRT infrastructure has been a priority for the MRT operators. To improve reliability and restore confidence in the MRT, the operators need to understand the traffic patterns of MRT ridership.

Should the MRT operators cater to the morning peak by increasing the frequency of trains in the morning, or should they increase the train frequency in the evenings when commuters end their day? Should the same policies be applied across all stations, or should each station have its own?
With the Government’s planning projection of a population of up to 6.9 million in Singapore by 2030, we hope to use analytics to understand MRT travel patterns so as to improve MRT services.

This paper explores the travel patterns of MRT ridership in Singapore for the first week of November 2011. It continues the work of Roy LEE’s Master’s thesis and explores areas that LEE does not cover there.

Project Objective
  • Business objective: To identify the MRT ridership patterns of the various stations in order to improve MRT services.
  • Technical objective: To use data analytics techniques such as exploratory data analysis (EDA) and statistical methods to study and gain insights from the data, and to identify patterns that support the business objective. We will then use time series data mining methods to explore the different patterns.
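
The overview does not say which time series data mining method is applied. As one illustrative possibility only, the sketch below compares the shape of two stations' ridership profiles using Euclidean distance on z-normalised series, so that a busy station and a quiet station with the same daily rhythm count as similar. The profiles and function names are hypothetical and are not taken from the project data.

```python
import math


def z_normalise(series):
    """Rescale a series to mean 0 and standard deviation 1."""
    mean = sum(series) / len(series)
    variance = sum((x - mean) ** 2 for x in series) / len(series)
    std = math.sqrt(variance)
    if std == 0:
        # A flat series has no shape to compare; map it to all zeros.
        return [0.0] * len(series)
    return [(x - mean) / std for x in series]


def profile_distance(a, b):
    """Euclidean distance between two z-normalised series of equal length."""
    if len(a) != len(b):
        raise ValueError("profiles must cover the same time slots")
    za, zb = z_normalise(a), z_normalise(b)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(za, zb)))


# Hypothetical ridership counts over four time slots (invented values).
morning_peak = [1, 5, 1, 1]
morning_peak_busier = [2, 10, 2, 2]   # same shape, twice the volume
evening_peak = [1, 1, 1, 5]

same_shape = profile_distance(morning_peak, morning_peak_busier)
different_shape = profile_distance(morning_peak, evening_peak)
print(same_shape, different_shape)
```

Under these assumptions, the two morning-peak profiles are essentially at distance zero despite their different volumes, while the morning-peak and evening-peak profiles are far apart. A distance of this kind could feed a clustering step that groups stations with similar patterns.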
Project Scope
  • Perform data cleaning on the data set received to consolidate the important fields that are required for analysis.
  • Perform EDA to identify patterns that will help in the study of MRT ridership.
  • Use time series data mining to explore the patterns of the MRT ridership.
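
The first two scope items can be sketched as follows. This is a minimal illustration, assuming each raw record is a station name paired with a tap-in timestamp; the record layout, station names, and values are invented for the example and do not describe the actual data set.

```python
from collections import Counter, defaultdict
from datetime import datetime

# Hypothetical raw records: (station, tap-in timestamp). Invented values.
RAW_RECORDS = [
    ("Station A", "2011-11-01 08:05:00"),
    ("Station A", "2011-11-01 08:40:00"),
    ("Station A", "2011-11-01 18:10:00"),
    ("Station B", "2011-11-01 18:20:00"),
    ("Station B", "2011-11-01 18:45:00"),
    ("Station B", ""),            # missing timestamp, dropped during cleaning
    ("", "2011-11-01 09:00:00"),  # missing station, dropped during cleaning
]


def clean(records):
    """Keep only records with a station name and a parseable timestamp."""
    cleaned = []
    for station, stamp in records:
        if not station:
            continue
        try:
            when = datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S")
        except ValueError:
            continue
        cleaned.append((station, when))
    return cleaned


def hourly_counts(cleaned):
    """Count tap-ins per station for each hour of the day (0-23)."""
    counts = defaultdict(Counter)
    for station, when in cleaned:
        counts[station][when.hour] += 1
    return counts


def peak_hour(counts):
    """Return the busiest hour per station; the earliest hour wins ties."""
    return {
        station: min(hours, key=lambda h: (-hours[h], h))
        for station, hours in counts.items()
    }


counts = hourly_counts(clean(RAW_RECORDS))
peaks = peak_hour(counts)
print(peaks)
```

With the invented records above, Station A peaks in the morning and Station B in the evening, which is the kind of per-station difference the EDA step is meant to surface.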