Difference between revisions of "Car Park Overspill Study PROJECT DOCUMENTATION FINAL"

From Analytics Practicum
Jump to navigation Jump to search
 
(59 intermediate revisions by the same user not shown)
Line 5: Line 5:
 
| style="font-size:12px;background-color:#ffffff;border-width: 1px;padding: 8px;border-style: solid;border-color: #ffffff;text-align:center"|
 
| style="font-size:12px;background-color:#ffffff;border-width: 1px;padding: 8px;border-style: solid;border-color: #ffffff;text-align:center"|
 
[[Car_Park_Overspill_Study_Home|<font color="#66c0cf" size=3><b>HOME</b></font>]]   
 
[[Car_Park_Overspill_Study_Home|<font color="#66c0cf" size=3><b>HOME</b></font>]]   
 
||
 
| style="font-size:12px;background-color:#ffffff;border-width: 1px;padding: 8px;border-style: solid;border-color: #ffffff;text-align:center"|
 
[[Car_Park_Overspill_Study_TEAM|<font color="#66c0cf" size=3><b>TEAM</b></font>]]
 
  
 
||
 
||
Line 16: Line 12:
 
||
 
||
 
| style="font-size:12px;background-color:#ffffff;border-width: 1px;padding: 8px;border-style: solid;border-color: #ffffff;text-align:center"|
 
| style="font-size:12px;background-color:#ffffff;border-width: 1px;padding: 8px;border-style: solid;border-color: #ffffff;text-align:center"|
[[Car_Park_Overspill_Study_PROJECT_MANAGEMENT | <font color="#66c0cf" size=3><b>PROJECT MANAGEMENT</b></font>]]
+
[[Car_Park_Overspill_Study_PROJECT_MANAGEMENT | <font color="#66c0cf" size=3><b>PROJECT MANAGEMENT & DOCUMENTATION</b></font>]]
  
 
  ||  
 
  ||  
 
| style="font-size:12px;background-color:#ffffff;border-width: 1px;padding: 8px;border-style: solid;border-color: #ffffff;text-align:center"|
 
| style="font-size:12px;background-color:#ffffff;border-width: 1px;padding: 8px;border-style: solid;border-color: #ffffff;text-align:center"|
[[File:Click-here.png|25px|link=Car_Park_Overspill_Study_PROJECT_DOCUMENTATION]]
+
[[File:Click-here.png|25px|link=Car_Park_Overspill_Study_PROJECT_MANAGEMENT]]
[[Car_Park_Overspill_Study_PROJECT_DOCUMENTATION | <font color="#66c0cf" size=3><b> PROJECT DETAILS </b></font>]]
+
[[Car_Park_Overspill_Study_PROJECT_DOCUMENTATION_FINAL | <font color="#66c0cf" size=3><b>PROJECT ANALYSIS & IMPLEMENTATION</b></font>]]
 
|}
 
|}
  
Line 54: Line 50:
 
| style="font-family:Segoe UI; font-size:110%; background-color:#F5F5F5; border-bottom:2px solid #3f3f3f; text-align:center; color:#F5F5F5" width="8%" |  
 
| style="font-family:Segoe UI; font-size:110%; background-color:#F5F5F5; border-bottom:2px solid #3f3f3f; text-align:center; color:#F5F5F5" width="8%" |  
  
[[Social Media & Public Opinion - Project Overview|<font color="#222"><b>INTERIM</b></font>]]
+
[[Car_Park_Overspill_Study_PROJECT_DOCUMENTATION|<font color="#222"><b>INTERIM</b></font>]]
 
| style="font-family:Segoe UI; font-size:110%; background-color:#F5F5F5; border-bottom:2px solid #3f3f3f; text-align:center; color:#F5F5F5" width="8%" |  
 
| style="font-family:Segoe UI; font-size:110%; background-color:#F5F5F5; border-bottom:2px solid #3f3f3f; text-align:center; color:#F5F5F5" width="8%" |  
[[File:Click-here.png|25px|link=Car_Park_Overspill_Study_PROJECT_DOCUMENTATION]] &nbsp;
+
[[File:Click-here.png|25px|link=Car_Park_Overspill_Study_PROJECT_DOCUMENTATION_FINAL]] &nbsp;
 
[[Car_Park_Overspill_Study_PROJECT_DOCUMENTATION_FINAL|<font color="#222"><b>FINAL</b></font>]]
 
[[Car_Park_Overspill_Study_PROJECT_DOCUMENTATION_FINAL|<font color="#222"><b>FINAL</b></font>]]
 
|}
 
|}
Line 63: Line 59:
  
 
<!--Header End-->
 
<!--Header End-->
 +
=== Revised Methodology ===
 +
We have revised our goal of Phase 2, instead of building a simulation tool with projection of a car park together with its surrounding environment such as streets and roads, now we aim to let the tool to simulate a car park inside traffic only. Considering the raw data at hand, it includes surveyed car parks inbound and outbound traffic. The data become more meaningful when we uncover the pattern of car arriving at a car park and apply the pattern into a model which simulates the traffic inside a car park only. Therefore, we redefined the Phase 2 objective, and accordingly we update the preparing data process to suit the updated needs.
 +
 +
=== Data Preparation ===
 +
To help us understand the traffic situation inside a car park, we process the raw data in the following steps:
 +
 +
1. Based on the nature of the car park's associated site, hence, we categorized 4 sets of data - Shopping Malls, F&B Clusters, Community Clubs, and Hawker Centres. The following table shows the basic information of each category:
 +
{| class="wikitable"
 +
|-
 +
! Site Category !! Shopping Mall !! Community Club !! Hawker Centre !! F&B Cluster
 +
|-
 +
| Num of Samples || 17 || 10 || 10  || 7
 +
|-
 +
| Survey Periods || 10am - 9pm (Weekday & Weekend) || 9am - 6pm (Weekend), 2pm - 10pm (Weekday) || 7am - 3pm (Weekday & Weekend), 6pm - 8pm(Weekday) || 10am - 10pm (Weekday & Weekend)
 +
 +
|}
  
=== Data Exploration ===
+
2. Organize the Inbound Traffic data of each site in a site category, such that we have a collection of data showing the number of arriving cars at each car park in each 15 minutes interval from 7:00 am to 10:00 pm. The following screen shot shows an example of the collection of data of Shopping Mall Category sites:
There are 55 sites in total, each site has a dataset containing development human headcount data and the development car park (if any) vehicle count data during survey period. A typical car park dataset contains the following information: No. of cars parked in season lots, No. of cars in the car park, No. of Overspills. Survey time spans from 7am to 10am. Data is collected in 15 minutes interval during the survey time. Same data collecting frequency applied to human count, human count dataset rather contains the following information: human count in and human count out within each time interval. We had following findings:
 
* '''Difficulty in counting the real headcount in surveyed development'''
 
A primary problem of the collected data is that we cannot assure the real headcount in the development, as in most cases, there is no record of current human count in the development. Therefore, human count in and out data record just provide us the human traffic information. Besides, we found in many datasets, there is one or more record gap(s) between two time intervals with the human traffic recorded. This further increases the difficulty of finding real headcount in a development building.
 
* '''Insufficiency of collected raw data'''
 
We also realised that for each surveyed site, there are only two days of records.one set is for weekday and the other is for weekend. Having insufficient data would lower the confidence of collected data in representing the ordinary situation happens in the surveyed site everyday.
 
  
=== Data Preparation ===
+
[[File:Car Park Overspill Study PROJECT DOCUMENTATION FINAL P1.PNG|center|600px]]
In order to uncover the common characteristics of sites within a same category and expose the differences of sites from different categories, we firstly grouped each dataset by its category (hawker centre, community centre, shopping mall or F&B cluster).
+
 
We understood that the size of the surveyed development may affect the carpark traffic. In order to remove the development size factor, we decided to normalise each site’s data by applying the following formula for weekday and weekend separately:<br />
+
3. We understand that the car parks' associated development site have varied capacities in accommodating patrons or customers, so we normalized the data , by applying below formula:
 +
[[File:Car Park Overspill Study PROJECT DOCUMENTATION FINAL P2.png|center|400px]]
 +
The following screen shot shows an instance of data has been normalized:
 +
[[File:Car Park Overspill Study PROJECT DOCUMENTATION FINAL P3.PNG|center|600px]]
 +
4. After normalizing the data, in each site category, we still found a few outliers with normalized value deviating from the mean value greatly. To ensure that the data as input is effective in representing the general cases to the agent-based model we build, we have to remove those outliers.
 +
The following plot chart shows an example of the effect before and after removing the outliers:
 +
[[File:Car Park Overspill Study PROJECT DOCUMENTATION FINAL P4.PNG|center|600px]]
 +
 
 +
== Implementation of Agent-based Model ==
 +
Agent-based Simulation is a micro scale model that simulates the simultaneous operations and interactions of multiple agents in an attempt to re-create and predict the appearance of complex phenomena. Agents are a defined class of objects which have certain attributes and behaviours, they will interact with each other and also the world settings within the environment based on their pre-defined attributes and behaviours. Agent-based simulation is commonly used when agents’ behaviour can be easily defined by simple behavioural rules but the outcome could be complex and difficult to predict, such as the modelling of reproduction, economic benefits, social status and others.<br />
 +
 
 +
There are 3 main benefits for agent-based simulation compared to normal simulation, such as business process simulation:
 +
* The utilization of the car park depends on every individual drivers’ behaviour
 +
* Visualization that is commonly used in agent-based simulation helps us better understand which area of the car park tend to have congestions
 +
* The model will be flexible enough to takes in user’s inputs since the drivers’ behaviour can easily be adjusted for once and all the drivers will have the updated behaviour
 +
 
 +
=== Model Introduction ===
 +
The following diagram abstractly shows the inputs the simulation model needs and the output it generates. It takes two sorts of inputs:<br />
 +
[[File:Car Park Overspill Study PROJECT DOCUMENTATION FINAL P8.PNG|center|600px]]
 +
1. built-in configurations:
 +
Processed data in earlier stage is used here are configuration for the model when simulation the pattern of arriving cars at a car park. It will be scaled up proportionally to one of the user inputs - Car Park Capacity - and generate the number of arriving cars in each time interval. <br />
 +
 
 +
 
 +
2. User inputs and heuristic algorithm configuration parameters:
 +
There are 4 key user inputs which have to be specified before the simulation of the model. They are “Development Site Type”, “Peak/Non-Peak”, “Maximum Head Count” and “Number of Parking Space” as highlighted in orange in the pictures above.
 +
Based on “Development Site Type” and “Peak/Non-Peak”, we will retrieve the processed data which belongs to the 2 categories and this data will be used to calculate the “incoming Traffic / Head Count” at different time periods of the day. Next, the user will use “Maximum Head Count” to specify the rough size of the development site, for example, 1000 indicates that the development site will hold roughly maximum 1000 persons. We multiple the head count with the incoming traffic to find out the “Size of the Incoming Traffic” which is relative to the size of the development site. Lastly, we will generate the cars at different periods based on this data, the cars will interact with each other and also the world to see whether the “Number of Parking Space” entered by the user is enough for such development site.
 +
<br />
 +
[[File:Car Park Overspill Study PROJECT DOCUMENTATION FINAL P9.PNG|center|600px|The diagram shows how different parameters being processed in the model]]
 +
 
 +
 
 +
3. Output Charts:
 +
The Model will monitor and display the Occupancy Rate, Number of Overspills, and Average Waiting Time dynamically.
 +
<br />
 +
 
 +
=== Model Interface ===
 +
[[File:Car Park Overspill Study PROJECT DOCUMENTATION FINAL P7.PNG|center|1000px]]
 +
[[File:Car Park Overspill Study PROJECT DOCUMENTATION FINAL P10.PNG|center|1000px]]
 +
<br />
 +
 
 +
=== Defining agents' behavior===
 +
There are 2 types of agents in our model, patches and turtles. Patches refer the group units in the environment which are used to facilitate turtles’ behaviour, we have 3 patches, “Road”, “Parking Space” and “Wall”. Turtles refer to class of agents which travels and interacts in the environment, we have 1 turtles class, “Car”.<br />
 +
 
 +
“Road” is used to allow cars to move around, but additionally, some of the “Road” are identified as “Entrance” which only allows “Cars” to enter into the carpark, “Exit” which only allows “Car” to exit from the carpark and “Crossroad” which allows the cars to make turns. “Parking Space” is used to identify parking slots in the carpark, it will be labelled as “yes” to park if there is no cars on it, the status will be changed to “parked” if there is a car parking on it. “Wall” is simply used to define the boundary of the carpark and the cars will not be able to drive pass “Wall” patches.<br />
 +
 
 +
“Car” is slightly more complex to define. It has 4 basic behaviours, firstly, it must follow basic traffic rules, in our context, a car must drive on the left of the road and only make turns at crossroad. Secondly, a car or a driver will try to find the optimal place to park the car until the patience runs out, we uses “Patience” variable to describe the car or driver’s patience, it is modelled using normal distribution with user’s input of its mean and standard deviation. If the patience of the driver runs out, he will choose to park whenever he encounters the first free parking space. Thirdly, the driver will leave the carpark after the “Stay Duration” is reached. Similarly, “Stay Duration” is modelled using normal distribution with user’s inputs as well. Lastly, we introduce the idea “Randomness” since not all the drivers will follow the optimal parking algorithm, different drivers may make different decisions based on their own preferences. Hence, we give the driver a “Randomness” variable to make the decision making process more realistic.<br />
  
''No. of vehicles / maximum cumulative human headcount within the day''<br />
+
=== Parking Algorithm ===
  
Below shows the screenshot of a sample normalised data:<br />
+
Before we discussed the detailed parking algorithm, we have to first understand drivers’ behaviour in normal daily life and transform it into algorithm which best describes it in order to achieve more realistic and accurate simulation result.<br />
[[File:ZW Picture1.png|800px|center]]
 
  
After normalising each dataset (pertaining to a development site), we found the mean and variance of all datasets within one category at each given time interval. This is to generalise the data across the entire category. For example, the following screenshot shows the mean and its corresponding variance for normalised data points across entire shopping mall datasets:<br />
+
[[File:Car Park Overspill Study PROJECT DOCUMENTATION FINAL P6.PNG|center|600px]]<br />
[[File:ZW Picture2.png|600px|center]]
 
  
=== Data Distribution ===
+
The picture above describes a driver’s decision making process when enters a carpark. He will have a rough idea of the occupancy rate in the carpark since most of the Singapore carpark shows the number of vacancies at the entrance. If he thinks that occupancy rate is generally low, he will prefer to find a parking space which is near to the destination, in this context the lift or the shop that he wants to go, so that he can minimize the walking distance. The action will be to travel to the area which he thinks is close enough and if there are any vacancies left, he will choose any one of them. On the other hand, if he thinks the occupancy rate is high, the strategy is to find a parking space which is as optimal as possible. In this case, if a vacancy can be found near area which has many parked cars, it is probably a good choice since many people choose to park here while the occupancy rate is high.  
We did a cross-sites data distribution analysis for each category. The following charts show the mean and its associated variance during each time interval. By plotting the distribution of cross-sites data, we  can better understand the data characteristics of different development site categories. <br />
+
With these 2 simple behavior in mind, we want to find parking algorithm which suits them the best. After reading through several research papers, especially in the paper [http://www.indiana.edu/~abcwest/pmwiki/pdf/hutchinson.car%20parking.2012.pdf Car Parking as a Game Between Simple Heuristics (John M. C. utchinson Carola Fanselow Peter M. Todd, 2012)]. We managed to find few parking algorithms which can possibly be used:<br />
[[File:ZW Picture3.png|600px|center]]
+
[[File:Car Park Overspill Study PROJECT DOCUMENTATION FINAL P5.PNG|center|800px]]
Shopping Mall Cross-site chart shown reveals the following characteristics of Shopping mall datasets:
 
# All the data is collected from 10am to 9pm
 
# The distribution of normalised data points for each site is very close to the cross-sites data mean at all time period, as the variance are always less than 0.02. We can conclude that the mean values can well represent the data points of every site.
 
# At any given time interval, weekend mean is higher than weekday mean. This indicates that the vehicle per headcount ratio tends to be higher at weekend than weekday at shopping mall in general.  
 
# The vehicle per headcount ratio peaks at around 12am and 7pm
 
 
<br />
 
<br />
  
We further divided Hawker Centre datasets into two groups as we have realised that the vehicle per headcount ratios deviate between the two groups, as we can see in below charts:
+
'''“Fixed-Distance-Heuristics”''' means that the driver will take the first vacancy encountered within a fixed distance to the destination, it takes in a parameter “Distance” which represents the maximum distance that the driver can accept in order to choose the vacancy.<br />
[[File:ZW Picture4.png|600px|center]]
+
 
 +
'''“Proportional-Distance-Heuristics”''' means that the driver will record down the distance between the first parked car he encounter to the destination, a parameter “Proportion P” will be used to multiple that distance and any vacancies which are smaller than the result will be chosen by the driver. <br />
 +
 
 +
There are 2 main reasons for choose these 2 algorithms:<br />
  
Those Hawker Centres with smaller vehicle per headcount ratio are at surveyed sites like Yuhua, Holland Drive, Marine Parade, Marsiling. By viewing the distribution, we can conclude:
+
1. These 2 algorithms both take into consideration the distance to the destination which we believe is very important in the carpark study. The rest of the 4 algorithm does not consider this piece of information and hence may not describe the real case scenario accurately.<br />
# Weekend data is collected from 7am to 3pm, while weekday data covers not only the period from 7am to 3pm, but also with extra period from 6pm to 8pm
 
# On both weekday and weekend, the vehicle per headcount ratio tends to reach peak during period around 9am and lunch hours (12pm to 1pm)
 
# Weekend mean values are generally higher than weekday mean values at a given time period
 
# We can say the mean values can be considered as a good representative of cross-sites data, as the most of the variance values dropped within the range below 0.02
 
[[File:ZW Picture5.png|600px|center]]
 
  
Those Hawker Centres with larger vehicle per headcount ratio are surveyed sites like Boon Keng, New Upper Changi Rd Blk 208, new Upper Changi Rd Blk 58, Hougang, Serangoon. We can conclude the followings:
+
2. In terms of matching to the driver’s behavior, “Fixed-Distance-Heuristics” is very close to the case when the occupancy rate is low while all the rest of the algorithms can be used when the occupancy rate is high. However, one main difference for “Proportional-Distance-Heuristics” is that its parameter is relative to the carpark size while the other algorithms request a different input when the user change the carpark size. Therefore, we can use the optimal value for the “Proportion P” in the simulation with varying carpark size. This will definitely increase user-friendliness and accuracy of the model. With these considerations, we have chosen “Fixed-Distance-Heuristics” and “Proportional-Distance-Heuristics” to be the parking algorithm. <br />
# The data was collected from 7am to 3pm on both weekday and weekend
 
# The majority of data points fall within the range between 0.3 and 0.5
 
# Weekday data shows that the mean value peaks around 7am and 1:30pm on the survey day, while weekend data shows that the mean value peaks around 7:45am and 12:30pm on the survey day
 
# The variance values generally are less than 0.1 which indicates there are some deviations in cross-site data.
 
[[File:ZW Picture6.png|600px|center]]
 
  
The Community Centre data was collected from 2pm to 10pm on weekday while from 9am to 6pm on weekend. Based on the data, we can say that the vehicle per headcount ratio reaches its peak at around 8pm on weekday and 5:30pm on the weekend. The majority of variance values fall within the range below 0.02. The variance values tend to increase slightly during the peak hours. These indicate that the vehicle per headcount ratios across sites do not deviate from the mean much at non-peak hours, while start to deviate during peak hours on both days.
+
However, drivers in real life do not strictly follow the optimal solution all the time, which is why we introduce the idea of '''“Randomness”''' and '''“Patience”''', so the model will check the driver’s patience before deciding the parking algorithm, if the patience has run out, the driver will chooses the nearest vacancy. If not, he will proceed to choose either “Fixed-Distance-Heuristics” or “Proportional-Distance-Heuristics” based on the occupancy rate of the carpark. After choosing the algorithm and decide the direction to go, there is a certain chance which the driver may turn randomly instead of following the result direction from the algorithm since we want drivers’ behavior to be more random and more realistic. This level of randomness can be adjusted by the user. This loop will continue until the driver finds a parking space and start counting his “Stay Duration”. '''“Stay Duration”''' is used to model that amount of time which the driver will stay in the carpark, we used normal distribution to randomly generate a “Stay Duration” for the driver based on user’s input of mean and standard deviation. The driver will leave the carpark when the “Stay Duration” is reached and this will end the whole parking process for the driver.<br />
[[File:ZW Picture7.png|600px|center]]
 
  
After removing outliers, F&B Cluster cross-sites data distribution has the following characteristics:
+
=== Monitoring Results ===
# The data was collected from 10am to 10pm on both weekday and weekend.
+
The following screen shot shows three gadgets on the model's interface showing Occupancy Rate, Overspill and Average Waiting Time in real time manner during simulating:
# The vehicle per headcount ratio tends to peak around 2pm, 5pm and 10pm on weekday, while around 1pm, 8pm on weekend. 
+
[[File:Car Park Overspill Study PROJECT DOCUMENTATION FINAL P11.PNG|center|800px]]
# The variances are small on both days.
 
  
== Implementation of Agent-based Model ==
+
== Model Evaluation ==
=== Defining agent behaviour===
+
The accuracy of the model is hugely important, we want the model to provide realistic simulation of the car park usage so that the user can trust the results and make necessary adjustments based on the simulation results. Therefore, we randomly picked 2 development sites from each category and use their parameters to run the simulation, the simulation results were then exported to compare with the real data. We took the absolute difference between the simulation results and the real data and divide it by the real data to get a percentage error.
By studying the data by category, we can find out the distributions of the amount of cars which will travel to the sites and use the carpark. Also the chances that they may park on the street illegally. More complex behavior can be investigated if time allows.
+
The table below shows the results:
There is only a type of agent in our project which is the car travelling to the development site. The behavior of the car can be defined by few rules:
+
[[File:Car Park Overspill Study PROJECT DOCUMENTATION FINAL P12.PNG|800px|center]]
* Only cars which are travelling to the development site will be generated on the map
+
 
* Individual car in the map will travel along available roads towards the development site
+
The cause of differences between simulated results and actual data:
* Average parking time of individual car is set to be 60min
+
* Unstable human count which can affect the incoming traffic size and eventually the occupancy rate
* Cars will leave the carpark, travel towards the edge of the map and exit
+
* A small car park parking lot capacity tends to be more prone to error: For F&B cluster, the car park tends to be very small which usually has less than 50 parking space, few cars difference can lead to large simulation error in terms of percentage.
There are 3 types of patches that we need to define in the Model: road, development site and carpark:
+
 
* Road patches allows cars to travel on it and there will be a “direction” variable relates to it to define the direction that the cars should follow
+
== Project Assumptions & Limitations ==
* Development site patches indicates the location of the site, there is no functionality relates to the patches
+
Since our project was done based on a few assumptions, the model also has its limitations, this section will highlight the major points:
* Carpark patches allows cars to go in and out of the it, there will be a “count” variable which indicates the amount of cars in the carpark
+
* '''No real data of parking duration''': One of the major limitations is that there is no support of real data of parking duration. In our case, we use a normal distribution with user specified mean and standard deviation to generate the “Stay Duration” for each driver. However, this will definitely affect the accuracy of the model.
=== Next Following-up===
+
 
The next stage of the project focuses on development of the model since the major elements of the model are fully prepared which are the agents’ behavior and data. <br />
+
* '''Constant speed of cars''': The speed of the car was designed to be constant which is not true in real life, this could affect the amount of time which the driver takes to find a parking space.
 +
* '''Lack of details of driver behaviour''' (e.g.  enter, exit and parking action is instantaneous): Similar to the previous assumptions, there could be many more detailed driver actions, such as entering and exiting the car park, the parking process and others. All of these will affect the traffic in the car park and hence car park usage.
 +
* '''Choice of parking algorithm''': The parking algorithm for our model was chosen and modified based on common understanding of driver’s behavior in the car park and his decision making process. However, the real case could be different or more details need to be considered.
 +
* '''Pre-designed Car park Shape''': In order to maintain certain flexibility, our model chose to build the car park into a pre-designed car park shape so that we can adjust to user’s input of car park size. However it is also understood that car park shape will affect the cars’ behaviour significantly.
  
The tools which we are going to use is Netlogo as mentioned in the project proposal. Netlogo is known to be a user-friendly tool in agent based simulation field due to its clear graphical interface, great flexibility and easy to pick up coding syntax. All simulation model in Netlogo involves 2 development stages, the first one is the “setup” function which creates the default environment that the agent is going to interact in, the other one is “go” function which triggers the agent’s action and iterates infinitely until a result is generated or the user stops the process.<br />
+
== Further Improvement ==
 +
The simulation program can be further developed by introducing with the following features:
 +
* More detailed and accurate behaviour of drivers
 +
More detailed parking cars’ parking behaviours can be studied and modelled, such as the different parking styles: Angle Parking, Perpendicular Parking, and Parallel Parking etc.  
 +
In our model, due to lack of the data help us to define the parking duration of cars, we set its configuration as parameters that allow user to manipulate. Improvements can be made by collecting and modelling with realistic data showing the pattern of cars’ parking duration.  
  
Our project aims to incorporate real world elements into the model, such as the road direction and traffic condition to provide more convincing results of carpark overspill study. Therefore, “setup” function will be a challenge and also the main focus for our next stage plan. We would like to use the drawing functionalities in Netlogo to enable the user to input the settings of the environment which is close to the real world development site. <br />
+
* Map realistic car park layout into the model
 +
Modelling with realistic car park layout allows user to get simulation results that are more representative of the real traffic outcomes in the car park. It can be achieved by importing the raster map into NetLogo. Pixels in raster data have to be assigned with appropriate attribute values so that it allows programmer to define the rules for each patch in NetLogo.
 +
* Introduce additional parameters
 +
We have identified two kinds of parameters that will improve the accuracy of our model: <br />
  
The “go” function operates the agents based on the general rule we have discussed earlier on in the agents’ behavior section. The logic of some other considerations such as traffic light, crossings and turning of the vehicles will also be coded in this part to simulate a more realistic environment.<br />
+
1.Introducing parameters for configuring different range of confidence interval in defining car arriving pattern<br />
 +
2.Introducing parameters to define the attractiveness of car park associated development site. Besides, the model should capture the relation between this attractiveness parameter with the number of inbound traffic in the car park.
  
Therefore, we believe that the interaction between the pre-coded agents in the user input environment will generate interesting and useful insights for the user.
+
== Comments ==
 +
We thank you for any comment you have for this project!

Latest revision as of 23:26, 17 April 2016


HOME

PROJECT OVERVIEW

PROJECT MANAGEMENT & DOCUMENTATION

Click-here.png PROJECT ANALYSIS & IMPLEMENTATION


Preliminary Phase

Scope

Our sponsor requires us to finish 5 final reports of different development sites during the preliminary phase of our project. The necessary data and infographics are prepared by the sponsor and his tea, our job scope is to compile the report using the sample template and interpret the data to generate some insights which can be helpful for LTA future planning. A typical final report contains the following components:

  1. Executive Summary
  2. Site Background
  3. Site Characteristics
  4. Site Assessment
  5. Survey Findings
  6. Conclusion

“Executive Summary” gives a brief overview of the report and its conclusion. “Site Background”, “Site Characteristics” and “Site Assessment” provides the basic information of the site, such as the nature, the size, the transportation availability of the site and etc. Our focus is on “Survey Findings” which uses the data collected on both weekday and weekend at the site to generate some insights. The generated insights elaborate on the vehicles traffic pattern, the human traffic during different period of the day and most importantly whether there is overspill, this helps LTA understands the utilization rate of the car park and make certain adjustments in the future.

Actual Deliverables

By the end of the preliminary phase, our group has finished 5 final reports as planned at the beginning of the project. These 5 final reports are:

  • Final Report - Greenwood Ave & Hillcrest Rd
  • Final Report - Vivocity
  • Final Report - Frankel Ave
  • Final Report - Jalan Mata Ayer
  • Final Report - Yuhua Market & Hawker Centre

Phase 2

INTERIM

Click-here.png   FINAL


Revised Methodology

We have revised our goal of Phase 2, instead of building a simulation tool with projection of a car park together with its surrounding environment such as streets and roads, now we aim to let the tool to simulate a car park inside traffic only. Considering the raw data at hand, it includes surveyed car parks inbound and outbound traffic. The data become more meaningful when we uncover the pattern of car arriving at a car park and apply the pattern into a model which simulates the traffic inside a car park only. Therefore, we redefined the Phase 2 objective, and accordingly we update the preparing data process to suit the updated needs.

Data Preparation

To help us understand the traffic situation inside a car park, we process the raw data in the following steps:

1. Based on the nature of the car park's associated site, hence, we categorized 4 sets of data - Shopping Malls, F&B Clusters, Community Clubs, and Hawker Centres. The following table shows the basic information of each category:

Site Category Shopping Mall Community Club Hawker Centre F&B Cluster
Num of Samples 17 10 10 7
Survey Periods 10am - 9pm (Weekday & Weekend) 9am - 6pm (Weekend), 2pm - 10pm (Weekday) 7am - 3pm (Weekday & Weekend), 6pm - 8pm(Weekday) 10am - 10pm (Weekday & Weekend)

2. Organize the Inbound Traffic data of each site in a site category, such that we have a collection of data showing the number of arriving cars at each car park in each 15 minutes interval from 7:00 am to 10:00 pm. The following screen shot shows an example of the collection of data of Shopping Mall Category sites:

Car Park Overspill Study PROJECT DOCUMENTATION FINAL P1.PNG

3. We understand that the car parks' associated development site have varied capacities in accommodating patrons or customers, so we normalized the data , by applying below formula:

Car Park Overspill Study PROJECT DOCUMENTATION FINAL P2.png

The following screen shot shows an instance of data has been normalized:

Car Park Overspill Study PROJECT DOCUMENTATION FINAL P3.PNG

4. After normalizing the data, in each site category, we still found a few outliers with normalized value deviating from the mean value greatly. To ensure that the data as input is effective in representing the general cases to the agent-based model we build, we have to remove those outliers. The following plot chart shows an example of the effect before and after removing the outliers:

Car Park Overspill Study PROJECT DOCUMENTATION FINAL P4.PNG

Implementation of Agent-based Model

Agent-based Simulation is a micro scale model that simulates the simultaneous operations and interactions of multiple agents in an attempt to re-create and predict the appearance of complex phenomena. Agents are a defined class of objects which have certain attributes and behaviours, they will interact with each other and also the world settings within the environment based on their pre-defined attributes and behaviours. Agent-based simulation is commonly used when agents’ behaviour can be easily defined by simple behavioural rules but the outcome could be complex and difficult to predict, such as the modelling of reproduction, economic benefits, social status and others.

There are 3 main benefits for agent-based simulation compared to normal simulation, such as business process simulation:

  • The utilization of the car park depends on every individual drivers’ behaviour
  • Visualization that is commonly used in agent-based simulation helps us better understand which area of the car park tend to have congestions
  • The model will be flexible enough to takes in user’s inputs since the drivers’ behaviour can easily be adjusted for once and all the drivers will have the updated behaviour

Model Introduction

The following diagram abstractly shows the inputs the simulation model needs and the output it generates. It takes two sorts of inputs:

Car Park Overspill Study PROJECT DOCUMENTATION FINAL P8.PNG

1. built-in configurations: Processed data in earlier stage is used here are configuration for the model when simulation the pattern of arriving cars at a car park. It will be scaled up proportionally to one of the user inputs - Car Park Capacity - and generate the number of arriving cars in each time interval.


2. User inputs and heuristic algorithm configuration parameters: There are 4 key user inputs which have to be specified before the simulation of the model. They are “Development Site Type”, “Peak/Non-Peak”, “Maximum Head Count” and “Number of Parking Space” as highlighted in orange in the pictures above. Based on “Development Site Type” and “Peak/Non-Peak”, we will retrieve the processed data which belongs to the 2 categories and this data will be used to calculate the “incoming Traffic / Head Count” at different time periods of the day. Next, the user will use “Maximum Head Count” to specify the rough size of the development site, for example, 1000 indicates that the development site will hold roughly maximum 1000 persons. We multiple the head count with the incoming traffic to find out the “Size of the Incoming Traffic” which is relative to the size of the development site. Lastly, we will generate the cars at different periods based on this data, the cars will interact with each other and also the world to see whether the “Number of Parking Space” entered by the user is enough for such development site.

The diagram shows how different parameters being processed in the model


3. Output Charts: The Model will monitor and display the Occupancy Rate, Number of Overspills, and Average Waiting Time dynamically.

Model Interface

Car Park Overspill Study PROJECT DOCUMENTATION FINAL P7.PNG
Car Park Overspill Study PROJECT DOCUMENTATION FINAL P10.PNG


Defining agents' behavior

There are 2 types of agents in our model, patches and turtles. Patches refer the group units in the environment which are used to facilitate turtles’ behaviour, we have 3 patches, “Road”, “Parking Space” and “Wall”. Turtles refer to class of agents which travels and interacts in the environment, we have 1 turtles class, “Car”.

“Road” is used to allow cars to move around, but additionally, some of the “Road” are identified as “Entrance” which only allows “Cars” to enter into the carpark, “Exit” which only allows “Car” to exit from the carpark and “Crossroad” which allows the cars to make turns. “Parking Space” is used to identify parking slots in the carpark, it will be labelled as “yes” to park if there is no cars on it, the status will be changed to “parked” if there is a car parking on it. “Wall” is simply used to define the boundary of the carpark and the cars will not be able to drive pass “Wall” patches.

“Car” is slightly more complex to define. It has 4 basic behaviours, firstly, it must follow basic traffic rules, in our context, a car must drive on the left of the road and only make turns at crossroad. Secondly, a car or a driver will try to find the optimal place to park the car until the patience runs out, we uses “Patience” variable to describe the car or driver’s patience, it is modelled using normal distribution with user’s input of its mean and standard deviation. If the patience of the driver runs out, he will choose to park whenever he encounters the first free parking space. Thirdly, the driver will leave the carpark after the “Stay Duration” is reached. Similarly, “Stay Duration” is modelled using normal distribution with user’s inputs as well. Lastly, we introduce the idea “Randomness” since not all the drivers will follow the optimal parking algorithm, different drivers may make different decisions based on their own preferences. Hence, we give the driver a “Randomness” variable to make the decision making process more realistic.

Parking Algorithm

Before we discussed the detailed parking algorithm, we have to first understand drivers’ behaviour in normal daily life and transform it into algorithm which best describes it in order to achieve more realistic and accurate simulation result.

Car Park Overspill Study PROJECT DOCUMENTATION FINAL P6.PNG


The picture above describes a driver’s decision making process when enters a carpark. He will have a rough idea of the occupancy rate in the carpark since most of the Singapore carpark shows the number of vacancies at the entrance. If he thinks that occupancy rate is generally low, he will prefer to find a parking space which is near to the destination, in this context the lift or the shop that he wants to go, so that he can minimize the walking distance. The action will be to travel to the area which he thinks is close enough and if there are any vacancies left, he will choose any one of them. On the other hand, if he thinks the occupancy rate is high, the strategy is to find a parking space which is as optimal as possible. In this case, if a vacancy can be found near area which has many parked cars, it is probably a good choice since many people choose to park here while the occupancy rate is high. With these 2 simple behavior in mind, we want to find parking algorithm which suits them the best. After reading through several research papers, especially in the paper Car Parking as a Game Between Simple Heuristics (John M. C. utchinson Carola Fanselow Peter M. Todd, 2012). We managed to find few parking algorithms which can possibly be used:

Car Park Overspill Study PROJECT DOCUMENTATION FINAL P5.PNG


“Fixed-Distance-Heuristics” means that the driver will take the first vacancy encountered within a fixed distance to the destination, it takes in a parameter “Distance” which represents the maximum distance that the driver can accept in order to choose the vacancy.

“Proportional-Distance-Heuristics” means that the driver will record down the distance between the first parked car he encounter to the destination, a parameter “Proportion P” will be used to multiple that distance and any vacancies which are smaller than the result will be chosen by the driver.

There are 2 main reasons for choose these 2 algorithms:

1. These 2 algorithms both take into consideration the distance to the destination which we believe is very important in the carpark study. The rest of the 4 algorithm does not consider this piece of information and hence may not describe the real case scenario accurately.

2. In terms of matching to the driver’s behavior, “Fixed-Distance-Heuristics” is very close to the case when the occupancy rate is low while all the rest of the algorithms can be used when the occupancy rate is high. However, one main difference for “Proportional-Distance-Heuristics” is that its parameter is relative to the carpark size while the other algorithms request a different input when the user change the carpark size. Therefore, we can use the optimal value for the “Proportion P” in the simulation with varying carpark size. This will definitely increase user-friendliness and accuracy of the model. With these considerations, we have chosen “Fixed-Distance-Heuristics” and “Proportional-Distance-Heuristics” to be the parking algorithm.

However, drivers in real life do not strictly follow the optimal solution all the time, which is why we introduce the idea of “Randomness” and “Patience”, so the model will check the driver’s patience before deciding the parking algorithm, if the patience has run out, the driver will chooses the nearest vacancy. If not, he will proceed to choose either “Fixed-Distance-Heuristics” or “Proportional-Distance-Heuristics” based on the occupancy rate of the carpark. After choosing the algorithm and decide the direction to go, there is a certain chance which the driver may turn randomly instead of following the result direction from the algorithm since we want drivers’ behavior to be more random and more realistic. This level of randomness can be adjusted by the user. This loop will continue until the driver finds a parking space and start counting his “Stay Duration”. “Stay Duration” is used to model that amount of time which the driver will stay in the carpark, we used normal distribution to randomly generate a “Stay Duration” for the driver based on user’s input of mean and standard deviation. The driver will leave the carpark when the “Stay Duration” is reached and this will end the whole parking process for the driver.

Monitoring Results

The following screen shot shows three gadgets on the model's interface showing Occupancy Rate, Overspill and Average Waiting Time in real time manner during simulating:

Car Park Overspill Study PROJECT DOCUMENTATION FINAL P11.PNG

Model Evaluation

The accuracy of the model is hugely important, we want the model to provide realistic simulation of the car park usage so that the user can trust the results and make necessary adjustments based on the simulation results. Therefore, we randomly picked 2 development sites from each category and use their parameters to run the simulation, the simulation results were then exported to compare with the real data. We took the absolute difference between the simulation results and the real data and divide it by the real data to get a percentage error. The table below shows the results:

Car Park Overspill Study PROJECT DOCUMENTATION FINAL P12.PNG

The cause of differences between simulated results and actual data:

  • Unstable human count which can affect the incoming traffic size and eventually the occupancy rate
  • A small car park parking lot capacity tends to be more prone to error: For F&B cluster, the car park tends to be very small which usually has less than 50 parking space, few cars difference can lead to large simulation error in terms of percentage.

Project Assumptions & Limitations

Since our project was done based on a few assumptions, the model also has its limitations, this section will highlight the major points:

  • No real data of parking duration: One of the major limitations is that there is no support of real data of parking duration. In our case, we use a normal distribution with user specified mean and standard deviation to generate the “Stay Duration” for each driver. However, this will definitely affect the accuracy of the model.
  • Constant speed of cars: The speed of the car was designed to be constant which is not true in real life, this could affect the amount of time which the driver takes to find a parking space.
  • Lack of details of driver behaviour (e.g. enter, exit and parking action is instantaneous): Similar to the previous assumptions, there could be many more detailed driver actions, such as entering and exiting the car park, the parking process and others. All of these will affect the traffic in the car park and hence car park usage.
  • Choice of parking algorithm: The parking algorithm for our model was chosen and modified based on common understanding of driver’s behavior in the car park and his decision making process. However, the real case could be different or more details need to be considered.
  • Pre-designed Car park Shape: In order to maintain certain flexibility, our model chose to build the car park into a pre-designed car park shape so that we can adjust to user’s input of car park size. However it is also understood that car park shape will affect the cars’ behaviour significantly.

Further Improvement

The simulation program can be further developed by introducing with the following features:

  • More detailed and accurate behaviour of drivers

More detailed parking cars’ parking behaviours can be studied and modelled, such as the different parking styles: Angle Parking, Perpendicular Parking, and Parallel Parking etc. In our model, due to lack of the data help us to define the parking duration of cars, we set its configuration as parameters that allow user to manipulate. Improvements can be made by collecting and modelling with realistic data showing the pattern of cars’ parking duration.

  • Map realistic car park layout into the model

Modelling with realistic car park layout allows user to get simulation results that are more representative of the real traffic outcomes in the car park. It can be achieved by importing the raster map into NetLogo. Pixels in raster data have to be assigned with appropriate attribute values so that it allows programmer to define the rules for each patch in NetLogo.

  • Introduce additional parameters

We have identified two kinds of parameters that will improve the accuracy of our model:

1.Introducing parameters for configuring different range of confidence interval in defining car arriving pattern
2.Introducing parameters to define the attractiveness of car park associated development site. Besides, the model should capture the relation between this attractiveness parameter with the number of inbound traffic in the car park.

Comments

We thank you for any comment you have for this project!