Difference between revisions of "ANLY482 AY2017-18 T2 Group 31 Model Buidling and Analysis"

From Analytics Practicum
Jump to navigation Jump to search
 
(4 intermediate revisions by the same user not shown)
Line 43: Line 43:
 
[[ANLY482_AY2017-18_T2_Group_31 Project Analysis|<font color="#3c3c3c"><strong>Exploratory Data Analysis</strong></font>]]
 
[[ANLY482_AY2017-18_T2_Group_31 Project Analysis|<font color="#3c3c3c"><strong>Exploratory Data Analysis</strong></font>]]
 
| style="font-family:Century Gothic, Open Sans, Arial, sans-serif; font-size:15px; text-align: center; border:solid 2px #FF8C00; border-radius: 10px; background-color: #fff" width="250px" |   
 
| style="font-family:Century Gothic, Open Sans, Arial, sans-serif; font-size:15px; text-align: center; border:solid 2px #FF8C00; border-radius: 10px; background-color: #fff" width="250px" |   
[[ANLY482_AY2017-18_T2_Group_31 Model Buidling and Analysis|<font color="#3c3c3c"><strong>Model Building and Analysis</strong></font>]]
+
[[ANLY482_AY2017-18_T2_Group_31 Model Building|<font color="#3c3c3c"><strong>Model Building</strong></font>]]
 
| style="font-family:Century Gothic, Open Sans, Arial, sans-serif; font-size:15px; text-align: center; border:solid 1px #FF8C00; border-radius: 10px; background-color: #fff" width="250px" |   
 
| style="font-family:Century Gothic, Open Sans, Arial, sans-serif; font-size:15px; text-align: center; border:solid 1px #FF8C00; border-radius: 10px; background-color: #fff" width="250px" |   
 
[[ANLY482_AY2017-18T2_Group31_Project_Recommendation|<font color="#3c3c3c"><strong>Recommendation</strong></font>]]
 
[[ANLY482_AY2017-18T2_Group31_Project_Recommendation|<font color="#3c3c3c"><strong>Recommendation</strong></font>]]
Line 78: Line 78:
 
[[File:Converting.png|500px]]<br>
 
[[File:Converting.png|500px]]<br>
 
</div>
 
</div>
 +
 +
<b>Interpreting the Modified L Test graph</b><br>
 +
<div align="center">
 +
[[File:Interpret.png|500px]]<br>
 +
</div>
 +
</div>
 +
 +
<div style="margin:20px; padding: 10px; background: #ffffff; text-align:left; font-size: 95%;-webkit-border-radius: 15px;-webkit-box-shadow: 7px 4px 14px rgba(176, 155, 121, 0.96); -moz-box-shadow: 7px 4px 14px rgba(176, 155, 121, 0.96);box-shadow: 7px 4px 14px rgba(176, 155, 121, 0.96);">
 +
<p></p>
 +
<b>Kernel Density Estimation</b><br>
 +
To determine: <br>
 +
1. To identify cluster of locations that have higher occurrence of indiscriminate parkings<br>
 +
 +
Function (kernel 𝑘) of a given radius (𝑟) “visits” each point in the study region. 𝑘 provides the weight of the area surrounding 𝑠 in proportion to its distance to 𝑠_𝑖 <br>
 +
<div align="center">
 +
[[File:KDEformula.png|200px]]<br>
 +
</div>
 +
𝑘 is calculated as a function of the distance between point 𝑠 and 𝑠_𝑖, over given radius 𝑟 <br>
 +
The density of the study region is obtained by summing 𝑘 of all points 𝑠_𝑖 within  𝑟 <br>
 +
<div align="center">
 +
[[File:LargeBW.png|400px]] [[File:SmallBW.png|400px]]
 +
</div>
 +
Kernel Density Estimations are sensitive to changes in radius values <br>
 +
Large radius leads to a smoother curve, but local details would be obscured <br>
 +
Small radius leads to many small spikes that are very localised <br>
 +
Using the statistically significant radius distance obtained from Modified L Test as a search radius within each event<br>
 +
 +
<div align="center">
 +
[[File:Interpolate.png|400px]]
 +
</div>
 +
Perform interpolation by transforming the graph to make it smoother<br>
 +
Individual kernels are summed up to produce a smooth surface<br>
 +
Quartic kernel type is used in QGIS<br>

Latest revision as of 22:27, 14 April 2018

Bannernew.png

HOME

 

ABOUT US

 

PROJECT OVERVIEW

 

PROJECT ANALYSIS

 

PROJECT MANAGEMENT

 

ANLY482 HOMEPAGE

Data

Exploratory Data Analysis

Model Building

Recommendation

Model Building

Modified L Test via Ripley's K Function
To determine:
1. If the notifications appear to be clustered or randomly distributed in our area of interest
2. Minimum radius distance which shows signs of statistically significant clustering

RKFunction.png
RKFunctionFormula.png

Number of observed notifications is compared to the number of notifications expected based on Complete Spatial Randomness (CSR)
CSR assumes distribution of points is homogeneous over the study area
Null hypothesis: the spatial points are randomly distributed, using alpha = 0.01

GraphicalOutputK.png

Bold line represents the observed values for a range of 𝑟
Red dotted line represents the expected theoretical value for a range of 𝑟
Grey area is the confidence envelope obtained through 100 iterations of Monte Carlo procedures based on assumptions from CSR
For each simulated point pattern, 𝐾(𝑟) is estimated over a range of 𝑟. The max and min of these functions define an upper and lower simulation of the envelope

Converting K-function to L function, and to Modified L function

Converting.png

Interpreting the Modified L Test graph

Interpret.png

Kernel Density Estimation
To determine:
1. To identify cluster of locations that have higher occurrence of indiscriminate parkings

Function (kernel 𝑘) of a given radius (𝑟) “visits” each point in the study region. 𝑘 provides the weight of the area surrounding 𝑠 in proportion to its distance to 𝑠_𝑖

KDEformula.png

𝑘 is calculated as a function of the distance between point 𝑠 and 𝑠_𝑖, over given radius 𝑟
The density of the study region is obtained by summing 𝑘 of all points 𝑠_𝑖 within 𝑟

LargeBW.png SmallBW.png

Kernel Density Estimations are sensitive to changes in radius values
Large radius leads to a smoother curve, but local details would be obscured
Small radius leads to many small spikes that are very localised
Using the statistically significant radius distance obtained from Modified L Test as a search radius within each event

Interpolate.png

Perform interpolation by transforming the graph to make it smoother
Individual kernels are summed up to produce a smooth surface
Quartic kernel type is used in QGIS