ANLY482 AY2017-18 T2 Group 31 Main Findings and Analysis

From Analytics Practicum
Revision as of 22:41, 14 April 2018 by Chekweichia.2014 (talk | contribs) (Created page with "__NOEDITSECTION__ <div align="center"> 500px <!--Header Start--> {|style="background-color:#FFA500; color: #F5F5F5; padding: 10 0 10 0;" width="100%" ce...")
(diff) ← Older revision | Latest revision (diff) | Newer revision β†’ (diff)
Jump to navigation Jump to search

Bannernew.png

HOME

 

ABOUT US

 

PROJECT OVERVIEW

 

PROJECT ANALYSIS

 

PROJECT MANAGEMENT

 

ANLY482 HOMEPAGE

Data

Exploratory Data Analysis

Model Building

Main Findings and Analysis

Recommendation

Findings and Analysis

Spatial Point Analysis

SPA.png

Conducted spatial point distributions in Singapore on QGIS
Focus on HDB Land-Use Type as highest number of notifications
Case study done on Bedok Reservoir HDB region

Bedok.png

Belongs to the same precinct with only one entrance at Bedok Reservoir View near roundabout

Modified L Test

ModLTest1.png

Sharp increase due to data quality issues during collection of data
Duplicates and points that are very close together
Signs of statistically significant clustering even at very small radius

ModLTest2.png

Between 54.5 metres and 62 metres
Clustering but not statistically significant

ModLTest3.png

Statistically significant radius of 62 metres with signs of clustering
Likely to find another indiscriminately parked bike within 62 metres
Used as input for kernel density estimation

Kernel Density Estimation
To determine:
1. To identify cluster of locations that have higher occurrence of indiscriminate parkings

Function (kernel π‘˜) of a given radius (π‘Ÿ) β€œvisits” each point in the study region. π‘˜ provides the weight of the area surrounding 𝑠 in proportion to its distance to 𝑠_𝑖

KDEformula.png

π‘˜ is calculated as a function of the distance between point 𝑠 and 𝑠_𝑖, over given radius π‘Ÿ
The density of the study region is obtained by summing π‘˜ of all points 𝑠_𝑖 within π‘Ÿ

LargeBW.png SmallBW.png

Kernel Density Estimations are sensitive to changes in radius values
Large radius leads to a smoother curve, but local details would be obscured
Small radius leads to many small spikes that are very localised
Using the statistically significant radius distance obtained from Modified L Test as a search radius within each event

Interpolate.png

Perform interpolation by transforming the graph to make it smoother
Individual kernels are summed up to produce a smooth surface
Quartic kernel type is used in QGIS