Group06 Elec3city Proposal

From Geospatial Analytics and Applications
Revision as of 19:04, 14 April 2019 by Yu.fu.2015




Project Motivation

Rising energy consumption has been a concern for the Singapore government for several years. Recently, the government has begun pushing for more efficient energy usage, with most effort going into the efficiency of energy sources, e.g. using less carbon-intensive fuels. In exploring potential ways to aid this cause, we realised that there has been scant statistical analysis of energy consumption patterns. As such, our team feels that there is a need for an app that allows authorities in Singapore, such as the National Environment Agency, to understand with data-driven evidence the origins of variation in Singapore's energy consumption, so as to allow for more targeted efforts to reduce energy wastage. Another factor that makes this application feasible is the availability of energy consumption data, with granularity right down to the individual postal code.


Project Objective

We have observed rising energy consumption across all sub-sectors, including households, whose consumption rose sharply from 6092.5 GWh in 2005 to 7295.8 GWh in 2017. As such, the household sub-sector is a good starting point for our analysis.

  • Energy Consumption Patterns in Singapore

By exploring energy consumption patterns among households in Singapore, we will be able to tease out potential drivers of energy consumption. We will look not only at seasonal patterns in consumption (i.e. Is there a particular month in a year when energy consumption spikes?) but also at spatial patterns in consumption (i.e. Is there a particular region in Singapore where energy consumption is particularly high?).

  • Temperature VS Energy Consumption

The next issue that we are interested in exploring is the relationship between temperature and energy consumption. Intuitively, we expect that as temperature increases, energy consumption will also increase because more electrical appliances, such as air-conditioners, are used to regulate temperature. Thus, we plan to explore how temperature in different regions of Singapore relates to the energy consumption in each respective region.



Data

Data | Source | Data Type
Average Monthly Household Electricity Consumption by Postal Code (Public Housing) & Dwelling Type, 2H 2016 | Energy Market Authority (ema.gov.sg) | xls
Average Monthly Household Electricity Consumption by Postal Code (Public Housing) & Dwelling Type, 1H 2016 | Energy Market Authority (ema.gov.sg) | xls
Average Monthly Household Electricity Consumption by Postal Code (Public Housing) & Dwelling Type, 2H 2015 | Energy Market Authority (ema.gov.sg) | xls
Average Monthly Household Electricity Consumption by Postal Code (Public Housing) & Dwelling Type, 1H 2015 | Energy Market Authority (ema.gov.sg) | xls
Average Monthly Household Electricity Consumption by Postal Code (Public Housing) & Dwelling Type, 2H 2014 | Energy Market Authority (ema.gov.sg) | xls
Average Monthly Household Electricity Consumption by Postal Code (Public Housing) & Dwelling Type, 1H 2014 | Energy Market Authority (ema.gov.sg) | xls
Average Monthly Household Electricity Consumption by Postal Code (Public Housing) & Dwelling Type, 2H 2013 | Energy Market Authority (ema.gov.sg) | xls
Average Monthly Household Electricity Consumption by Postal Code (Public Housing) & Dwelling Type, 1H 2013 | Energy Market Authority (ema.gov.sg) | xls
Average Monthly Household Electricity Consumption by Postal Code (Private Apartments), 2015 and 2016 | Energy Market Authority (ema.gov.sg) | xls
Average Monthly Household Electricity Consumption by Postal Code (Private Apartments), 2013 to 2014 | Energy Market Authority (ema.gov.sg) | xls
Resident Households by Planning Area and Dwelling Type/Household Size/Monthly Household Income | Department of Statistics Singapore (singstat.gov.sg) | xls
Singapore Residents by Planning Area/Subzone, Age Group and Sex, June 2000 - 2018 | Department of Statistics Singapore (singstat.gov.sg) | csv
Singapore Residents by Planning Area/Subzone and Type of Dwelling, June 2000 - 2018 | Department of Statistics Singapore (singstat.gov.sg) | csv
Singapore Climate Historical Data (crawled to get temperature and rain data from 2013 to 2016 at daily granularity) | Meteorological Service Singapore (weather.gov.sg) | csv

Literature Review

As part of our due diligence for the project, the team reviewed several research papers to learn best practices for analysing geospatial variation in energy use when it is compared against variables such as temperature and housing composition.

1. Appropriate use of Interpolation Methods in GIS - Mitas, L. and Mitasova, H. Spatial Interpolation, Chap. 34 Spatial Interpolation (2005)

Aim of literature: to guide the reader on the appropriate interpolation method for different GIS themes.



Comparison of Digital Elevation Models computed from contours, splines with tension and stream enforcement, and by regularised spline with tension (RST)


Methodology:
1. Inverse Distance Weighted Interpolation (IDW) - adopted
2. Kriging - rejected
3. Regularised spline with tension (RST) - rejected


Learning Points:
1. Inverse Distance Weighted Interpolation (IDW)

  • Pro: relatively less demanding computationally
  • Pro: better at reproducing approximations on linear patterns
  • Con: "produces local extrema at the data points"


2. Kriging

  • Con: While good at predicting the spatial distribution of uncertainty, it is less successful for applications where local geometry and smoothness are the key issues. This is a critical weakness for our interpolation of temperature data, where granularity is at housing block level, so Kriging is rejected.


3. Regularised spline with tension (RST)

  • Pro: Allows for smoothing according to parameters such as the tension φ and smoothing weights {wj} which are empirically informed through minimisation of the predictive error estimated by a cross-validation procedure
  • Pro: Can realistically represent rough gradients in spite of the smoothness condition, if the roughness is sufficiently described by the input data - might be true of temperature when it comes to the Urban Heat Island effect - pockets of high building density can cause a micro-climate of higher temperatures; particularly pertinent in Singapore.
  • Con: requires a lot of 'guess-timation' and past domain knowledge to fine-tune the tension and smoothing weights.


Areas for improvement:
Our team has selected IDW as the technique for interpolating temperature data from the 22 weather stations across Singapore.
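
To make the IDW idea concrete, here is a minimal Python sketch, not the team's implementation. It assumes planar coordinates, a distance power of 2 and made-up station readings; the function name idw_interpolate and all sample values are illustrative only.

```python
import numpy as np

def idw_interpolate(station_xy, station_values, query_xy, power=2.0):
    """Estimate values at query points as inverse-distance-weighted averages."""
    station_xy = np.asarray(station_xy, dtype=float)
    station_values = np.asarray(station_values, dtype=float)
    query_xy = np.asarray(query_xy, dtype=float)

    # Pairwise distances, shape (n_query, n_stations).
    diff = query_xy[:, None, :] - station_xy[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=2))

    estimates = np.empty(len(query_xy))
    for i, d in enumerate(dist):
        exact = d == 0.0
        if exact.any():
            # A query point that coincides with a station takes its value.
            estimates[i] = station_values[exact][0]
        else:
            weights = 1.0 / d ** power
            estimates[i] = np.sum(weights * station_values) / np.sum(weights)
    return estimates

# Made-up example: three stations (x, y) with temperatures in degrees Celsius.
stations = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
temps_c = [27.0, 29.0, 31.0]
blocks = [(0.0, 0.0), (5.0, 5.0)]  # hypothetical housing-block locations
estimated = idw_interpolate(stations, temps_c, blocks)
```

Because each estimate is a weighted average, it always lies between the lowest and highest station reading, which is consistent with the "local extrema at the data points" drawback noted above.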

2. A Spatial Analysis of the Relationship between Vegetation and Poverty - Dawson T., Sandoval J.S., Sagan V. and Crawford T. (2018)

Aim of literature: investigate poverty and inequities that are associated with vegetation



Geospatial Visualisation of MAXN (regression coefficient for the time variable showing trend in Normalized Difference Vegetation Index) against race poverty geospatial distribution
Local R-Squared values of model in Detroit


Methodology:
1. Pixel level regression - Curve Fit extension in ArcGIS

  • Run regression trend analysis using raster datasets for temporal analysis

2. Global Ordinary Least Squares (OLS) regression

  • Capture global geospatial correlation

3. Local Geographically Weighted Regression (GWR)

  • Capture local geospatial correlation

4. Moran's I for spatial autocorrelation

  • For local level analysis of spatial autocorrelation

5. Local Indicators of Spatial Association (LISA) map - Contiguity Edges and Corners method

  • Queen contiguity to show clustering


Learning Points:
1. Pixel level regression - Curve Fit extension in ArcGIS

  • Helps us see how well the model predicts energy consumption given our variables

2. Global Ordinary Least Squares (OLS) regression

  • Check whether the error terms all have the same variance and a mean of zero. If so, and if they are also uncorrelated, the least squares estimator is the best linear unbiased estimator of the model coefficients.
  • If residuals are spatially correlated, OLS inference is unreliable. GWR models would then be used to account for the spatial autocorrelation in the residuals.

3. Local Geographically Weighted Regression (GWR)

  • Provides local t-values with which to find level of confidence in our local model

4. Moran's I for spatial autocorrelation

  • We can use this to ascertain if local level analysis is indeed appropriate to understand the relationship between income level and energy consumption, after accounting for other factors like number of household members and number of rooms.

5. Local Indicators of Spatial Association (LISA) map - Contiguity Edges and Corners method

  • Shows us clustering of energy consumption at local level
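
As an illustration of the Moran's I statistic with queen contiguity discussed in this review, the following is a minimal Python sketch rather than the method used in the paper or by the team. It assumes observations sit on a regular grid (real postal-code points would need a different neighbour definition), and the names queen_weights and morans_i, as well as the sample grids, are illustrative.

```python
import numpy as np

def queen_weights(n_rows, n_cols):
    """Binary queen-contiguity weights (shared edges and corners) for a grid."""
    n = n_rows * n_cols
    w = np.zeros((n, n))
    for r in range(n_rows):
        for c in range(n_cols):
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    rr, cc = r + dr, c + dc
                    inside = 0 <= rr < n_rows and 0 <= cc < n_cols
                    if (dr, dc) != (0, 0) and inside:
                        w[r * n_cols + c, rr * n_cols + cc] = 1.0
    return w

def morans_i(values, w):
    """Global Moran's I = (n / S0) * (z' W z) / (z' z), z = deviations from the mean."""
    x = np.asarray(values, dtype=float)
    z = x - x.mean()
    return (len(x) / w.sum()) * (z @ w @ z) / (z @ z)

w = queen_weights(4, 4)

# Made-up grids: high values grouped on the left vs. an alternating pattern.
clustered = np.array([[10.0, 10.0, 0.0, 0.0]] * 4).ravel()
checker = (np.indices((4, 4)).sum(axis=0) % 2).astype(float).ravel()

clustered_i = morans_i(clustered, w)  # positive: similar values are neighbours
checker_i = morans_i(checker, w)      # negative: unlike values are neighbours
```

A positive value suggests clustering of similar consumption levels, while a value near zero suggests no spatial pattern; significance testing (for example by permutation) is omitted from this sketch.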


Areas for improvement:
1. Pixel level regression - Curve Fit extension in ArcGIS

  • We do not have access to ArcGIS, so we will use the curveFit function provided in the mixtox v1.3 package by Xiangwei Zhu



3. Using GIS to target outreach For LADWP (Los Angeles Department of Water and Power) Customer Rebate Programs

Aim of literature: reduce traditional energy usage and promote sustainable energy production through geographically segmented marketing



Residential Relative Energy Efficiency Index (REEI) 2009-2012 Choropleth
Local Moran's I for REEI - most and least efficient block groups


Methodology:
1. Creation of a REEI (Relative Energy Efficiency Index)

  • Done by dividing the zonal average consumption growth rate by the consumption change rate for each block group.

2. Global Moran’s I

  • Determine if spatial autocorrelation is taking place

3. Local Moran's I

  • See where clustering is taking place


Learning Points:
1. REEI (Relative Energy Efficiency Index)

  • Team can look into calculating such an index for each HDB parcel
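
A minimal Python sketch of how such an index might be computed is given below. It is an interpretation under stated assumptions, not the LADWP study's exact procedure: change rates are taken as relative change between two periods, the zonal average rate is taken as the simple mean of the block-group rates, and all names and figures are made up for illustration.

```python
def change_rate(start, end):
    """Relative change in consumption between two periods."""
    return (end - start) / start

def relative_energy_efficiency_index(consumption):
    """Return zonal average change rate divided by each group's own change rate.

    `consumption` maps a block-group id to (start, end) consumption.
    Groups with a zero change rate get None because the ratio is undefined.
    """
    rates = {g: change_rate(s, e) for g, (s, e) in consumption.items()}
    zonal_rate = sum(rates.values()) / len(rates)  # assumed: simple mean
    return {g: (zonal_rate / r if r != 0 else None) for g, r in rates.items()}

# Made-up consumption figures (kWh) for three hypothetical block groups.
sample = {"A": (100.0, 110.0), "B": (100.0, 120.0), "C": (200.0, 200.0)}
reei = relative_energy_efficiency_index(sample)
```

When consumption is growing, a value above 1 means the block group's consumption grew more slowly than the zonal average, and a value below 1 means it grew faster.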


Areas for improvement:
1. The temperature data used was too simple - only two zones of temperature.

  • Our team will use the IDW interpolation method selected earlier to create a model of geospatial temperature variation that also allows for temporal analysis.



Approach

Data Collection and Preprocessing

  • We collected 2013-2016 average monthly household electricity consumption by postal code and dwelling type from the Energy Market Authority. The postal codes were matched with longitudes and latitudes using the OneMap API.
  • Singapore Temperature Historical Data from 2013 to 2016 at daily granularity were crawled from Meteorological Service Singapore.

Methodology
  • Hot Spot and Cold Spot maps:
The maps show which areas have high energy consumption and which have low energy consumption. First, we used an adaptive distance weight matrix to define neighbours. Based on this weight matrix, we computed Getis-Ord Gi statistics. A hot spot area has a significantly positive Gi statistic, which means location i is associated with relatively high values in the surrounding locations. A cold spot area has a significantly negative Gi statistic, which means location i is associated with relatively low values in the surrounding locations.
  • LISA Map
Local Indicator of Spatial Association (LISA) maps help us identify the outliers and clusters of the energy consumption observations.
  • Spatial Interpolation
Since only 21 meteorological stations in Singapore have complete data, spatial interpolation is used to estimate temperature at other locations from the points with known values.
  • Geographically Weighted Regression Local R-squared
The estimated temperature and energy consumption data will be used in the Geographically Weighted Regression (GWR) model. The GWR model will generate Local R-squared values, which indicate how well the local regression model fits the observed y values. Very low values indicate that the local model is performing poorly. In our case, this means a weak relationship between temperature and energy consumption in those areas.
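
To illustrate how local R-squared values arise in GWR, here is a minimal Python sketch and not the model the team will configure. It assumes a fixed-bandwidth Gaussian kernel, a single predictor, and one common definition of local R-squared (one minus the kernel-weighted residual sum of squares over the kernel-weighted total sum of squares); the function name gwr_local_fit and the synthetic data are illustrative.

```python
import numpy as np

def gwr_local_fit(coords, x, y, bandwidth):
    """Fit one kernel-weighted least squares regression per location.

    Returns (local coefficients, local R-squared values).
    """
    coords = np.asarray(coords, dtype=float)
    y = np.asarray(y, dtype=float)
    X = np.column_stack([np.ones(len(y)), np.asarray(x, dtype=float)])

    local_coef = np.empty((len(y), X.shape[1]))
    local_r2 = np.empty(len(y))
    for i in range(len(y)):
        d = np.linalg.norm(coords - coords[i], axis=1)
        w = np.exp(-0.5 * (d / bandwidth) ** 2)  # Gaussian kernel weights
        XtW = X.T * w
        beta = np.linalg.solve(XtW @ X, XtW @ y)  # weighted least squares
        fitted = X @ beta
        y_bar = np.sum(w * y) / np.sum(w)         # kernel-weighted mean
        ss_res = np.sum(w * (y - fitted) ** 2)
        ss_tot = np.sum(w * (y - y_bar) ** 2)
        local_coef[i] = beta
        local_r2[i] = 1.0 - ss_res / ss_tot
    return local_coef, local_r2

# Synthetic data: 20 locations on a line. In the west (first 10 points) energy
# depends linearly on temperature; in the east it varies independently of it.
n = 20
idx = np.arange(n)
coords = np.column_stack([idx.astype(float), np.zeros(n)])
temperature = 27.0 + idx % 4
pattern = np.array([1.0, -1.0, -1.0, 1.0])[idx % 4]
energy = np.where(idx < 10, 100.0 + 10.0 * temperature, 385.0 + 5.0 * pattern)

coef, r2 = gwr_local_fit(coords, temperature, energy, bandwidth=2.0)
```

In this synthetic case the local R-squared is close to 1 in the west and close to 0 in the east, which is the kind of contrast a Local R-squared map is meant to reveal.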

Web Application Design

Design Inspiration

The dashboard design is inspired by https://stanleyadion.shinyapps.io/AmazeingCrop

Initial Storyboard

Design Description
1. Elec3city dashboard 1.jpg
  • Project and Dataset Overview
2. Elec3city dashboard 2.jpg
  • Bivariate Choropleth Maps showing relationships between energy consumption with other factors
  • Users can choose the factor they want to compare with energy consumption
3. Elec3city dashboard 3.jpg
  • A box plot showing distributions of energy consumption by Planning Zone and Dwelling Type
4. Elec3city dashboard 4.jpg
  • LISA maps showing spatial clustering of energy consumption observations
5. Elec3city dashboard 5.jpg
  • Overview of Data for GWR model
6. Elec3city dashboard 6.jpg
  • Transform Data for GWR model
  • Users can use a histogram to check whether the variable is normally distributed
7. Elec3city dashboard 7.jpg
  • Select Variables for GWR model
  • Users can remove correlated variables with the help of the correlation matrix plot
8. Elec3city dashboard 8.jpg
  • Configure a GWR model and view the results

Project Challenges

1. Temperature Data Collection
Description: Temperature data can only be downloaded from Meteorological Service Singapore for one station and one month at a time. With more than 60 stations and 4 years of data needed for this project, manual downloading would be very time-consuming.
Solution:
  • Discovered a pattern in the data links
  • Used Excel to auto-generate all the required data links
  • Used Internet Download Manager to download from all the data links
2. Imperfect Temperature Data
Description: Temperature information is only collected at the designated temperature stations.
Solution:
  • Use spatial interpolation techniques to estimate the temperature around the temperature stations.
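
The link-generation step for the first challenge could also be scripted instead of done in Excel. The Python sketch below only shows the idea: the URL template, the example.org host and the station codes are hypothetical placeholders, since the proposal does not state the actual link pattern.

```python
from itertools import product

# Hypothetical template; the real link pattern is not given in this proposal.
URL_TEMPLATE = "https://example.org/dailydata/DAILYDATA_{station}_{year}{month:02d}.csv"

def build_links(station_codes, years, months=range(1, 13)):
    """Generate one link per station, year and month combination."""
    return [
        URL_TEMPLATE.format(station=s, year=y, month=m)
        for s, y, m in product(station_codes, years, months)
    ]

# Two hypothetical station codes, 2013 to 2016 inclusive.
links = build_links(["S24", "S50"], range(2013, 2017))
```

The resulting list can be saved to a text file and handed to any download tool, which mirrors the Excel and Internet Download Manager workflow described above.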




Project Timeline



Gantt Chart of Team's Timeline - FULL Updated Version
Snapshot of Gantt Chart (as of 3 March 2019)
