Difference between revisions of "Group06 Elec3city Proposal"
Yu.fu.2015 (talk | contribs) |
Yu.fu.2015 (talk | contribs) |
||
Line 184: | Line 184: | ||
<Strong>Data Collection and Preprocessing</Strong> | <Strong>Data Collection and Preprocessing</Strong> | ||
<p> | <p> | ||
− | We collected 2013- 2016 average monthly household electricity consumption by postal code and dwelling type from Energy Market Authority. The postal codes are matched with longitudes and latitudes with the use of OneMap API. | + | *We collected 2013- 2016 average monthly household electricity consumption by postal code and dwelling type from Energy Market Authority. The postal codes are matched with longitudes and latitudes with the use of OneMap API. |
− | Singapore Temperature Historical Data from 2013 to 2016 at daily granularity were crawled from Meteorological Service Singapore. | + | *Singapore Temperature Historical Data from 2013 to 2016 at daily granularity were crawled from Meteorological Service Singapore. |
<br> | <br> | ||
<br> | <br> |
Revision as of 19:04, 14 April 2019
Contents
Project Motivation
Rising energy consumption is an issue that plagues the Singapore government for several years. Recently, the government has begun pushing for more efficient energy usage, and most effort is expended on the efficiency of energy sources – e.g. using less carbon-intensive fuels. In exploring potential ways to aid this cause, we realised that there has been scant statistical analysis energy consumption patterns. As such, our team feels that there is a need for an app which allows for authorities in Singapore such as the National Environment Agency to understand with data-driven evidence the origins of variation in Singapore energy consumption, so as to allow for more targeted efforts to reduce energy wastage. Another factor that provided a fertile ground for the development of this application is the availability of energy consumption data, with granularity right down to individual postal code.
Project Objective
We have been observing rising trends in energy consumption across all sub-sectors, including households, which has seen sharp increase over the years from 6092.5GWh in 2005 to 7295.8GWh in 2017. As such, household sub-sector is a good starting point for our analysis.
- Energy Consumption Patterns in Singapore
Through the exploration of potential energy consumption patterns among households in Singapore, we will be able to tease out potential drivers of energy consumption. We will not only be looking at seasonal patterns in consumption (i.e. Is there a particular month in a year when energy consumption spikes?), we will also try to identify spatial patterns in consumption (i.e. Is there a particular region in Singapore whereby energy consumption is particularly high?).
- Temperature VS Energy Consumption
The next issue that we are interested in exploring is the relationship between temperature and energy consumption. Intuitively, we will assume that as temperature increases, energy consumption will also increase due to more electrical appliances used to regulate temperature such as air-conditioner. Thus, we plan to explore the relationship between temperature in different region of Singapore, and how this will affect the energy consumption in each respective region.
Data
Literature Review
In our due diligence for the project, the team looked at multiple research papers to inform and influence us in the best practices for analyzing geospatial variation in energy use, when it is to be compared against variables such as temperature and housing composition.
1. Appropriate use of Interpolation Methods in GIS - Mitas, L. and Mitasova, H. Spatial Interpolation, Chap. 34 Spatial Interpolation (2005)Aim of literature: to enlighten reader of the appropriate interpolation method for different GIS themes.
Methodology:
1. Inverse Distance Weighted Interpolation (IDW) - adopted
2. Kriging - rejected
3. Regularised spline with tension (RST) - rejected
Learning Points:
1. Inverse Distance Weighted Interpolation (IDW)
- Pro: relatively less demanding computationally
- Pro: better at reproducing approximations on linear patterns
- Con: "produces local extrema at the data points"
2. Kriging
- Con: While good at predicting spatial distribution of uncertainty, it is less successful for applications where local geometry and smoothness are the key issues - Critical weakness for our interpolation of temperature data where granularity is at housing block level, thus Kriging is rejected.
3. Regularised spline with tension (RST)
- Pro: Allows for smoothing according to parameters such as the tension φ and smoothing weights {wj} which are empirically informed through minimisation of the predictive error estimated by a cross-validation procedure
- Pro: Can realistically represent rough gradients in spite of the smoothness condition, if the roughness is sufficiently described by the input data - might be true of temperature when it comes to the Urban Heat Island effect - pockets of high building density can cause a micro-climate of higher temperatures; particularly pertinent in Singapore.
- Con: requires a lot of 'guess-timation' and past domain knowledge to fine-tune the tension and smoothing weights.
Areas for improvement:
Our team has selected IDW as the interpolation technique for smoothing of temperature data of the 22 weather stations across Singapore.
Aim of literature: investigate poverty and inequities that are associated with vegetation
Methodology:
1. Pixel level regression - Curve Fit extension in ArcGIS
- Run regression trend analysis using raster datasets for temporal analysis
2. Global Ordinary Least Squares (OLS) regression
- Capture global geospatial correlation
3. Local Geographically Weighted Regression (GWR)
- Capture local geospatial correlation
4. Moran's I for spatial autocorrelation
- For local level analysis of spatial autocorrelation
5. Local Indicators of Spatial Association (LISA) map - Contiguity Edges and Corners method
- Queen contiguity to show clustering
Learning Points:
1. Pixel level regression - Curve Fit extension in ArcGIS
- Helps us see the degree of model prediction for energy consumption given our variables
2. Global Ordinary Least Squares (OLS) regression
- Investigate if the distributions of these random variables all have the same variance and a mean of zero. If so, then the least squares method may be the best unbiased linear estimator of the model coefficient.
- If residuals are spatially correlated, OLS results are biased. GWR models would then be used to remove the spatial autocorrelation of residuals.
3. Local Geographically Weighted Regression (GWR)
- Provides local t-values with which to find level of confidence in our local model
4. Moran's I for spatial autocorrelation
- We can use this to ascertain if local level analysis is indeed appropriate to understand the relationship between income level and energy consumption, after accounting for other factors like number of household members and number of rooms.
5. Local Indicators of Spatial Association (LISA) map - Contiguity Edges and Corners method
- Shows us clustering of energy consumption at local level
Areas for improvement:
1. Pixel level regression - Curve Fit extension in ArcGIS
- No ArcGIS - so we use curveFit function provided in mixtox v1.3 package by Xiangwei Zhu
Aim of literature: reduce traditional energy usage and promoting sustainable energy production through geographically segmented marketing
Methodology:
1. Creation of a REEI (Relative Energy Efficiency Index)
- Done by dividing the zonal average consumption growth rate by the consumption change rate for each block group.
2. Global Moran’s I
- Determine if spatial autocorrelation is taking place
3. Local Moran's I
- See where clustering is taking place
Learning Points:
1. REEI (Relative Energy Efficiency Index)
- Team can look into calculating such an index for each HDB parcel
Areas for improvement:
1. The temperature data used was too simple - only two zones of temperature.
- Our team will use the previously learnt RST interpolation method to create a model for temperature geospatial variation, that also allows for temporal analysis.
Approach
Data Collection and Preprocessing
- We collected 2013- 2016 average monthly household electricity consumption by postal code and dwelling type from Energy Market Authority. The postal codes are matched with longitudes and latitudes with the use of OneMap API.
- Singapore Temperature Historical Data from 2013 to 2016 at daily granularity were crawled from Meteorological Service Singapore.
Methodology
- Hot Spot and Cold Spot maps:
- LISA Map
- Spatial Interpolation
- Geographically Weighted Regression Local R square.
Web Application Design
Design Inspiration
The dashboard design is inspired by https://stanleyadion.shinyapps.io/AmazeingCrop
Initial Storyboard
Project Challenges
Key Challenges | Description | Solution | |
---|---|---|---|
1. | Temperature Data Collection | We can only download the temperature data from Meteorological Service Singapore for one station and one month each time. There are more than 60 stations and 4 years of data to be downloaded for this project, which can be very time consuming. |
|
2. | Imperfect Temperature Data | Temperature information is only collected at the designated temperature stations. |
|
Project Timeline
Gantt Chart of Team's Timeline - FULL Updated Version
Snapshot of Gantt Chart (as of 3 March 2019)
No. |
Name |
Date |
Comments |
1. |
Insert your Name here |
Insert Date here |
Insert Comment here |
2. |
Insert your Name here |
Insert Date here |
Insert Comment here |
3. |
Insert your Name here |
Insert Date here |
Insert Comment here |