|
|
Line 6: |
Line 6: |
| <font color="#FFFFFF" size="2"><strong>PROJECT PROPOSAL</strong></font> | | <font color="#FFFFFF" size="2"><strong>PROJECT PROPOSAL</strong></font> |
| | style="font-family:Century Gothic; font-size:100%; background:#708090; text-align:center; border-left: 0px" width="25%" | | | | style="font-family:Century Gothic; font-size:100%; background:#708090; text-align:center; border-left: 0px" width="25%" | |
− | [[IS415 Team wiki: 2017T2 On The Fly Project Poster|<font color="#FFFFFF" size="2"><strong>PROJECT POSTER</strong></font>]] | + | [[1718t1is428T15 Poster|<font color="#FFFFFF" size="2"><strong>PROJECT POSTER</strong></font>]] |
| | style="font-family:Century Gothic; font-size:100%; background:#778899; text-align:center;border-left: 0px" width="25%" | | | | style="font-family:Century Gothic; font-size:100%; background:#778899; text-align:center;border-left: 0px" width="25%" | |
− | [[IS415 Team wiki: 2017T2 On The Fly Project Application|<font color="#FFFFFF" size="2"><strong>PROJECT APPLICATION</strong></font>]] | + | [[1718t1is428T15 Application|<font color="#FFFFFF" size="2"><strong>PROJECT APPLICATION</strong></font>]] |
| | style="font-family:Century Gothic; font-size:100%; background:#708090; text-align:center; border-left: 0px" width="25%" | | | | style="font-family:Century Gothic; font-size:100%; background:#708090; text-align:center; border-left: 0px" width="25%" | |
− | [[IS415 Team wiki: 2017T2 On The Fly Research Paper|<font color="#FFFFFF" size="2"><strong>RESEARCH PAPER</strong></font>]] | + | [[1718t1is428T15 Research Paper|<font color="#FFFFFF" size="2"><strong>RESEARCH PAPER</strong></font>]] |
| |} | | |} |
| <br /> | | <br /> |
Line 35: |
Line 35: |
| <!-- END PROJECT OBJECTIVE --> | | <!-- END PROJECT OBJECTIVE --> |
| | | |
− | <!-- START TECHNOLOGY--> | + | <!-- START DATASET--> |
| | | |
− | ==<div style="margin-top: 10px;font-family: Helvetica; text-align: left;font-size:20px; border: 5px solid #00000000; border-radius:5px; text-align:center; background-color: #708090; color: white; padding: 2px"><span style="font-size:24px;">T</span>echnology</div>== | + | ==<div style="margin-top: 10px;font-family: Helvetica; text-align: left;font-size:20px; border: 5px solid #00000000; border-radius:5px; text-align:center; background-color: #708090; color: white; padding: 2px"><span style="font-size:24px;">D</span>ataset</div>== |
| | | |
− | <!-- START SYSTEM ARCHITECTURE --> | + | <!-- END DATASET--> |
| | | |
− | ===<div font-family: Century Gothic; padding: 0px 30px 0px 18px;"><span style="font-size:20px;"> System Architecture</span></div>===
| + | <!-- START RELATED WORKS--> |
− | [[File:IS415-OnTheFly-SA.png|400px|center]]
| |
− | <!-- END SYSTEM ARCHITECTURE --> | |
| | | |
− | <!-- START R LIBRARY --> | + | ==<div style="margin-top: 10px;font-family: Helvetica; text-align: left;font-size:20px; border: 5px solid #00000000; border-radius:5px; text-align:center; background-color: #708090; color: white; padding: 2px"><span style="font-size:24px;">R</span>elated <span style="font-size:24px">W</span>orks</div>== |
| | | |
− | ===<div font-family: Century Gothic; padding: 0px 30px 0px 18px;"><span style="font-size:20px;"> R Library</span></div>===
| + | <!-- END RELATED WORKS --> |
− | *shiny
| |
− | **Web Application Framework for R
| |
− | *maptools
| |
− | **Tools for Reading and Handling Spatial Objects
| |
− | *rgdal
| |
− | **Bindings for the Geospatial Data Abstraction Library
| |
− | *leaflet
| |
− | **Create Interactive Web Maps with the JavaScript 'Leaflet' Library
| |
− | *spatialEco
| |
− | **Functions for Kriging and Point Pattern Analysis
| |
− | *plyr
| |
− | **Tools for Splitting, Applying and Combining Data
| |
− | *spdep
| |
− | **Spatial Dependence: Weighting Schemes, Statistics and Models
| |
− | *GISTools
| |
− | **Some further GIS capabilities for R
| |
− | *spatstat
| |
− | **Spatial Point Pattern Analysis, Model-Fitting, Simulation, Tests
| |
− | *classInt
| |
− | **Choose Univariate Class Intervals
| |
− | *RColorBrewer
| |
− | **ColorBrewer Palettes
| |
− | *rsconnect
| |
− | **Deployment Interface for R Markdown Documents and Shiny Applications
| |
− | *openxlsx
| |
− | **Read, Write and Edit XLSX Files
| |
− | <!-- END R LIBRARY --> | |
| | | |
− | <!-- END TECHNOLOGY--> | + | <!-- START INSPIRATIONS--> |
| | | |
− | <!-- START APPLICATION FEATURES-->
| + | ==<div style="margin-top: 10px;font-family: Helvetica; text-align: left;font-size:20px; border: 5px solid #00000000; border-radius:5px; text-align:center; background-color: #708090; color: white; padding: 2px"><span style="font-size:24px;">I</span>nspirations</div>== |
− | | |
− | ==<div style="margin-top: 10px;font-family: Helvetica; text-align: left;font-size:20px; border: 5px solid #00000000; border-radius:5px; text-align:center; background-color: #708090; color: white; padding: 2px"><span style="font-size:24px;">A</span>pplication <span style="font-size:24px">F</span>eatures</div>== | |
− | | |
− | <!-- START UPLOADING -->
| |
− | | |
− | ===<div font-family: Century Gothic; padding: 0px 30px 0px 18px;"><span style="font-size:20px;">Uploading and Processing On The Fly</span></div>===
| |
− | | |
− <b>Enerlyst</b> allows users to upload EMA housing data and processes it on the fly. Users can view the processed data on the Data tab. After uploading, users can select the type of data (residential, private or both). Data for the selected year and month is processed and displayed on the fly. This feature gives Enerlyst longevity, since future datasets can be analysed as they are released. The data can be found on the EMA website. <ref>https://www.ema.gov.sg/Statistics.aspx</ref><br /><br />
| |
− | | |
− | <b>Cleaning up of raw data before uploading</b>
| |
− | The data to be uploaded should be in the following format:
| |
− | *Geocoded, consisting of X and Y coordinates with column name as "X" and "Y" respectively
| |
− | *Row 4(Overall) of the EMA data has to be removed
| |
− *Should follow a naming convention of "YYYY_priv" for private housing data and "YYYY_pub" for public housing data
| |
− | *Should follow a file extension of ''xlsx''
| |
− *The two 6-month files should be merged into a single full-year file (only applicable for public housing data) <br /><br />
| |
− | | |
− The steps to convert the data into a format recognised by Enerlyst are as follows:
| |
− | | |
− | <u>a) Preparing raw data from EMA for 2013 Private Housing</u>
| |
− | | |
− | [[File:IS415-DataPrepPriv1.png|700px|center]]
| |
− | | |
− | 1. Copy out year data into a new excel file
| |
− | | |
− | [[File:IS415-DataPrepPriv2.png|700px|center]]
| |
− | | |
− | 2. Save file as "2013_priv.xlsx"
| |
− | | |
− | 3. Delete Row 4 which contains the overall energy consumption
| |
− | | |
− | [[File:IS415-DataPrepPriv3.png|700px|center]]
| |
− | | |
− | 4. Add two columns in columns O and P, give them headers named "X" and "Y"
| |
− | | |
− | 5. Geocode the postal codes and put the results into "X" and "Y"
| |
− | | |
− | 6. Save the file
| |
− | | |
− | <u>b) Preparing raw data from EMA for 2013 Public Housing</u>
| |
− | | |
− | 1. Open up first half of the public data
| |
− | | |
− | 2. Open up a new excel file
| |
− | | |
− | [[File:IS415-DataPrepPub1.png|700px|center]]
| |
− | | |
− 3. Copy out each month's data into the file
| |
− | | |
− | [[File:IS415-DataPrepPub2.png|700px|center]]
| |
− | | |
− 4. Repeat steps 1 to 3 for the second half of the public data. At the end, there should be 12 sheets in total, ordered by month from January to December.
| |
− | | |
− | In each sheet of the new excel file:
| |
− | | |
− | [[File:IS415-DataPrepPub3.png|700px|center]]
| |
− | | |
− | 5. Delete Row 4 from each sheet
| |
− | | |
− | [[File:IS415-DataPrepPub4.png|700px|center]]
| |
− | | |
− | 6. Add two columns in columns O and P, give them headers named "X" and "Y"
| |
− | | |
− | 7. Geocode the postal codes and put the results into "X" and "Y"
| |
− | | |
− | 8. Save the file as "2013_pub.xlsx"
| |
− | | |
− | <!-- END UPLOADING -->
| |
− | | |
− | <b>Uploading Files to Enerlyst</b>
| |
− | | |
− | Once the data files for 2013 to 2015 are ready, we upload them into <b>Enerlyst</b>. The application reads the file’s name, and recognises the year and property type it represents.
| |
− | | |
− For private housing data, the application converts the sheet into a data frame. For public housing data, the application loops through the 12 sheets (months) of data, aggregating each month’s energy consumption by postal code. In other words, the application finds the total energy consumed by a residential building by summing the consumption of its 1-or-2-room, 3-room, 4-room and 5-room/executive apartments. The aggregate is transposed into a data frame, and columns are renamed to show the month. The data frames for private and public housing are similar, and contain the following columns:
| |
− | | |
− | <div style="text-align:center">
| |
− | - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
| |
− | | |
− | Postal Code ║ Jan ║ Feb ║ Mar ║ Apr ║ May ║ Jun ║ Jul ║ Aug ║ Sep ║ Oct ║ Nov ║ Dec ║ X ║ Y
| |
− | | |
− | - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
| |
− | </div>
| |
− | | |
− After the data frame has been constructed, the application cleans up ‘na’ and ‘s’ values, which represent negligible levels of energy consumption and suppressed individual data. These values are replaced with zeroes and treated as housing with no energy consumption. The application then uses the data frame’s X and Y coordinates to convert it into a spatial points data frame, and changes its coordinate reference system to WGS84.
| |
− | | |
− To allow users to analyse energy consumption clusters by housing type, <b>Enerlyst</b> then identifies which subzone each residential building belongs to, and computes 1) private housing’s average energy consumption by subzone, 2) public housing’s average energy consumption by subzone, and 3) combined average energy consumption by subzone. To perform the computation, the following details need to be derived from the data frames:
| |
− | *Total energy consumption of private housing by subzone
| |
− | *Total energy consumption of public housing by subzone
| |
− | *Total energy consumption of all housing by subzone
| |
− | *Count of private residential building per subzone
| |
− | *Count of public residential building per subzone
| |
− | *Count of all residential building per subzone
| |
− | | |
− | <!-- END ON THE FLY -->
| |
− | | |
− | <!-- START CHOROPLETH -->
| |
− | | |
− | ===<div font-family: Century Gothic; padding: 0px 30px 0px 18px;"><span style="font-size:20px;">Choropleth Map</span></div>===
| |
− | [[File:IS415-OnTheFly-ChoroSS.png|700px|center]]
| |
− | | |
− Using monthly raw data on residential energy consumption from EMA, Enerlyst aggregates the energy consumption by subzone and then finds the average consumption per apartment block in each subzone.
| |
− | | |
− | Enerlyst provides an overview of each subzone's average energy consumption using three different classification techniques:
| |
− | | |
− | *Natural break Jenks
| |
− | *Equal Interval
| |
− | *Quantile
| |
− | | |
− | Users are able to select different classifications, colors and number of classes using the selecting panel on the left. Changes will be updated dynamically once the user has finalised the selection.
| |
− | <!-- END CHOROPLETH -->
| |
− | | |
− | | |
− | <!-- START LOCAL MORAN I -->
| |
− | | |
− | ===<div font-family: Century Gothic; padding: 0px 30px 0px 18px;"><span style="font-size:20px;">Local Moran's I</span></div>===
| |
− | [[File:IS415-OnTheFly-LocalMISS.png|700px|center]]
| |
− Enerlyst provides local autocorrelation analysis, in which hot and cold clusters are identified in terms of residential energy consumption. The Local Moran's I statistic of spatial association for each subzone is given as:
| |
− | | |
− | [[File:LocalMoranIFormula.png.png|400px|center]]
| |
− | | |
− Where (''x<sub>i</sub>'' - ''X-bar'') is the deviation of a subzone's energy consumption with respect to the mean of its neighbours, ''w<sub>ij</sub>'' is the spatial weight between two subzones, and
| |
− | | |
− | [[File:LocalMoranIFormula2.png.png|400px|center]]
| |
− | | |
− | with ''n'' being the number of subzones in Singapore. Each subzone's neighbour is defined as neighbouring subzones with which it shares a border.
| |
− | | |
− | There is also a scatterplot between X and the "spatial lag" of X, formed by averaging all values of X for the neighboring polygons, where X is a subzone's average apartment block energy consumption. The plot identifies which type of spatial autocorrelation exists.
| |
− | [[File:IS415-OnTheFly-Scatterplot.png|700px|center]]
| |
− | <!-- END LOCAL MORAN I -->
| |
− | | |
− | <!-- START LISA -->
| |
− | | |
− | ===<div font-family: Century Gothic; padding: 0px 30px 0px 18px;"><span style="font-size:20px;">LISA</span></div>===
| |
− | [[File:IS415-OnTheFly-LisaSS.png|700px|center]]
| |
− | Extending from Local Moran's I, Enerlyst uses LISA to show each subzone's statistically significant relationship with its neighbors, and show the type of relationship. The quadrants in the plot can be interpreted in the following manner:
| |
− | | |
− | *Top-left quadrant = low-high cluster
| |
− | *Top-right quadrant = high-high cluster
| |
− | *Bottom-left quadrant = low-low cluster
| |
− | *Bottom-right quadrant = high-low cluster
| |
− | <!-- END LISA -->
| |
− | | |
− | <!-- END APPLICATION FEATURES-->
| |
− | | |
− | <!-- START CASE STUDY ANALYSIS-->
| |
− | | |
− | ==<div style="margin-top: 10px;font-family: Helvetica; text-align: left;font-size:20px; border: 5px solid #00000000; border-radius:5px; text-align:center; background-color: #708090; color: white; padding: 2px"><span style="font-size:24px;">C</span>ase <span style="font-size:24px">S</span>tudy <span style="font-size:24px">A</span>nalysis</div>==
| |
| EMA publishes energy statistics on an annual basis to provide readers with a comprehensive understanding of the Singapore energy landscape through detailed coverage of various energy-related topics. As project Enerlyst focuses on analysing households' energy consumption, only private and public household data will be used. This study will be based on the EMA dataset from 2013 to 2015. The 2013 data will be prepared manually, whereas the 2014 and 2015 data will be uploaded to the application and processed on the fly. | | EMA publishes energy statistics on an annual basis to provide readers with a comprehensive understanding of the Singapore energy landscape through detailed coverage of various energy-related topics. As project Enerlyst focuses on analysing households' energy consumption, only private and public household data will be used. This study will be based on the EMA dataset from 2013 to 2015. The 2013 data will be prepared manually, whereas the 2014 and 2015 data will be uploaded to the application and processed on the fly. |
| | | |
− | <!-- END CASE STUDY ANALYSIS--> | + | <!-- END INSPIRATIONS--> |
− | | |
− | ===<div font-family: Century Gothic; padding: 0px 30px 0px 18px;"><span style="font-size:18px;">Choropleth Map</span></div>===
| |
− | <div align = center><strong>Private Housing</strong></div><br />
| |
− | [[File:IS415-Group2-OnTheFly-choroSSPrivate.png|700px|center]]<br />
| |
− Higher energy consumption can be found in the central region.
| |
− | [[File:IS415-Group2-OnTheFly-choroSSPrivate2.png|700px|center]]<br />
| |
− | Sungei Road sub zone has the highest average energy consumption of approximately 2163 kWh.
| |
− | <div align = center><strong>Public Housing</strong></div><br />
| |
− | [[File:IS415-Group2-OnTheFly-choroSSPublic.png|700px|center]]<br />
| |
− The north-east region has a cluster of subzones with higher energy consumption.
| |
− | [[File:IS415-Group2-OnTheFly-choroSSPublic2.png|700px|center]]<br />
| |
− | Lower Seletar subzone has the highest average energy consumption of approximately 1024 kWh.
| |
| | | |
− Choropleth maps may seem to be a decent indicator of spatial clustering at a glance. When spatial polygons are the same color as their neighboring polygons, it may appear to signify a clustering of features around the attribute of interest. This, however, is misleading, as the choice of classification method and number of classes can result in very different-looking choropleth maps. The map creator paints the picture by controlling these variables, so the objectivity of the analysis is questionable at best.
| + | <!-- START PROPOSED STORYBOARD--> |
− | <div align = center><strong>Jenks Natural Breaks</strong></div><br /> | |
− | [[File:IS415-Group2-OnTheFly-choroSS1.png|700px|center]]<br />
| |
− | <div align = center><strong>Equal Interval</strong></div><br />
| |
− | [[File:IS415-Group2-OnTheFly-choroSS2.png|700px|center]]<br />
| |
| | | |
− For instance, in the choropleth map of energy consumption for four months of 2013 (March, June, September and December), a classification using Jenks Natural Breaks places the Paterson and Dunearn subzones in the central region in the highest energy consumption class. Under an Equal Interval classification, however, Paterson and Dunearn are no longer in the highest class. Hence, a choropleth map can be misleading despite its visual appeal. An analysis such as spatial autocorrelation can provide concrete evidence of spatial clustering.
| + | ==<div style="margin-top: 10px;font-family: Helvetica; text-align: left;font-size:20px; border: 5px solid #00000000; border-radius:5px; text-align:center; background-color: #708090; color: white; padding: 2px"><span style="font-size:24px;">P</span>roposed <span style="font-size:24px">S</span>toryboard</div>== |
| | | |
− | ===<div font-family: Century Gothic; padding: 0px 30px 0px 18px;"><span style="font-size:18px;">Local Moran's I</span></div>===
| + | <!-- END PROPOSED STORYBOARD--> |
− | <div align = center><strong>Private Housing</strong></div><br />
| |
− | [[File:IS415-Group2-OnTheFly-LMIPrivate.png|700px|center]]<br />
| |
− There are clusters of subzones with almost equivalent energy consumption in the west, central and east regions.
| |
− | <div align = center><strong>Public Housing</strong></div><br />
| |
− | [[File:IS415-Group2-OnTheFly-LMIPublic.png|700px|center]]<br />
| |
− The clusters of subzones with almost equivalent energy consumption are in the west, north-east and east regions.
| |
− | [[File:IS415-Group2-OnTheFly-LMISP.png|700px|center]]<br />
| |
− | Together with the Local Moran's I, a Moran scatterplot is available to complement the Local Moran's I. It provides an easy way to categorize the nature of spatial autocorrelation into the four classifications which are mainly high-high, high-low, low-low, and low-high. The scatterplot compares the value of the selected variable (x- axis) with its own spatial lagged value (y-axis). This lagged value is derived from the average of the value of the same variable from its neighbors.
| |
| | | |
− | ===<div font-family: Century Gothic; padding: 0px 30px 0px 18px;"><span style="font-size:18px;">LISA</span></div> ===
| + | <!-- START TECHNICAL CHALLENGES--> |
− | <div align = center><strong>Private Housing</strong></div><br />
| |
− | [[File:IS415-Group2-OnTheFly-LISAPrivate.png|700px|center]]<br />
| |
− The LISA for the private housing dataset in December 2015 shows that in the west region, the Saujana, Jelabu, Dairy Farm and Bangkit subzones have significantly higher energy consumption than the mean for private housing, and their neighbouring subzones are highly similar. In the east region, the Bayshore subzone is identified as having higher energy consumption, and neighbouring subzones such as Siglap share similar traits.
| |
− | <div align = center><strong>Public Housing</strong></div><br />
| |
− | [[File:IS415-Group2-OnTheFly-LISAPublic.png|700px|center]]<br />
| |
− LISA corroborates the Local Moran's I result: the Keat Hong and Hougang East subzones share higher electricity consumption with their neighbouring subzones.
| |
| | | |
| + | ==<div style="margin-top: 10px;font-family: Helvetica; text-align: left;font-size:20px; border: 5px solid #00000000; border-radius:5px; text-align:center; background-color: #708090; color: white; padding: 2px"><span style="font-size:24px;">T</span>echnical <span style="font-size:24px">C</span>hallenges</div>== |
| | | |
− With such information, energy-saving solutions can be implemented in the identified subzones to further reduce energy consumption.
| + | <!-- END TECHNICAL CHALLENGES--> |
| | | |
| <!-- START TIMELINE --> | | <!-- START TIMELINE --> |
Line 306: |
Line 101: |
| <!-- END TIMELINE --> | | <!-- END TIMELINE --> |
| | | |
− | <!-- START FUTURE WORK --> | + | <!-- START TECHNOLOGIES--> |
| | | |
− | ==<div style="margin-top: 10px;font-family: Helvetica; text-align: left;font-size:20px; border: 5px solid #00000000; border-radius:5px; text-align:center; background-color: #708090; color: white; padding: 2px"><span style="font-size:24px;">F</span>uture <span style="font-size:24px">W</span>ork</div>== | + | ==<div style="margin-top: 10px;font-family: Helvetica; text-align: left;font-size:20px; border: 5px solid #00000000; border-radius:5px; text-align:center; background-color: #708090; color: white; padding: 2px"><span style="font-size:24px;">T</span>echnologies/<span style="font-size:24px">T</span>ools</div>== |
| | | |
− | *Allowing analyst to upload industrial energy usage data
| + | <!-- END TECHNOLOGIES--> |
− | *Performing cluster analysis using point data
| |
− | *Including Geary C analysis on top of Local Moran's I
| |
− | <!-- END FUTURE WORK --> | |
| | | |
| <!-- START REFERENCE --> | | <!-- START REFERENCE --> |
Line 320: |
Line 112: |
| <references /> | | <references /> |
| <!-- END REFERENCE --> | | <!-- END REFERENCE --> |
| + | |
| + | <!-- START COMMENTS --> |
| + | |
| + | ==<div style="margin-top: 10px;font-family: Helvetica; text-align: left;font-size:20px; border: 5px solid #00000000; border-radius:5px; text-align:center; background-color: #708090; color: white; padding: 2px"><span style="font-size:24px;">C</span>omments</div>== |
| + | <references /> |
| + | <!-- END COMMENTS --> |