Difference between revisions of "Kabak: Research Paper Data Preparation"
Jump to navigation
Jump to search
Line 89: | Line 89: | ||
|- | |- | ||
| | | | ||
− | *Stack data to consolidate data table in to 2 columns (Postal Code, Housing Type) | + | * Stack data to consolidate data table in to 2 columns (Postal Code, Housing Type) |
− | *Remove rows with missing data | + | * Remove rows with missing data |
|| | || | ||
− | |||
[[File: Kabakdatacleaning1.png|400px|center]] | [[File: Kabakdatacleaning1.png|400px|center]] | ||
|- | |- | ||
| | | | ||
− | *Concatenate all 12 months data into one consolidated data table | + | * Concatenate all 12 months data into one consolidated data table |
**By the end of this phase of data cleaning, we have a total of 177,053 rows | **By the end of this phase of data cleaning, we have a total of 177,053 rows | ||
|| | || | ||
[[File: Kabakdatacleaning2.png|400px|center]]|- | [[File: Kabakdatacleaning2.png|400px|center]]|- | ||
+ | |- | ||
| | | | ||
− | Merging Private Housing Data with Public Housing Data | + | * Merging Private Housing Data with Public Housing Data |
+ | **Final consolidated data consist of 241,766 rows | ||
|| | || | ||
[[File: Kabakdatacleaning3.png|400px|center]] | [[File: Kabakdatacleaning3.png|400px|center]] | ||
|} | |} | ||
<br/> | <br/> |
Revision as of 11:50, 22 November 2016
Initial Dataset
DATASET | DESCRIPTION | DATA USED |
---|---|---|
Average Monthly Household Electricity Consumption Link (1H): https://www.ema.gov.sg/cmsmedia/Publications_and_Statistics/Statistics/23RSU.xls Link (2H): https://www.ema.gov.sg/cmsmedia/Publications_and_Statistics/Statistics/25RSU.xls |
|
|
Average Monthly Household Electricity Consumption by Postal Code (Private Apartments), 2015 Link: https://www.ema.gov.sg/cmsmedia/Publications_and_Statistics/Statistics/2RSU.xls |
|
|
Basic Demographics Characteristics (2015) |
|
|
Data Cleaning
METHOD | DESCRIPTION |
---|---|
|
|
|
|- |
|