Difference between revisions of "IS428 2016-17 Term1 Assign3 Chen Huiyan"
(Replaced content with "==Data Source== ==Data Transformation== ==The Tasks== ==Tools Utilized== ==Final Visualization== ==Comments and Suggestions==") |
|||
(39 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
==Data Source== | ==Data Source== | ||
+ | *A building layout for the GAStech offices, including the maps of the prox zones and the HVAC zones | ||
+ | *A current list of employees, roles, and office assignments | ||
+ | *A description of the data formats and fields provided | ||
+ | *Proximity sensor data for each of the prox zone regions | ||
+ | *Proximity sensor data from Rosie the mobile robot | ||
+ | *HVAC sensor readings and status information from each of the building’s HVAC zones | ||
+ | *Hazium readings from four sensors. | ||
+ | |||
+ | ==Data Analysis== | ||
+ | Proximity sensor data for each of the prox zone regions is used to discover the typical working patterns for GAStech employees. In order to identify the pattern, I need find the relationship between prox-id and employees. Through some observations, I have spotted prox-id is constructed by the first character of employee's first name and full last name followed by numbers which start from 001. For example, one employee called Mat | ||
+ | Bramar with prox-id "mbramar001". In addition, the same employee may have multiple prox-ids with increasing numbers such as 002. | ||
+ | |||
==Data Transformation== | ==Data Transformation== | ||
+ | For proxOut-MC2.csv, I have extracted employee's last name to identify the unique employee from prox-id by using the "Left and Right" functions in Excel. | ||
+ | [[File:ExtractName.JPG]] | ||
+ | For bldg-MC2.csv, in order to discover interesting patterns in the building data, I have to segregate dataset by making use of Stack function of JMP Pro. | ||
+ | <br> | ||
+ | Step 1: Stack | ||
+ | [[File:Stack.JPG]] | ||
+ | Results: | ||
+ | [[File:StackResult.JPG]] | ||
+ | Step2: Create new columns for Floor, Zone and Element segregated by Label through applying formulas. | ||
+ | [[File:Formula.JPG]] | ||
− | + | Results: | |
+ | [[File:Step2Result.JPG]] | ||
+ | Finally, the results from Step 2 can be used by Tableau to produce visualizations. | ||
+ | ==The Tasks== | ||
+ | === What are the typical patterns in the prox card data? What does a typical day look like for GAStech employees?=== | ||
+ | [[File:WorkingPattern.jpg]] | ||
+ | Through looking at Gantt view above, I can easily tell most of employees work on the first and second level. For people working on the first floor, there are three working shifts: 12am - 7am (night shift), 7am - 4pm and 4pm - 11pm. For people on the second floor, there is no night shift and employees on the third floor only work from 7am to 4pm. | ||
+ | ===Describe up to ten of the most interesting patterns that appear in the building data. Describe what is notable about the pattern and explain its possible significance.=== | ||
+ | [[File:BuildingPatterns.png]] | ||
+ | * The Equipment Power of floor3 are much higher than floor 1 and floor 2. | ||
+ | * Floor 3 has the most number of COOLING COIL Power | ||
+ | * Floor 2 has the most number of concentration of C02 measured at the zone's return air grille | ||
+ | [[File:ElectricDemand.png]] | ||
+ | * The pattern of Electric Demand Power (total Electric Demand Power and HVAC Electric Demand Power) share the same trend regardless of different floors. | ||
+ | * People consume the most number of electric power at 8am and the most number of HVAC Electric Demand Power at 7am. | ||
+ | * The period from 6am to 6pm is the most power demanding period. | ||
+ | ===Describe up to ten notable anomalies or unusual events you see in the data. Prioritize those issues that are most likely to represent a danger or a serious issue for building operations.=== | ||
+ | * From patterns identified from the previous question, equipment power of floor 3 is extremely high which catches my attention. Through looking at Energy Zone, I have noticed that floor 3 places servers that explain the high equipment power. | ||
+ | * Concentration of C02 measured at the zone's return air grille of floor 2 is unusually high compared to other two floors which is bad for people although there are less number of employees working on the second floor than the first floor. | ||
+ | [[File:Employee.png]] | ||
+ | [[File:Hazium.png]] | ||
+ | * From the graph above, I am able to find out that Hazium level of the first-floor is relatively much lower than the other two floors. The only one sensor area (F3_Z1) detecting Hazium located on the third-floor shows that the uptrend of Hazium level from 12 am to 7 am and it starts slowly decreasing after that and Hazium level of the second-floor is increasing sharply from 8am and reaches the maximum level at 6pm. This phenomenon indicates a dangerous sign for people working from 4pm to 11pm at the second-level since the Hazium level of that period is relatively high. | ||
+ | ===Describe up to five observed relationships between the proximity card data and building data elements. If you find a causal relationship (for example, a building event or condition leading to personnel behavior changes or personnel activity leading to building operations changes), describe your discovered cause and effect, the evidence you found to support it, and your level of confidence in your assessment of the relationship=== | ||
==Tools Utilized== | ==Tools Utilized== | ||
− | + | * Excel to clean data | |
− | + | * JMP PRO to segregate and transform data | |
− | + | * Tableau to do data visualisation | |
− | |||
==Final Visualization== | ==Final Visualization== | ||
− | + | [[File:Dashboard.png]] | |
− | |||
− | |||
==Comments and Suggestions== | ==Comments and Suggestions== |
Latest revision as of 13:10, 24 October 2016
Contents
- 1 Data Source
- 2 Data Analysis
- 3 Data Transformation
- 4 The Tasks
- 4.1 What are the typical patterns in the prox card data? What does a typical day look like for GAStech employees?
- 4.2 Describe up to ten of the most interesting patterns that appear in the building data. Describe what is notable about the pattern and explain its possible significance.
- 4.3 Describe up to ten notable anomalies or unusual events you see in the data. Prioritize those issues that are most likely to represent a danger or a serious issue for building operations.
- 4.4 Describe up to five observed relationships between the proximity card data and building data elements. If you find a causal relationship (for example, a building event or condition leading to personnel behavior changes or personnel activity leading to building operations changes), describe your discovered cause and effect, the evidence you found to support it, and your level of confidence in your assessment of the relationship
- 5 Tools Utilized
- 6 Final Visualization
- 7 Comments and Suggestions
Data Source
- A building layout for the GAStech offices, including the maps of the prox zones and the HVAC zones
- A current list of employees, roles, and office assignments
- A description of the data formats and fields provided
- Proximity sensor data for each of the prox zone regions
- Proximity sensor data from Rosie the mobile robot
- HVAC sensor readings and status information from each of the building’s HVAC zones
- Hazium readings from four sensors.
Data Analysis
Proximity sensor data for each of the prox zone regions is used to discover the typical working patterns for GAStech employees. In order to identify the pattern, I need find the relationship between prox-id and employees. Through some observations, I have spotted prox-id is constructed by the first character of employee's first name and full last name followed by numbers which start from 001. For example, one employee called Mat Bramar with prox-id "mbramar001". In addition, the same employee may have multiple prox-ids with increasing numbers such as 002.
Data Transformation
For proxOut-MC2.csv, I have extracted employee's last name to identify the unique employee from prox-id by using the "Left and Right" functions in Excel.
For bldg-MC2.csv, in order to discover interesting patterns in the building data, I have to segregate dataset by making use of Stack function of JMP Pro.
Step 1: Stack
Step2: Create new columns for Floor, Zone and Element segregated by Label through applying formulas.
Finally, the results from Step 2 can be used by Tableau to produce visualizations.
The Tasks
What are the typical patterns in the prox card data? What does a typical day look like for GAStech employees?
Through looking at Gantt view above, I can easily tell most of employees work on the first and second level. For people working on the first floor, there are three working shifts: 12am - 7am (night shift), 7am - 4pm and 4pm - 11pm. For people on the second floor, there is no night shift and employees on the third floor only work from 7am to 4pm.
Describe up to ten of the most interesting patterns that appear in the building data. Describe what is notable about the pattern and explain its possible significance.
- The Equipment Power of floor3 are much higher than floor 1 and floor 2.
- Floor 3 has the most number of COOLING COIL Power
- Floor 2 has the most number of concentration of C02 measured at the zone's return air grille
- The pattern of Electric Demand Power (total Electric Demand Power and HVAC Electric Demand Power) share the same trend regardless of different floors.
- People consume the most number of electric power at 8am and the most number of HVAC Electric Demand Power at 7am.
- The period from 6am to 6pm is the most power demanding period.
Describe up to ten notable anomalies or unusual events you see in the data. Prioritize those issues that are most likely to represent a danger or a serious issue for building operations.
- From patterns identified from the previous question, equipment power of floor 3 is extremely high which catches my attention. Through looking at Energy Zone, I have noticed that floor 3 places servers that explain the high equipment power.
- Concentration of C02 measured at the zone's return air grille of floor 2 is unusually high compared to other two floors which is bad for people although there are less number of employees working on the second floor than the first floor.
- From the graph above, I am able to find out that Hazium level of the first-floor is relatively much lower than the other two floors. The only one sensor area (F3_Z1) detecting Hazium located on the third-floor shows that the uptrend of Hazium level from 12 am to 7 am and it starts slowly decreasing after that and Hazium level of the second-floor is increasing sharply from 8am and reaches the maximum level at 6pm. This phenomenon indicates a dangerous sign for people working from 4pm to 11pm at the second-level since the Hazium level of that period is relatively high.
Describe up to five observed relationships between the proximity card data and building data elements. If you find a causal relationship (for example, a building event or condition leading to personnel behavior changes or personnel activity leading to building operations changes), describe your discovered cause and effect, the evidence you found to support it, and your level of confidence in your assessment of the relationship
Tools Utilized
- Excel to clean data
- JMP PRO to segregate and transform data
- Tableau to do data visualisation