Difference between revisions of "IS428 2016-17 Term1 Assign3 Lee Wai Tong Arnold"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
Line 5: Line 5:
  
 
The new office is built to the highest energy efficiency standard, but as with any new building, there are still several HVAC issues to work out. The building is divided into several HVAC (heating, ventilation, and air conditioning) zones. Each zone is instrumented with sensors that report building temperatures, heating and cooling system status values, and concentration levels of various chemicals such as carbon dioxide (abbreviated CO2) and hazium (abbreviated Haz), a recently discovered and possibly dangerous chemical. CEO Sten Sanjorge Jr. has read about hazium and requested that these sensors be included. However, they are very new and very expensive, so GAStech can afford only a small number of sensors.
 
The new office is built to the highest energy efficiency standard, but as with any new building, there are still several HVAC issues to work out. The building is divided into several HVAC (heating, ventilation, and air conditioning) zones. Each zone is instrumented with sensors that report building temperatures, heating and cooling system status values, and concentration levels of various chemicals such as carbon dioxide (abbreviated CO2) and hazium (abbreviated Haz), a recently discovered and possibly dangerous chemical. CEO Sten Sanjorge Jr. has read about hazium and requested that these sensors be included. However, they are very new and very expensive, so GAStech can afford only a small number of sensors.
 +
 +
 +
=Objective=
 +
The objective here is to build an interactive visualization for GAStech in order to see how different building attributes may correlate with one another. It also allows users to identify patterns and gain insights about the building usage and employees behaviour. The visualization also allows GAStech management to have an understanding of their employees activities during the day.
  
  
Line 24: Line 28:
 
=== Mapping Prox-id to employee ===
 
=== Mapping Prox-id to employee ===
 
The data containing the employees did not the prox-id they were assigned to. Upon analyzing, I realized that the prox-id is generated based on the employee's name. Eg. Earlean Edward = eedward001. The prox-id is generated by taking the first character of the first name and combining it with the last name. With this insight, I was able to map the prox-id to the respective employee.  
 
The data containing the employees did not the prox-id they were assigned to. Upon analyzing, I realized that the prox-id is generated based on the employee's name. Eg. Earlean Edward = eedward001. The prox-id is generated by taking the first character of the first name and combining it with the last name. With this insight, I was able to map the prox-id to the respective employee.  
[[File:Arnold-a3-data3.PNG|600px|center]]
+
[[File:Arnold-a3-data3.PNG|400px|center]]
 
However, this did not work for all employees as some of them have more unique names. Eg. Ada Campo-Corrente = acampo. For such special cases, manual mapping was used.  
 
However, this did not work for all employees as some of them have more unique names. Eg. Ada Campo-Corrente = acampo. For such special cases, manual mapping was used.  
  
Line 143: Line 147:
 
* Tableau Background Images tutorial (https://www.youtube.com/watch?v=CdqEio_8KTo)
 
* Tableau Background Images tutorial (https://www.youtube.com/watch?v=CdqEio_8KTo)
 
* Tableau Drawing Polygons on Background Images (https://tableauandbehold.com/2015/04/13/creating-custom-polygons-on-a-background-image/)
 
* Tableau Drawing Polygons on Background Images (https://tableauandbehold.com/2015/04/13/creating-custom-polygons-on-a-background-image/)
 +
 +
 +
=Comments=

Revision as of 08:37, 24 October 2016

Link: https://public.tableau.com/profile/arnold1293#!/vizhome/a3_0/Story1

Contents

Problem Overview

After the successful resolution of the 2014 kidnapping at GAStech’s Abila, Kronos office, GAStech officials determined that Abila offices needed a significant upgrade. At the end of 2015, the growing company moved into a new, state-of-the-art three-story building near their previous location. Even though the employee morale rose somewhat with the excitement of the new building, there are still a few disgruntled employees in the company.

The new office is built to the highest energy efficiency standard, but as with any new building, there are still several HVAC issues to work out. The building is divided into several HVAC (heating, ventilation, and air conditioning) zones. Each zone is instrumented with sensors that report building temperatures, heating and cooling system status values, and concentration levels of various chemicals such as carbon dioxide (abbreviated CO2) and hazium (abbreviated Haz), a recently discovered and possibly dangerous chemical. CEO Sten Sanjorge Jr. has read about hazium and requested that these sensors be included. However, they are very new and very expensive, so GAStech can afford only a small number of sensors.


Objective

The objective here is to build an interactive visualization for GAStech in order to see how different building attributes may correlate with one another. It also allows users to identify patterns and gain insights about the building usage and employees behaviour. The visualization also allows GAStech management to have an understanding of their employees activities during the day.


Data Transformation

The provided data arrived in different forms, such as an image of the floor zones, building data and employee list.

Data challenges

Building Data

An immediate challenge faced was in the building data. Despite having a large amount the data, its format is not ideal. The floor and zone of the data are in the header, which makes it difficult to do comparison and filtering. As such, I decided to reorganize the data and separate them into different files for easier management.

Arnold-a3-data1.PNG
Arnold-a3-data2.PNG

I wrote a small Java program to read and separate the data into different files and also add additional columns to include the floor and zone.

Prox-id

All employees issued card has a prox-id which ends with a number. Eg. 001. As such, when employees lose their cards and get a replacement, they will be issued with another card numbered 002.

Arnold-a3-data4.PNG
Arnold-a3-data5.PNG

At the start, there were prox-id with same alphabets but different ending numbers. On closer investigation, the increment in numbers suggests that those were replacement cards but by the same employee. As such, I created another field called userid and removed the trailing numbers, leaving only the alphabets. This helps to identify the employee despite the different prox-id.

Mapping Prox-id to employee

The data containing the employees did not the prox-id they were assigned to. Upon analyzing, I realized that the prox-id is generated based on the employee's name. Eg. Earlean Edward = eedward001. The prox-id is generated by taking the first character of the first name and combining it with the last name. With this insight, I was able to map the prox-id to the respective employee.

Arnold-a3-data3.PNG

However, this did not work for all employees as some of them have more unique names. Eg. Ada Campo-Corrente = acampo. For such special cases, manual mapping was used.

Mapping floor zones on the map

To facilitate a better exploring experience, I decided to include mapping in the visualization. To achieve that, it was necessary to map out each floor zone on the map. I looked up a tutorial online and the references can be found below. First, I had to insert the image into Tableau and put it into the background. After which, I had to point and find out the coordinates of the corners surrounding the floor zone and write them into a csv file. This process was done manually. Afterwards, the csv file is loaded into Tableau and the polygons are created based on the data inside the csv.


Challenges and steps to overcome

Lack of background knowledge

One problem when doing this assignment was the lack of knowledge about buildings and its ventilation systems. As such, it was difficult to understand how HVAC and VAV Systems operate and their terminology. To overcome this, extra time had to be spent researching and understanding how do these systems work and how they correlate with one another. The improved background understanding offered better appreciation of the data given.


Visualization

Link: https://public.tableau.com/profile/arnold1293#!/vizhome/a3_0/Story1

Employee Activity

Arnold-a3-db1.PNG

This dashboard allows users to view employees' activities throughout the day. Based on the data provided, it will also indicate the estimate location of the user on the map below. However, since the fixed-proximity data does not contain any coordinate values, I plotted the employees' location randomly within the zone to simulate his presence. While this may not provide an accurate visualization of the exact employee's location, but it helps to plot his presence within the floor zone.

Uses:

  1. Filter by day. User can select the day he wants to explore by selecting the filter. Mandatory filtering by day is done to prevent overcrowding of data on the visualization which does not benefit the user. As such, I decided to limit the filter to per day, allowing the user to have a better experience and appreciate the employees' activities by hour per day.
  2. Filter by hour. The user can also select to filter by hour. This allows the user to see the employees movement on the map as he navigates across the hours. With it, he can see during which hours, where do the employees tend to gather or work.


Light and Equipment Analysis

Arnold-a3-db2.PNG

This dashboard allows user to view the different light and equipment power consumption for different zones.

Uses: Mouse over floor zone. By mousing over floor zone, the chart will filter accordingly to display the appropriate chart for that zone. Initially the view was set to display all. However because there are many different zones, the chart became too cluttered. As such, I decided to help the user filter when he mouses over the floor zone he wishes to see. If he wishes to view the charts for all the zones, he can still do so by scrolling down the chart.


Building Temperature and Electric Power

Arnold-a3-db3.PNG

This dashboard shows the user an overview of the temperature in the building and the electric power consumed. These charts are placed together because they relate to regulating the building's temperature.

Uses: Mouse over heatmap. When user sees an area of interest on the heatmap, he can mouse over the region to see how it corresponds with other charts.


Air Quality

Arnold-a3-db4.PNG

This dashboard shows the air quality in the building. Specifically, it analyzes the CO2 concentration and hazium concentration.

Uses: Mouse over charts. By hovering over the charts, user can view the floor zone highlighted at the bottom.


Water Heater Analysis

Arnold-a3-db5.PNG

This dashboard shows the data related to water heater.


VAV System Coil Power, Reheat Coil Power and Damper Position

Arnold-a3-db6.PNG

This dashboard shows charts related to coil power and damper position

Uses: Mouse over charts. By hovering over the charts, user can view the floor zone highlighted at the bottom.


Building Fan Power Analysis

Arnold-a3-db7.PNG

This dashboard shows charts related to fan power and supply fan.


Building Air Flow Analysis

Arnold-a3-db8.PNG

This dashboard shows charts related to air flow rate within the building.


Observations and Answer to Questions

What are the typical patterns in the prox card data? What does a typical day look like for GAStech employees?

  1. Most employees report to work from 7am onwards. Data reveals that during the 7th hour, employee activity is the highest.
  2. Lunch time at work is around 12pm onwards. For most employees, there is an increase in activity on the 12th hour.
  3. Office hours is until 5pm. For most employees, the last peak in activity is at 5pm after which there is no activity. This suggests that the employees are no longer in the office.
  4. Most departments only work until 5pm and go home. These departments are Admin, Executive, HR and Security. Most of these employees do not have any recorded activity after 5pm.
  5. Other departments like Engineering, Information Technology and Facilities tend to have longer operating hours as there are recorded activity from 7am until 11pm.
  6. On closer look, some employees that are detected in the office after office hours only have recorded activity from 4pm onwards. This suggests that they probably work the night shift and hence get off work later.
  7. Employees that report to work at 7am tend to get off work before 8pm. This suggests that employees usually do not need to work overtime.
  8. Every Sunday, there would be an employee from Administration in office in the afternoon. On both Sundays, a different Admin employee was detected. It is likely that the department has a rotation of which Admin employee come in on Sunday to work in the afternoon.


Describe up to ten of the most interesting patterns that appear in the building data. Describe what is notable about the pattern and explain its possible significance.

  1. Lights power and equipment power in certain floor zones share similar patterns. Example of such places are the offices on the 2nd floor. A possible reasoning is that an employee has entered the zone to do something. This suggests that the employee switched on the lights and use the equipment which explains to similar pattern in light and equipment power consumption.
  2. Some floor zones have constant usage of light power. An explanation for this is that these areas are common areas and thus the lights are always kept on for safety reasons.
  3. Another pattern that is observed is that lights and equipment power is various zones usually increase sharply from 6h and drops close to 0 after 17h. This probably maps the employees’ behavior in the office whereby the arrive into work in the morning and leave after office hours.
  4. Water heater gas rates is usually low during the weekends when there are no employees. However, during the weekdays, it is observed that the gas rate usually peaks at 12h, which coincides with the lunch hour. It is likely that employees are consuming more hot water during lunch which results in the water heater having to heat more water and thus the higher gas rate.
  5. Reheat damper position. It usually follows the employee activity patterns whereby there are higher recordings during office hours. This is likely due to damper position being used to regulate ventilation in the building and hence are more active during that time.
  6. Reheat damper position & supply inlet mass flow. From observation, these 2 readings follow very similar patterns. This suggests that they correlate with one another very strongly as when the damper position is opened more widely, it increases the inlet mass flow rate by the same constant.
  7. Total electric demand power. It follows a similar pattern whereby it goes up during office hours. During the day, it has 3 peaks during 8h, 12h and 17h. This coincides with employees arriving for work, going for lunch and leaving the office. The employees use more power during these hours. It also follows a similar pattern with HVAC electric consumption. It can mean that the increase electric consumption by HVAC significantly contribute to the increase in overall electric consumption.


Describe up to ten notable anomalies or unusual events you see in the data. Prioritize those issues that are most likely to represent a danger or a serious issue for building operations.

  1. On 11 june after 10am, hazium levels started to shoot upwards. Most levels reached the peak at 18 hour before starting to decline. Whereas, hazium level on level 3 continued to remain high until 21 hour.
  2. On 7 and 8 june, the building temperature was much higher than usual. The higher temperature may make it uncomfortable for employees who are working in the building.
  3. On 7 and 8 june, it is observed on the temp heating setpoint is set much higher than usual. This would have led to the air conditioners to hold the temperature at a higher reading and thus causing the temperature during those days to be much higher than usual. It is unlikely that this phenomenon is due to machine error as the setpoint was reduced much lower during the wee hours before going up again during the office hours. It is more likely an error in setting of the heating setpoint.
  4. It is observed that fan power on 7 and 8 june is not functioning well. The fan power on both days are observed to be lower than usual. This means that air might not be ventilated inside the building, posing a risk to employees with respiratory problems.
  5. Co2 concentration in the building on 7 and 8 was also observed to be higher on both days. This may be due to the poor ventilation as highlighted above. Higher co2 concentration may pose a risk to employees with respiratory problem.
  6. During the weekend of June 11 and 12, it is observed that HVAC demand power was high. This suggests that the HVAC systems were left on over the weekend. The power consumption was also much higher than normal office consumption. This suggests that there was some testing going on with the HVAC systems over the weekend, resulting in the high consumption of HVAC power.
  7. Electric consumption. During the weekend when most employees are not in office, it makes sense that electric consumption is low. However, on the weekend of June 11 and 12, it is observed that electricity consumption over that weekend remains high.
  8. Reheat coil power. During the weekend of june 11 and 12, reheat coil power was high and not in its usual form. This means that more heat is generated by the system and would result in warmer air to be produced in the building. This may explain why HVAC power consumption was high over the same period because it was trying to cool the building down.
  9. Reheat damper position. During the weekend of june 11 and 12, the reheat damper position was also left open. Because of that, heated air entered the building and cause the HVAC systems to work to bring down the building temperature.


Describe up to five observed relationships between the proximity card data and building data elements. If you find a causal relationship (for example, a building event or condition leading to personnel behavior changes or personnel activity leading to building operations changes), describe your discovered cause and effect, the evidence you found to support it, and your level of confidence in your assessment of the relationship.

  1. When the number of employees in the building is high, the consumption of building resources also increase. An example of this is the light and equipment power. The pattern follow closely to employee activities in the building. The extremely strong correlation suggests that the presence of employees greatly influence the usage of light and equipment. Given this strong evidence, I am highly confident of this assessment.


Tools used

  • Tableau
  • Microsoft Excel
  • Java code for extracting and printing data


References


Comments