IS428 2016-17 Term1 Assign3 Jonathan Eduard Chua Lim

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search

Questions

As an expert in visual analytics, you have been hired to help GAStech understand its operations data. In this assignment, you are given two weeks of building and prox sensor data. Can you use visual analytics to identify typical patterns of and issues of concern?

You will be asked to answer the following types of questions:

  1. What are the typical patterns in the prox card data? What does a typical day look like for GAStech employees?
  2. Describe up to ten of the most interesting patterns that appear in the building data. Describe what is notable about the pattern and explain its possible significance.
  3. Describe up to ten notable anomalies or unusual events you see in the data. Prioritize those issues that are most likely to represent a danger or a serious issue for building operations.
  4. Describe up to five observed relationships between the proximity card data and building data elements. If you find a causal relationship (for example, a building event or condition leading to personnel behavior changes or personnel activity leading to building operations changes), describe your discovered cause and effect, the evidence you found to support it, and your level of confidence in your assessment of the relationship.

HVAC Information and its Significance

Field Units Description
F_#_BATH_EXHAUST:Fan Power [W] Power used by the bathroom exhaust fan
F_#_VAV_SYS AIR LOOP INLET Mass Flow Rate [kg/s] Total flow rate of air returning to the HVAC system from all zones it serves
F_#_VAV_SYS AIR LOOP INLET Temperature [C] Mixed temperature of air returning to the HVAC system from all zones it serves
F_# VAV Availability Manager Night Cycle Control Status On/off status of the HVAC system during periods when the system is normally scheduled off. The night cycle manager cycles the HVAC system to maintain night and weekend set point temperatures.
F_#_VAV_SYS COOLING COIL Power [W] Power used by the HVAC system cooling coil
F_#_VAV_SYS HEATING COIL Power [W] Power used by the HVAC system heating coil
F_#_VAV_SYS SUPPLY FAN OUTLET Mass Flow Rate [kg/s] Total flow rate of air delivered by the HVAC system fan to the zones it serves
F_#_VAV_SYS SUPPLY FAN OUTLET Temperature [C] Temperature of the air exiting the HVAC system fan
F_#_VAV_SYS SUPPLY FAN:Fan Power [W] Power used by the HVAC system fan
F_#_VAV_SYS Outdoor Air Flow Fraction Percentage of total air delivered by the HVAC system that is from the outside
F_#_VAV_SYS Outdoor Air Mass Flow Rate [kg/s] Flow rate of outside air entering the HVAC system
COOL Schedule Value Example The supply air temperature set point. Air exiting the HVAC system fan is maintained at this temperature during cooling operation
DELI-FAN Power [W] Power used by the deli exhaust fan
Drybulb Temperature [C] Drybulb temperature of the outside air
Wind Direction [deg] Direction of wind outside of the building
Wind Speed [m/s] Speed of wind outside of the building
HEAT Schedule Value The supply air temperature set point. Air exiting the HVAC system fan is maintained at this temperature during heating

operation

Pump Power [W] Power used by the hot water system pump
Water Heater Setpoint Water heater set point temperature
Water Heater Gas Rate [W] Rate at which the water heater burns natural gas
Water Heater Tank Temperature [C] Temperature of the water inside the hot water heater
Loop Temp Schedule Example Temperature set point of the hot water loop. This is the temperature at which hot water is delivered to hot water

appliances and fixtures.

Supply Side Inlet Mass Flow Rate [kg/s] Flow rate of water entering the hot water heater
Supply Side Inlet Temperature [C] Temperature of the water entering the hot water heater
Supply Side Outlet Temperature [C] Temperature of the water exiting the hot water heater
F_#_Z_# REHEAT COIL Power [W] Power used by the zone air supply box reheat coil
F_#_Z_# RETURN OUTLET CO2 Concentration [ppm] Concentration of C02 measured at the zone's return air grille
F_#_Z_# SUPPLY INLET Mass Flow Rate [kg/s] Flow rate of the air entering the zone from its air supply
F_#_Z_# SUPPLY INLET Temperature [C] Temperature of the air entering the zone from its air supply
F_#_Z_# VAV REHEAT Damper Position Position of the zone's air supply box damper. 1 corresponds to fully open, 0 corresponds to fully closed
F_#_Z_#: Equipment Power [W] [W] Power used by the electric equipment in the zone
F_#_Z_#: Lights Power [W] Power used by the lights in the zone
F_#_Z_#: Mechanical Ventilation Mass Flow Rate [kg/s] Ventilation rate of the zone exhaust fan
F_#_Z_#: Thermostat Temp [C] Temperature of the air inside the zone
F_#_Z_#: Thermostat Cooling Setpoint [C] Cooling set point schedule for the zone
F_#_Z_#: Thermostat Heating Setpoint [C] Heating set point schedule for the zone
Total Electric Demand Power [W] Total power used by the building
HVAC Electric Demand Power [W] Total power used by the building's HVAC system including coils, fans and pumps.

Data Preparation

Modifying Employee Information

  • I created an EmployeeID column using the first letter of the employee’s name and the Full last name.
  • Generated Prox ID by using the EmployeeID and adding “001”, denoting their first Prox ID card.
  • From looking at Mobile Prox Data (Rosie), I have found that employees with 2 last names (ie. Ada Campo-Corrente) or special characters after their last name (ie. Sten Sanjorge Jr) was transformed to “acampo” and “jsanjorge” respectively.
  • The formula and the transformation for these special types of proxid can be seen below.

EmployeeIDTransformation.jpg

First Name Last Name Transformed Employee ID Formula
Ruscella Mies Haber rmieshaber =LOWER(LEFT(B12,1)&LEFT(C12,4)&MID(C12,6,6))
Ada Campo-Corrente acampo =LOWER(LEFT(B53,1)&LEFT(C53,5))
Sten Sanjorge Jr. jsanjorge =LOWER(MID(C58,10,1)&MID(C58,1,8))
Willem Vasco-Pais wvasco =LOWER(LEFT(B60,1)&LEFT(C60,5))

Creating Custom Polygon for Tableau

I plotted the Custom Polygon using this tool: [Blog Link] [Github Link]. This tool allowed me to create polygons by allowing you to draw in them, and the exported result is a 4 column: Identifier, Path ID, X and Y Coordinates. These can be used for Tableau to read the polygons and also to calculate the Fixed Prox Data's X and Y coordinates.
CustomPolygon.jpg


Mobile Prox and Fixed Prox Data:

I found out that the Fixed Prox Data did not have X and Y axis, and that the Mobile Prox Data did not have a zone value allocated to its dataset. The following is my attempt to bridge the gap.
InitialProxData.jpg


Fixed Prox Data:
My main assumption is that the the Fixed Proximity Sensor is located in one specific part of the respective zone. Firstly, I plotted the zones in a custom polygon html (Link here) and got the X and Y axis in a separate excel file.

I then used V lookup to generate the X and Y axis for the Fixed Prox Data, with the following formula: =VLOOKUP(E2,[Floor1.xlsx]Sheet1!$A$2:$B$34,2,FALSE) (credit to [James Chua]). Then I was able to generate the X and Y axis for the Fixed Proximity Data. FixedProxVlookup.jpg

I repeated these steps for Floor # 2 and # 3 respectively. To simplify the analysis, I decided to combine the 2 Proximity Data together in one csv file.

Hazium Concentration

Combined the 4 tables of Hazium Concentration with the following settings. Afterwards, I deleted the common Date/Time column, only leaving 1 date/time column. I joined it using JMP Pro's Join function. I ticked the "include non-mathches" for both Main and With Table column, and did these for each of the tables to combine it into one table. HaziumDataPrep.jpg

Preparing HVAC Data

I used the “Stack” function in JMP Pro to convert the columns into rows
HVACStack.jpg

After stacking them, I realized that there are 2 main types of measurement name: those starting “F_#_Z_#” (Floor #,Zone #) and those without. I segregated those without to another file, and joining the two together afterwards.

I then used the Standardise Attributes > Recode function under Columns, to ensure that all those measurement names with “F_#_Z_#” are standardized and ready for delimiting. I checked all of them manually to ensure smooth delimiting later on.
HVACattr.jpg

I combined the Hazium Concentration data to the main table. After this, I delimited it by “_” to create a Floor and Zone table. This is so that when the data is uploaded into tableau, it can be filtered by floor and zone. Afterwards I combined the label and data without the starting:“F_#_Z_#” HVACcombined.jpg

Visualisations

Link to Visualisation: https://public.tableau.com/profile/jonathan.eduard.lim#!/vizhome/MA3Revised123/Assignment3Summary

Findings: Question 1

What are the typical patterns in the prox card data? What does a typical day look like for GAStech employees?

  1. Rosie starts to deliver letters from 9am and 2pm, going around each floor in an anticlockwise manner, from Floor 1, 2 and 3.
  2. Most of those in the Executive Department arrive to work around 7am.
  3. The CEO, Sten Sanjorge, only appear at around 11am. He is also the latest Executive member to leave the office at 6pm.
  4. The security department patrols each floors of the office regularly throughout the day. They usually patrol at 8am, 12pm, 1pm and 5pm based on their Prox Id count.
  5. There is no Prox ID count for the Security Department between 11-12 June, which is worrying due to the importance of their job.
  6. Mat Bramar, an Administration department staff, is only seen in the office on the 11th of June. He is not present at any days other than 11 June.
  7. The Engineering department are present either in 1st and 2nd floor. They are never seen on the 3rd floor.
  8. The Facilities team often arrive later than most people, usually at 10am.
  9. The HR team starts their day at 8am, and end their day at 5pm
  10. The IT department has the shortest work day amongst the different departments, as short as 2 hours in 7 June 2016. (1 hour in 8 June)

Findings: Question 2

Describe up to ten of the most interesting patterns that appear in the building data. Describe what is notable about the pattern and explain its possible significance

  1. In Floor 1 Zone 1 (Deli at floor 1), there is a spike of “Lights Power” between 11am-12pm. This signifies lunch time for employees, as most or if not all of the lights are turned on to accommodate for the employees who are having lunch. Also, there is a small increase at 7am, in which could mean that some employees eat breakfast before starting work.
  2. In Floor 1 Zone 5 (Conference Room), there is a sudden increase of Lights and Equipment power usage between 1am-3am. There seem to be conferences starting from 1am up to 3am. A similar pattern can be seen between 5am-7am, which are another set of conferences.
  3. Floor 1 Zone 7 (Loading Room with Offices), there are a spike in Equipment Power and Lights Power at these following times: 1am, 4-5am, 7-8am, 3pm-5pm, and 9pm. These could mean that there are either outgoing or incoming deliveries at these time frames.
  4. Floor 2 Zone 5 (Office number 2300) seem to be used after office hours. This is because there is an increase of Equipment Power and Lights Power used from 3pm to 11pm, with a “break” at 10pm. An employee may be working at these odd hours.
  5. Floor 2 Zone 16 (Cluster of Offices near Floor 2 Zone 5) seems to be used from 3pm-11pm. This is derived from the Equipment Power and Lights Power increasing starting from 3pm until 11pm.
  6. Between 6am-12pm, the Reheat Coil Power went to value 0 on 7 June 2011, at Floor 3 Zone 1, which is a room in the upper left corner of Floor 3. This phenomenon repeats every day up to 13 June 2016. Before 2 June, the value for Reheat Coil Power is 0 throughout the day, only increasing at 2 June, 1pm.

Findings: Question 3

Describe up to ten notable anomalies or unusual events you see in the data. Prioritize those issues that are most likely to represent a danger or a serious issue for building operations.

  1. Hazium Concentration spike up at Floor 1 Zone 8A at approximately 6-7am. It is mentioned that Hazium is a harmful substance in the air and that it is not safe for humans to be exposed in high quantities. Floor 1 Zone 8A is the corridor adjacent to the main entrance of the office. This is not safe for employees because they usually enter the office at this time and may be exposed to Hazium.
  2. Hazium Concentration also spike up at Floor 2 Zones 2 and 4 respectively. This occurs at around 6pm, where most of the employees are going home. Both Zone 2 and 4 are offices that are adjacent to the windows, which may be the source of this spike. However, if opene windows were the cause, the hazium level should be high and levelled throughout the day, which is not the case here.
  3. There seem to be a sudden increase of Reheat Coil Power between 11 June – 12 June. It may not be good for the building overall as it will affect the building’s overall electricity demand and this may result to higher costs of maintaining the new building, among many problems.

Findings: Question 4

Describe up to five observed relationships between the proximity card data and building data elements. If you find a causal relationship (for example, a building event or condition leading to personnel behavior changes or personnel activity leading to building operations changes), describe your discovered cause and effect, the evidence you found to support it, and your level of confidence in your assessment of the relationship.

Tools used

JMP: JMP was very helpful in dealing with the building data, in which Microsoft Excel would have problems doing. JMP was helpful especially in the Stack function, which Excel is lacking. JMP is also very stable in dealing with huge rows of data, and this enabled me to clean the data with minimal crashes or problems.
Microsoft Excel: Microsoft Excel was helpful especially in generating the Employee ID and its formulas are easier to grasp in comparison with JMP. JMP Formulas are powerful as well, but at this point I am still learning JMP formulas.
Tableau: Tableau is a good method to create wonderful dashboard and stories, enabling the users to explore the data and pinpoint any discovery that they may find along the way.

References

Comments and Feedback