Difference between revisions of "IS428 AY2018-19T1 Kung Jung-wen"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
Line 60: Line 60:
 
=== Q1: Spatio-temporal Analysis of Official Air Quality ===
 
=== Q1: Spatio-temporal Analysis of Official Air Quality ===
  
 +
== Interesting & Anomalous Observations ==
 +
Using the dashboard as a platform for investigation and analysis, the following aims to provide answers to the questions posed.
  
Based on the data captured by the proximity sensors, the following shows a typical pattern in the proximity card data.
+
'''Characteristic of the past and most recent situation with respect to air quality measures in Sofia City'''<br/>
# For the fixed proximity sensors, readings are collected for 24hours during the weekdays, except for 1am and 4am. From 12midnight to 6am, only level 1 fixed proximity sensor will collect data. This can possibly mean that there is no employee movement from 12midnight to 6am on level 2 and 3.
+
[[File:Overview of EEA.png|700px|center]]
# For the mobile proximity sensor, Rosie (the mobile robot) travels the halls at 9am and 2pm daily. On 4, 5, 11 and 12 June, the robot does not collect any readings. One possible reason is that these are weekends (Sat/Sun) and therefore, Rosie will not be required to travel along the hallways.
+
# The overall trend of slight improvement in air quality: Sofia City’s PM10 level is on the fall, air quality is becoming better over the years with lower peaks in 2015 onwards compared to 2014. The PM10 level reached a new minimum level in 2018 with a PM 10 concentration of 12.2.  
 +
# Overall improvement in air quality in 2018: from the chart showing the PM10 Concentration Fluctuation by Month, every month in 2018 has the lowest PM10 concentration compared to previous years.  
  
'''Typical Day for GAStech Employees'''<br/>
+
'''Anomaly pattern in PM10 Concentration'''  
<p>The following lists a typical day for GAStech employees, in all departments:</p>
 
# Employees start arriving in the office at 7am. By 9am, majority of the employees would have already arrived in the building and settle in their own office space.
 
# Lunch time is usually between 12nn to 2pm.
 
# At around 2pm, many employees will be back in their office and that’s when we can see lots of activities going on in the different levels.
 
# Employees typically end work at around 5pm, though some may leave slightly earlier.
 
# After 6pm, majority of the employees in the office are from the Engineering, Facilities and IT department.
 
# After 7pm, employees working in level 3 would have left the office.
 
# At 12midnight, most of the employees would have left the building and only people in the facilities team will be present in level 1.
 
# People in the facilities department will be available in the building 24 hours and its always the same people patrolling around the area.
 
# Other than the facilities personnel, IT and engineering employees often stay in the office and will only leave at around 12midnight.
 
  
<p>The following lists some activities that is part of a typical day pattern for employees in a specific department:</p>
+
# Extremely poor daily PM10 Concentration in December and January: from Daily PM10 Concentration Chart, most of the peaks are around December and January period and the PM10 level can reach as high as 400+ range.  
{| class="wikitable"
+
# Poor monthly average PM10 concentration in 2014 and 2016: The two peaks shown in the Monthly Avg PM10 Concentration Chart, are December 2014 and January 2016 which is consistent with the findings above.
|-
+
# No improvement in monthly PM10 concentration for January and November: as it was found that the air quality had an overall improvement in 2018, however it seems that the average monthly PM10 level in January and November is on the rise. We cannot make this conclusion about improved air quality in 2018 November and December as the data is not available yet. However, it seems that the main activity that causes of poor air quality is being shifted to November.  
! style="font-weight: bold;background: #536a87;color:#fbfcfd;width: 20%;" | Department
+
 
! style="font-weight: bold;background: #536a87;color:#fbfcfd;" | Activities
+
 
|-
+
'''What does a typical day look like for Sofia city? '''
| <center>'''Engineering'''</center>
+
[[File:Day in Sofia.png|700px|center]]
||
+
The graph above shows the average hourly PM10 concertation from the air quality stations within the Sofia City for the year 2017 and 2018. Data in 2016 and before are excluded as they are incomplete, where as 2017 data includes the November and December hourly data that complements with 2018’s hourly data from January to September.
# Between 2pm and 3pm, employees seem to appear around the meeting and training room.
+
# Hours with worst air quality for 2017 and 2018: In a typical day in Sofia City, 12 am -1 am, 7am to 8am and 7pm and 12 pm have the higher concentration of PM 10 level. Generally the PM10 yearly concentration by hour is within the acceptable range.
# After 6pm, only Clemencia Whaley, Penney Bueno and Twana Quiroz will remain in the building. They will leave around 11.40pm.
+
# Three period of bad quality time: Analysing the hourly concentration for every month, January, November and December have the worst hourly P10 hourly concentration. In January, the PM10 concentration reached very poor category for 2-4 hours consecutively. The hours that have very poor air quality are around midnight, in the morning 8-9 am and at the night time around 8 to 11pm.  
|-
+
# Good air quality most of the time: months other than January, November and December, the air quality is within the good air quality range and in certain months 11 am to 3 pm can achieve very good air quality.
| <center>'''Executive'''</center>
+
 
||
+
 
# At around 12nn, CEO Sten Sanjorge Jr. will be moving around the building. At around 1pm, other executives start leaving for lunch.
+
'''Do you see any trends of possible interest in this investigation?'''
|-
+
# From the analysis above, we should investigate the activities that cause January, November and December to hit poor air quality score.  
| <center>'''Facilities'''</center>
+
# Understand why from certain months, for instance, April to September is able to achieve very good air quality score during the 11 am to 3 pm hours.  
||  
+
 
# Shift work is being done throughout the day as we can see some pattern for employees in the facilities department:
+
'''How do these affect your analysis of potential problems to the environment?'''
## Varro Awelon and Emile Earpa will always be in level 1 from 12mn to around 7.30/8am.
+
 
## Dylan Scozzese and Chi Staley will always be in level 1 from 8am onwards.
+
From the analysis we identified that the worst PM10 concentration level can reach as high as 413 in a day which is hazardous to any human being and during the January, November and December period, some hours are consistently reaching very poor air quality standard. It is then important to identify the factors behind these patterns and allow the public of Sofia City to be adequately prepared in these periods with masks.
# At approximately 9am, some employees from the facilities department will proceed to the deli.
 
|-
 
| <center>'''Information Technology'''</center>
 
||
 
# At around 10.30am, all employees from the IT department will gather at the meeting room area on level 2.
 
# Out of all IT employees, Erminia Bello seems to be always staying in the office till late.
 
|-
 
| <center>'''Security'''</center>
 
||
 
# Employees do not typically move around the building. The data captured had shown that they always stay within their own office.
 
|}
 

Revision as of 20:59, 11 November 2018

Problem & Motivation

Dataset Transformation Process

Before starting with the analysis in Tableau, each feature within the data set is being analysed to better understand the context of the problem and to ensure that the data transformation process is performed accurately.

Working with EEA Data:

1. Merge all the EEA air quality data: As there are 28 csv files being provided from the data source, we first combining all the data within python.

Merging EEA files

2. All the CSV files provided contains the air quality station identifiers, air pollution measurement, time period of measurement and other links. To obtain the geographic location of each air station, we then merged the data with metadata using left merge in Python to obtain mainly the longitude and latitude points of each station.

3. The date variables need to be standardised and cleaned up in a format that is readable in Tableau. change date variable to a format that tableau can read. The current DatetimeBegin and DatetimeEnd variables contain time in milliseconds, as the data will only be analysed on either an hourly or daily level, the date of transformed to include the date and hour of the measurement time in Python.

EEA date transform.jpg


Dataset Import Structure & Process

Interactive Visualization

The interactive visualization can be accessed here: https://public.tableau.com/viws/Assignment3_145/Home?:embed=ey&:display_count=yes

Interactive Technique Rationale Brief Implementation Steps
Filter dates with the use of time range slider
To provide flexibility for analysts to choose the time period that they are interested to analyse.
The use of checkboxes or dropdown list requires the analyst to check/uncheck each date manually which is time-consuming. As such, a time range slider is preferred.
  1. The date/time field have to be duplicated with its data type set to “date”
  2. Add the new field to be filtered.
Filter each floor/zone using a single selection drop down list
To allow analysts to concentrate on the data collected from each level with the use of a single selection.
Use of a drop down list also allow analysts to easily choose the building level that they are interested to analyse.
  1. Configure the filter selection to be a single selection drop down list
Change and zoom of floor plans based on each floor filter
To allow for easy reference of mapping each building data elements with each zone.
When a user filters from one floor to another, the floor plan also changes to provide for quick and easy reference. Due to the space constraint, each floor plan has to be zoomed for users to identify and see the zone areas clearly.
  1. Create calculated fields for the x and y axis.
  2. Put the 2 calculated fields into the worksheet view.
  3. To hide the mark, set the colour as transparent.
  4. Navigate to Maps > Background Images. Add the floor plans into the background images and configure it according to the filter condition.
  5. Put the “floor” attribute as a filter.


Interesting & Anomalous Observations

Using the dashboard as a platform for investigation and analysis, the following aims to provide answers to the questions posed.

Q1: Spatio-temporal Analysis of Official Air Quality

Interesting & Anomalous Observations

Using the dashboard as a platform for investigation and analysis, the following aims to provide answers to the questions posed.

Characteristic of the past and most recent situation with respect to air quality measures in Sofia City

Overview of EEA.png
  1. The overall trend of slight improvement in air quality: Sofia City’s PM10 level is on the fall, air quality is becoming better over the years with lower peaks in 2015 onwards compared to 2014. The PM10 level reached a new minimum level in 2018 with a PM 10 concentration of 12.2.
  2. Overall improvement in air quality in 2018: from the chart showing the PM10 Concentration Fluctuation by Month, every month in 2018 has the lowest PM10 concentration compared to previous years.

Anomaly pattern in PM10 Concentration

  1. Extremely poor daily PM10 Concentration in December and January: from Daily PM10 Concentration Chart, most of the peaks are around December and January period and the PM10 level can reach as high as 400+ range.
  2. Poor monthly average PM10 concentration in 2014 and 2016: The two peaks shown in the Monthly Avg PM10 Concentration Chart, are December 2014 and January 2016 which is consistent with the findings above.
  3. No improvement in monthly PM10 concentration for January and November: as it was found that the air quality had an overall improvement in 2018, however it seems that the average monthly PM10 level in January and November is on the rise. We cannot make this conclusion about improved air quality in 2018 November and December as the data is not available yet. However, it seems that the main activity that causes of poor air quality is being shifted to November.


What does a typical day look like for Sofia city?

Day in Sofia.png

The graph above shows the average hourly PM10 concertation from the air quality stations within the Sofia City for the year 2017 and 2018. Data in 2016 and before are excluded as they are incomplete, where as 2017 data includes the November and December hourly data that complements with 2018’s hourly data from January to September.

  1. Hours with worst air quality for 2017 and 2018: In a typical day in Sofia City, 12 am -1 am, 7am to 8am and 7pm and 12 pm have the higher concentration of PM 10 level. Generally the PM10 yearly concentration by hour is within the acceptable range.
  2. Three period of bad quality time: Analysing the hourly concentration for every month, January, November and December have the worst hourly P10 hourly concentration. In January, the PM10 concentration reached very poor category for 2-4 hours consecutively. The hours that have very poor air quality are around midnight, in the morning 8-9 am and at the night time around 8 to 11pm.
  3. Good air quality most of the time: months other than January, November and December, the air quality is within the good air quality range and in certain months 11 am to 3 pm can achieve very good air quality.


Do you see any trends of possible interest in this investigation?

  1. From the analysis above, we should investigate the activities that cause January, November and December to hit poor air quality score.
  2. Understand why from certain months, for instance, April to September is able to achieve very good air quality score during the 11 am to 3 pm hours.

How do these affect your analysis of potential problems to the environment?

From the analysis we identified that the worst PM10 concentration level can reach as high as 413 in a day which is hazardous to any human being and during the January, November and December period, some hours are consistently reaching very poor air quality standard. It is then important to identify the factors behind these patterns and allow the public of Sofia City to be adequately prepared in these periods with masks.