ISSS608 2016-17 T3 Assign ERIC PRABOWO CUNDOMANIK

From Visual Analytics and Applications
Jump to navigation Jump to search


Comments and Feedback

1

2

3

Background [Mini Challenge 2]

VAST Challenge 2017 MC2

Ornithology student Mitch Vogel was immediately suspicious of the noxious gases just pouring out of the smokestacks from the four manufacturing factories south of the nature preserve. He was almost certain that all of these companies are contributing to the downfall of the poor Rose-crested Blue Pipit bird. But when he talked to company representatives and workers, they all seem to be nice people and actually pretty respectful of the environment.

Problem

The primary job for Mitch is to determine which (if any) of the factories may be contributing to the problems of the Rose-crested Blue Pipit. Often, air sampling analysis deals with a single chemical being emitted by a single factory. In this case, though, there are four factories, potentially each emitting four chemicals, being monitored by nine different sensors. Further, some chemicals being emitted are more hazardous than others. Your task, as supported by visual analytics that you apply, is to detangle the data to help Mitch determine where problems may be.

Questions

  1. Characterize the sensors’ performance and operation. Are they all working properly at all times? Can you detect any unexpected behaviors of the sensors through analyzing the readings they capture?
  2. Now turn your attention to the chemicals themselves. Which chemicals are being detected by the sensor group? What patterns of chemical releases do you see, as being reported in the data?
  3. Which factories are responsible for which chemical releases? Carefully describe how you determined this using all the data you have available. For the factories you identified, describe any observed patterns of operation revealed in the data.


Dataset Description

Mini-Challenge 2 provides a three month set of data for you to analyze, covering April, August, and December 2016.

I. COORDINATES:
The factories and sensors locations are provided in terms of x,y coordinates on a 200x200 grid, with (0,0) at the lower left hand corner (southwest). The sensors map shows the locations of the sensors and factories by number for the sensors and by name for the factories. Some of the other features of the map (such as entrances and gates in that area) have been removed for readability. (Please note that the terms “sensor” and “monitor” are used interchangeably.)

The following are the factory locations:
Roadrunner Fitness Electronics: 89,27
Kasios Office Furniture: 90,21
Radiance ColourTek: 109,26
Indigo Sol Boards: 120,22

The following are the sensor locations:
1: 62,21
2: 66,35
3: 76,41
4: 88,45
5: 103,43
6: 102,22
7: 89,3
8: 74,7
9: 119,42


II. METEOROLOGICAL DATA
[Date, Wind Direction, Wind Speed]
Date: The date and time of the readings, local time with no change for Daylight Savings.
Wind Direction: The compass directions where the wind is originating from, using a north-referenced azimuth bearing where 360/000 is true north.
Wind Speed: The speed of the wind in meters per second.
Each of these reading is taken at the date and time provided.


III. SENSOR DATA
[Chemical, Monitor, Reading, Date Time]
Chemical: Which one of the four chemicals detected by the sensors
Monitor: Which one of the nine sensors picking up the reading
Reading: The air sensor detected amount in parts per million
Date Time: The date and time of day of the reading, local time with no change for Daylight Savings.


Data Preparation and Cleaning

[in progress..]

1. Create X, Y coordinate Excel file for sensors and companies

Create a table with X and Y data

2. Clean meteorological data

Remove empty 4th column, and delete 5th column with elevation data

3. Duplicate meteorological data (wind data) for building the polygon shape in Tableau

  • Add 3 additional columns [Angle, Length, and Point]: new columns are specifically for building the polygon shape for the Wind Rose on the chart. Each of them are for building components of the lines of the Wind Rose, for example the Angle is Tripled with the additional +10 and -10 degree from the original data.
Added 3 column on Meteorological Data (wind)
  • Triple the data: by adding 3 columns above, there should also be copying process of the data. Data should be copied 3 times for each of the angle, before applying formula to calculate the Angle and Length column.

4. Connecting to Tableau with inner join method

  • Going forward to Tableau, add the 3 flat files of the data (Sensor, Coordinate, and Meteorological in sequence). This will make Sensor as the key data point, and connect to both Coordinate and Meteorological data. For meteorological data used is the one tripled, or modified by copying the data 3 times for the polygon shape drawing on Wind Rose.
  • Add another dataset in Tableau by joining a new combination, where the dataset is not tripled (as for the polygon shape drawing). Use the Sensor, Coordinate, and Meteorological (non-modified meteorological dataset).


Data Visualization Charts

[in progress..]

I. Line Charts for Sensor Readings
II. Wind Rose
III. Bar Chart
IV. Line Chart for Wind Speed
V. Chloropeth for Sensor and Chemical Readings



Tableau Dashboard Overview

Link: https://public.tableau.com/profile/eric.prabowo#!/vizhome/MC2-EricPrabowo-v2_0/STORYLekagulWildlifePreserveArea


Question Answers

Question 1

Characterize the sensors’ performance and operation. Are they all working properly at all times? Can you detect any unexpected behaviours of the sensors through analysing the readings they capture? Limit your response to no more than 9 images and 1000 words.

Sensor 4

Sensor performance and operation are overall good, except for sensor 4 which shows an unusual pattern compared to sensor 3 and 5, which is nearby. Sensor 4 have a problem on detecting the amount of chemical pollution, where it is increasing in August and December compared to April. Probably the measurement sensor is not performing well from the unusual behaviour.

Q1-eric prabowo.png
Sensor 4 unusual behaviour of capturing data (increasing pattern)


Sensor 3

Sensor 3 shows unusual measurement of chemical pollution detected. Overall average of measurement is higher for sensor 3 within 3 months of data, for all chemicals detected. This indicates measurement problem of the sensor 3, where measurement unit calculation is inaccurate. Conversion of the data should provide a more accurate result.

Q1 1-eric prabowo.png
Sensor 3 unusual behaviour of measurement (shown as outlier)


Sensor 9

Sensor 9 shows unusual measurement of chemical starting 23rd of August / last quarter of August, and the whole December. This pattern might show a broken sensor indication from Sensor 9 for certain chemicals reading.

Q2 3-eric prabowo.png
Sensor 9 have got a different measurement on Appluimonia, Chlorodinine, and Methylosmolene starting on the last quarter of August and the whole December.

In details, the data also given a prove that wind direction does not match the amount of chemicals detected on the sensors as shown in the image below. As a result, the amount of Appluimonia and Chlorodinine is probably not the real detected amount of chemicals.

Q1 2-eric prabowo.png
Dashboard showing wind direction and Sensor 3 readings unrelated


Sensor 7

The dashboard data also shows that sensor 7 is not performing well, as Chlorodinine is showing up on the sensor, while the wind direction is not showing indication of direction from any factories.

Q1 3-eric prabowo.png
Dashboard showing wind direction and Sensor 7 readings unrelated


Sensor Reading Pattern

When readings on AGOC-3A is intense, sensors cannot read both AGOC-3A and Appluimonia at the same time altogether. So, when it comes to high intensity reading of AGOC-3A, Appluimonia data is not actually recorded by the sensors, as shown in image below. There is also an indication of blank data every second (2nd), fourth (4th), sixth (6th), seventh (7th) day of the month at exactly 12am, indicating sensor maintenance or restart, where it is not capturing data.

Q2 5-eric prabowo.png
Sensor Reading Counts – Sensors cannot read both AGOC-3A and Appluimonia at the same time

Question 2

Which chemicals are being detected by the sensor group? What patterns of chemical releases do you see, as being reported in the data?

Chemical Hourly Release Pattern

On hourly filtered data, the chemicals detected have patterns of releases. These indicates production hour of certain chemicals detected.

Q2-eric prabowo.png
Hourly filtered sensor data – showing chemical release patterns by hour of the day
Q2 1-eric prabowo.png
Hourly filtered sensor data – showing chemical release patterns by hour of the day (sensor 3,4,7 are NOT shown)

As shown on the line chart above, the pattern shows production time that uses AGOC-3A is 05:00-20:00 in August. Production pattern is shown by aggregated daily data within 3 months on the similar hour of the day. This production pattern time gap is increasing in August and December. In August, production time is 05:00-23:00. In December, AGOC-3A production time is 06:00-23:00, however there is an additional peak of AGOC-3A waste at 04:00. Second pattern shown on the data is that, Methylosmolene is being released by production starting from 9pm to 6am in the morning. This shows production patterns of Methylosmolene is done in midnight hours.

Chemical Significant Readings
Q2 2-eric prabowo.png
Chemical significant readings – Appluimonia

On the Dashboard Monitor, Appluimonia shows significant read from most of all sensors. This shows indication that the chemical pollution is not coming from the factories. Instead, it might come from surrounding areas.


Chemical Monthly Release Pattern

Chemicals reading pattern for AGOC-3A shows there are monthly indication of lower production at the very beginning of the month, and no production at the end of the month. This chemical pattern shows production behaviour of companies using AGOC-3A, where production tend to increase in the middle of the month, and stopped at about day 25th of every month. Except for December, production stops early at 22nd December.

Q2 4-eric prabowo.png
Production pattern of AGOC-3A


Question 3

Which factories are responsible for which chemical releases? Carefully describe how you determined this using all the data you have available. For the factories you identified, describe any observed patterns of operation revealed in the data.

Companies responsible for the chemical wastes are:

  • Kasios: AGOC-3A
  • Radiance ColourTek: AGOC-3A, Chlorodinine
  • Roadrunner Fitness Elect: Methylosmolene, Chlorodinine, AGOC-3A
  • Indigo: Methylosmolene, Appluimonia


Kasios
Q3-eric prabowo.png
Sensor 5 showing indication of wind direction from Kasios Office – AGOC-3A detected at 12PM


Radiance ColourTek
Q3 1-eric prabowo.png
Sensor 9 showing indication of wind direction from Radiance ColourTek – AGOC-3A detected at 12PM
Q3 5-eric prabowo.png
Sensor 8 and wind direction indicates Radiance - Chlorodinine detected at 12PM


Roadrunner Fitness Elect.
Q3 2-eric prabowo.png
Sensor 6 showing indication of wind direction from Roadrunner Fitness Elect. – Methylosmolene detected at 12AM
Q3 3-eric prabowo.png
Sensor 6 showing indication of wind direction from Roadrunner Fitness Elect. – Chlorodinine detected at 3AM
Q3 6-eric prabowo.png
Sensor 5 detected AGOC-3A coming from the wind direction of Roadrunner


Indigo Sol Boards
Q3 4-eric prabowo.png
Sensor 9 and wind direction indicates Indigo - Methylosmolene detected at 3AM
Q3 7-eric prabowo.png
Sensor 9 detected Appluimonia coming from the wind direction of Indigo Sol Boards at 12 PM

References

Wind Polygon: https://community.tableau.com/thread/148044
VAST Challenge 2017 [Mini-Challenge 2]: http://www.vacommunity.org/VAST+Challenge+2017+MC2