ISSS608 2016-17 T3 Assign ERIC PRABOWO CUNDOMANIK
Contents
Comments and Feedback
1
Feedback from Josef Exconde
For the first graph presented, though it is a neat idea to use color to show whether the reading is High or Low, I find that there is some problem to this approach, 1.) We don't have a baseline unto which is really a high reading and which is low, we don't know the acceptable value of each reading and if we assume that 0 is the acceptable value then everything should be high, so even if it is a nice representation we can't seem to make the most out of it for this viz, and secondly having 9 lines in a facet or a pane is difficult by itself, having 9 lines with the same color scheme it makes it harder to differentiate one sensor over the other. If I want to single out a reading for each sensor I would need to manipulate the filter. Personally I would suggest either having a label for each line or using color to differentiate them.
Another is the Sensor Reading Pattern, though you stated a great finding regarding the lack of reading for Appluimonia when there is a spike from AGOC-3A, the packed information on one screen makes it hard to observe the finding. The value on the columns and rows are too small to comprehend and the dimensions won't properly fit in one screen altogether.
2
3
Background [Mini Challenge 2]
VAST Challenge 2017 MC2
Ornithology student Mitch Vogel was immediately suspicious of the noxious gases just pouring out of the smokestacks from the four manufacturing factories south of the nature preserve. He was almost certain that all of these companies are contributing to the downfall of the poor Rose-crested Blue Pipit bird. But when he talked to company representatives and workers, they all seem to be nice people and actually pretty respectful of the environment.
Problem
The primary job for Mitch is to determine which (if any) of the factories may be contributing to the problems of the Rose-crested Blue Pipit. Often, air sampling analysis deals with a single chemical being emitted by a single factory. In this case, though, there are four factories, potentially each emitting four chemicals, being monitored by nine different sensors. Further, some chemicals being emitted are more hazardous than others. Your task, as supported by visual analytics that you apply, is to detangle the data to help Mitch determine where problems may be.
Questions
- Characterize the sensors’ performance and operation. Are they all working properly at all times? Can you detect any unexpected behaviors of the sensors through analyzing the readings they capture?
- Now turn your attention to the chemicals themselves. Which chemicals are being detected by the sensor group? What patterns of chemical releases do you see, as being reported in the data?
- Which factories are responsible for which chemical releases? Carefully describe how you determined this using all the data you have available. For the factories you identified, describe any observed patterns of operation revealed in the data.
Dataset Description
Mini-Challenge 2 provides a three month set of data for you to analyze, covering April, August, and December 2016.
I. COORDINATES:
The factories and sensors locations are provided in terms of x,y coordinates on a 200x200 grid, with (0,0) at the lower left hand corner (southwest). The sensors map shows the locations of the sensors and factories by number for the sensors and by name for the factories. Some of the other features of the map (such as entrances and gates in that area) have been removed for readability. (Please note that the terms “sensor” and “monitor” are used interchangeably.)
The following are the factory locations:
Roadrunner Fitness Electronics: 89,27
Kasios Office Furniture: 90,21
Radiance ColourTek: 109,26
Indigo Sol Boards: 120,22
The following are the sensor locations:
1: 62,21
2: 66,35
3: 76,41
4: 88,45
5: 103,43
6: 102,22
7: 89,3
8: 74,7
9: 119,42
II. METEOROLOGICAL DATA
[Date, Wind Direction, Wind Speed]
Date: The date and time of the readings, local time with no change for Daylight Savings.
Wind Direction: The compass directions where the wind is originating from, using a north-referenced azimuth bearing where 360/000 is true north.
Wind Speed: The speed of the wind in meters per second.
Each of these reading is taken at the date and time provided.
III. SENSOR DATA
[Chemical, Monitor, Reading, Date Time]
Chemical: Which one of the four chemicals detected by the sensors
Monitor: Which one of the nine sensors picking up the reading
Reading: The air sensor detected amount in parts per million
Date Time: The date and time of day of the reading, local time with no change for Daylight Savings.
Data Preparation and Cleaning [in progress..]
[in progress..]
1. Create X, Y coordinate Excel file for sensors and companies
2. Clean meteorological data
3. Duplicate meteorological data (wind data) for building the polygon shape in Tableau
- Add 3 additional columns [Angle, Length, and Point]: new columns are specifically for building the polygon shape for the Wind Rose on the chart. Each of them are for building components of the lines of the Wind Rose, for example the Angle is Tripled with the additional +10 and -10 degree from the original data.
- Triple the data: by adding 3 columns above, there should also be copying process of the data. Data should be copied 3 times for each of the angle, before applying formula to calculate the Angle and Length column.
4. Connecting to Tableau with inner join method
- Going forward to Tableau, add the 3 flat files of the data (Sensor, Coordinate, and Meteorological in sequence). This will make Sensor as the key data point, and connect to both Coordinate and Meteorological data. For meteorological data used is the one tripled, or modified by copying the data 3 times for the polygon shape drawing on Wind Rose.
- Add another dataset in Tableau by joining a new combination, where the dataset is not tripled (as for the polygon shape drawing). Use the Sensor, Coordinate, and Meteorological (non-modified meteorological dataset).
5. Convert Map Coordinates and Windspeed to miles standard unit
- Convert measurement units to miles
Data Visualization Charts [in progress..]
[in progress..]
I. Line Charts for Sensor Readings
II. Wind Rose
III. Bar Chart
IV. Line Chart for Wind Speed
V. Chloropeth for Sensor and Chemical Readings
Tableau Dashboard Overview
Question Answers
Question 1
Characterize the sensors’ performance and operation. Are they all working properly at all times? Can you detect any unexpected behaviours of the sensors through analysing the readings they capture? Limit your response to no more than 9 images and 1000 words.
Sensor 4
Sensor performance and operation are overall good, except for sensor 4 which shows an unusual pattern compared to sensor 3 and 5, which is nearby. Sensor 4 have a problem on detecting the amount of chemical pollution, where it is increasing in August and December compared to April. Probably the measurement sensor is not performing well from the unusual behaviour.
Sensor 3
Sensor 3 shows unusual measurement of chemical pollution detected. Overall average of measurement is higher for sensor 3 within 3 months of data, for all chemicals detected. This indicates measurement problem of the sensor 3, where measurement unit calculation is inaccurate. Conversion of the data should provide a more accurate result.
Sensor 9
Sensor 9 shows unusual measurement of chemical starting 23rd of August / last quarter of August, and the whole December. This pattern might show a broken sensor indication from Sensor 9 for certain chemicals reading.
In details, the data also given a prove that wind direction does not match the amount of chemicals detected on the sensors as shown in the image below. As a result, the amount of Appluimonia and Chlorodinine is probably not the real detected amount of chemicals.
Sensor 7
The dashboard data also shows that sensor 7 is not performing well, as Chlorodinine is showing up on the sensor, while the wind direction is not showing indication of direction from any factories.
Sensor Reading Pattern
When readings on AGOC-3A is intense, sensors cannot read both AGOC-3A and Appluimonia at the same time altogether. So, when it comes to high intensity reading of AGOC-3A, Appluimonia data is not actually recorded by the sensors, as shown in image below. There is also an indication of blank data every second (2nd), fourth (4th), sixth (6th), seventh (7th) day of the month at exactly 12am, indicating sensor maintenance or restart, where it is not capturing data.
Question 2
Which chemicals are being detected by the sensor group? What patterns of chemical releases do you see, as being reported in the data?
Chemical Hourly Release Pattern
On hourly filtered data, the chemicals detected have patterns of releases. These indicates production hour of certain chemicals detected.
As shown on the line chart above, the pattern shows production time that uses AGOC-3A is 05:00-20:00 in August. Production pattern is shown by aggregated daily data within 3 months on the similar hour of the day. This production pattern time gap is increasing in August and December. In August, production time is 05:00-23:00. In December, AGOC-3A production time is 06:00-23:00, however there is an additional peak of AGOC-3A waste at 04:00. Second pattern shown on the data is that, Methylosmolene is being released by production starting from 9pm to 6am in the morning. This shows production patterns of Methylosmolene is done in midnight hours.
Chemical Significant Readings
On the Dashboard Monitor, Appluimonia shows significant read from most of all sensors. This shows indication that the chemical pollution is not coming from the factories. Instead, it might come from surrounding areas.
Chemical Monthly Release Pattern
Chemicals reading pattern for AGOC-3A shows there are monthly indication of lower production at the very beginning of the month, and no production at the end of the month. This chemical pattern shows production behaviour of companies using AGOC-3A, where production tend to increase in the middle of the month, and stopped at about day 25th of every month. Except for December, production stops early at 22nd December.
Question 3
Which factories are responsible for which chemical releases? Carefully describe how you determined this using all the data you have available. For the factories you identified, describe any observed patterns of operation revealed in the data.
Companies responsible for the chemical wastes are:
- Kasios: AGOC-3A
- Radiance ColourTek: AGOC-3A, Chlorodinine
- Roadrunner Fitness Elect: Methylosmolene, Chlorodinine, AGOC-3A
- Indigo: Methylosmolene, Appluimonia
Kasios
Radiance ColourTek
Roadrunner Fitness Elect.
Indigo Sol Boards
References
Wind Polygon: https://community.tableau.com/thread/148044
VAST Challenge 2017 [Mini-Challenge 2]: http://www.vacommunity.org/VAST+Challenge+2017+MC2