ISSS608 2016-17 T3 Assign CHEN XIAOQING

From Visual Analytics and Applications
Jump to navigation Jump to search

JO Smokestack.jpg ISSS608 Assign Chen Xiaoqing - MC2

Background

Analytical Results

Data Preparations and Methods

Comments

 



Background

Vast.png


Ornithology student Mitch Vogel was immediately suspicious of the noxious gases just pouring out of the smokestacks from the four manufacturing factories south of the nature preserve. He was almost certain that all of these companies are contributing to the downfall of the poor Rose-crested Blue Pipit bird. But when he talked to company representatives and workers, they all seem to be nice people and actually pretty respectful of the environment.

In fact, Mitch was surprised to learn that the factories had recently taken steps to make their processes more environmentally friendly, even though it raised their cost of production. Mitch discovered that the state government has been monitoring the gaseous effluents from the factories through a set of sensors, distributed around the factories, and set between the smokestacks, the city of Mistford and the nature preserve. The state has given Mitch access to their air sampler data, meteorological data, and locations map. Mitch is very good in Excel, but he knows that there are better tools for data discovery, and he knows that you are very clever at visual analytics and would be able to help perform an analysis.

Mini-Challenge 2 provides a three month set of data for you to analyze, covering April, August, and December 2016.

Objectives/Questions

  1. Characterize the sensors’ performance and operation. Are they all working properly at all times? Can you detect any unexpected behaviors of the sensors through analyzing the readings they capture?Limit your response to no more than 9 images and 1000 words.
  2. Now turn your attention to the chemicals themselves. Which chemicals are being detected by the sensor group? What patterns of chemical releases do you see, as being reported in the data? Limit your response to no more than 6 images and 500 words.
  3. Which factories are responsible for which chemical releases? Carefully describe how you determined this using all the data you have available. For the factories you identified, describe any observed patterns of operation revealed in the data. Limit your response to no more than 8 images and 1000 words.

Data Descriptions

Overview

This data consists of sensor readings from a set of air-sampling sensors and meteorological data from a weather station in proximity to the factories and sensors.

The factories and sensors locations are provided in terms of x,y coordinates on a 200x200 grid, with (0,0) at the lower left hand corner (southwest).

The following are the factory locations:

Monitor and Factories Locations

Roadrunner Fitness Electronics: 89,27

  • Kasios Office Furniture: 90,21
  • Radiance ColourTek: 109,26
  • Indigo Sol Boards: 120,22

The following are the sensor locations:

  • 1: 62,21
  • 2: 66,35
  • 3: 76,41
  • 4: 88,45
  • 5: 103,43
  • 6: 102,22
  • 7: 89,3
  • 8: 74,7
  • 9: 119,42

Companies

Factory.jpg

Manufacturing Companies near Mistford:

  • Roadrunner Fitness Electronics – Roadrunner produces personal fitness trackers, heart rate monitors, headlamps, GPS watches, and other sport-related consumer electronics. Roadrunner began as one of the region’s first fitness stores in 1962, with an eye toward outfitting the entire nation with appropriate outdoor gear. After an earthquake nearly destroyed their main warehouse in 1968, Roadrunner turned a bad situation into a glowing success with the first “slightly damaged goods” sale. After which they began to focus on manufacturing; though their “Earthshaking Bargains” business still sells dented, overstocked and refurbished items over the internet and from a small retail shop attached to their front office.
  • Kasios Office Furniture – Kasios Office Furniture manufactures metal and composite-wood office furniture including desks, tables, and chairs. Kasios wants to do with desk chairs what Starbucks did for coffee – making office furniture what people must have, instead of what they just need. “Office equipment doesn’t need to be ugly!” says founder Ken Kasios. “We have redesigned all office products to be cool, fun, and hip—even your basic stapler.” Kasios business model is focused on in-store merchandising highlighting the beauty and functionality of their “user-centered design”. They recently celebrated the one-year anniversary of a distribution and merchandising agreement with the national office supply chain store PaperKlips.
  • Radiance ColourTek – Radiance produces solvent based optically variable metallic flake paints. “Metallic paints with an untarnished reputation!” quips ColourTek’s Senior Vice President Arthur Donner. “Radiance ColourTek metallic paints are worth their weight in gold.” Offering a new generation of paints in the 1970s, Radiance out marketed all competitors for three decades until manufacturing process issues began to tarnish their reputation. “We were challenged,” said Donner. “Polishing up our pearlescent pigments caused us to lose luster, but now we have the lowest VOCs (volatile organic compounds) in the industry!”
  • Indigo Sol Boards – Indigo Sol produces skateboards and snowboards. Founder Billy Keys started off manufacturing wooden wine barrels for northwestern US wineries, but then navigated a course from decorative fiberglass wine barrels to making his first pair of fiberglass skis in 1971. Excellent product and sales decisions rocketed Keys Skis production to unexpected levels, until they were bought out by a large Denver, Colorado-based private investment group. Keys returned to making specialized snowboards in the 1980s, with a small company in Mistford called Indigo Sol. The company has seen modest growth in recent years.


Chemicals

Chemical.jpg
  • Appluimonia – An airborne odor is caused by a substance in the air that you can smell. Odors, or smells, can be either pleasant or unpleasant. In general, most substances that cause odors in the outdoor air are not at levels that can cause serious injury, long-term health effects, or death to humans or animals. However, odors may affect your quality of life and sense of well-being. Several odor-producing substances, including Appluimonia, are monitored under this program.
  • Chlorodinine – Corrosives are materials that can attack and chemically destroy exposed body tissues. Corrosives can also damage or even destroy metal. They begin to cause damage as soon as they touch the skin, eyes, respiratory tract, digestive tract, or the metal. They might be hazardous in other ways too, depending on the particular corrosive material. An example is the chemical Chlorodinine. It has been used as a disinfectant and sterilizing agent as well as other uses. It is harmful if inhaled or swallowed.
  • Methylosmolene – This is a trade name for a family of volatile organic solvents. After the publication of several studies documenting the toxic side effects of Methylosmolene in vertebrates, the chemical was strictly regulated in the manufacturing sector. Liquid forms of Methylosmolene are required by law to be chemically neutralized before disposal.
  • AGOC-3A – New environmental regulations, and consumer demand, have led to the development of low-VOC and zero-VOC solvents. Most manufacturers now use one or more low-VOC substances and Mistford’s plants have wholeheartedly signed on. These new solvents, including AGOC-3A, are less harmful to human and environmental health.

Analytic Results and Answers

Question 1

Are all the sensors working properly all the time?

To see if the sensor is working properly all the time, we can visualize the number of record of each monitor at specific chemical and same timestamp.

Interestingly, we can find there are some duplicate records in the same occasion and also some missing records.

5-1.png5-2.png5-3.png

The dark blue indicates there are two records in this timestamp and green color is only 1 record. While the white color represents there is no record.

Conclusion:

1. April

Monitor 3,4,5,6,7,8 and 9 have some duplicate records when detecting chemical AGOC-3A while also have some no records when read the chemical Methylosmolene. These monitors except 1 and 2, are not working well all the time in April.

2. August

Monitor 1,2,3,4,5,6 and 9 have some duplicate records when detecting chemical AGOC-3A while also have some no records when read the chemical Methylosmolene. These monitors except 7 and 8, are not working properly all the time in April.

3. December

The monitor working condition is the same as it in August. Monitors except monitor 7 and 8, are not working properly all the time.


1-1

To characterize the performance of each monitor, first use the line chart to compare the reading in every time stamp with the overall median reading value, whether the reading in this time stamp is higher or lower than the median value. Then characterize how the monitor performs and operates through the whole period.

The line chart consists of two lines in each monitor panel. One line is the reading of this monitor, another one is the overall median value. It has four chemicals and each chemical detected characteristics is not the same, so it’s not reasonable to sum them up. It does more sense to see the monitors operates with each chemical. The same reason for looking at individual month. So here placed two filters, by month and by chemical. The whole dashboard is like right graph 1-1:




Findings:

Finding 1:

1-2 April

Above graph indicates the reading of four chemicals of each monitor in April. The X-axis is the hour of each day in April, while the Y-axis represents the monitors. Compare the reading of these four pollutions through whole period, the characteristics are shown below:

  • Monitor 3, 7 and 8 are consistently detecting the relative higher reading value compare to the median reading of four chemicals.
  • The reading detected by rest monitors (1, 2, 4, 5, 6 and 9) along this month is consistently fluctuated around the median reading value with some cases that detect relative higher value above the median value in all chemicals. (monitor 1 and 2 have very few cases that detect relative higher values.)
  • Note that the number of cases of detecting relative higher value is much higher in detecting AGOC-3A than detecting others. The reason probably is the duplicated records of reading this chemical. (there are 168 records with same time stamp, same monitor but two different reading records) Another reason may be this kind of chemical is more actively released around this park.
                                                       Duplicated Records 


Finding 2:

August

The characteristics of monitors in August are slightly different with those in April.

  • Not only monitor 3 and 7 but also monitor 4 are consistently detect values higher than median.
  • Monitor 1, 2, 8 and 9 detect the values just around the median with very few cases that will much higher than the median. While the rest monitors have read more cases with higher value.


Finding 3:

December

  • Monitor 3, 4, 5, 7 and 9 always have read the relative higher value than median during this month. The reading of 9 is just slightly higher than the median.
  • Monitor 1, 2, 6 and 8’s readings fluctuated around the median value. Only the monitor 6 in detecting AGOC-3A has higher variance.


Conclusion:

                                                   Conclusion 

From the conclusion table we can see the monitors have three main performances in reading the chemicals. There are two main finding in the performance of all the monitors:

  • The monitor 3 has always read the consistent higher value in all moths and all chemicals.
  • Monitor 1 and 2 have caught the fluctuated reading around median all the time no matter what the chemicals and whenever the months.

These two behaviors may be considered as unexpected performances due to consistent the same behavior. We have higher possibility to suspect that they may not be working properly all the time because the performance of both monitor 3 and 1&2 has remained the same all the time no matter what the chemical is and which moth it is in. What’s more, the reading detected by sensor 3 is much higher than the median at all situations.


Take a closer look at the unexcepted behaviors using the horizon graph. The horizon graph of all the reading through three months shows the difference between individual reading and median value in a same horizon, using orange-green diverging color to represent the positive and negative value. The density of color represents the level of difference from median. The darker the color is, the higher it differs from median.

                                 Horizon Graph 

First take a look at the monitor 1, which represents as consistently slight lower than the median through this whole period. As previous analysis showed, the reading of monitor 1 fluctuated around median, which here obviously is wrong. The reason probably is the reading of monitor 1 is very slightly lower than the median and we cannot see it clearly at first look. Although it’s different from previous result, the monitor 1 still may be not working properly all the time. The performance always tends to read lower value consistently.

Then is the monitor 2, both the green and orange have taken up almost half percentage with relative low variance. This is identical to previous analysis. The level of difference between reading of monitor 3 and median is very high during all the time. Many time stamps have reached the highest level.

Interestingly, monitor 4 also seems not to be working properly all the time. It performs differently during all three months, which represents around median, relative higher than median and much higher than median in April, August and December respectively.

Question 2

1) To determine which chemicals are mainly be detecting by sensor group, we’d better use the boxplot to see the reading distribution of each chemical in each monitor. Define the top-3 covered quartile size into the main sensor group.

April August December


Below table shows the conclusion of grouping sensors for detecting each chemical through the distribution of reading in three months.

                                              Conclusion

2) Chemical Release Pattern:

  • Looking the reading month by month, it shows the total release of each chemical has an upward trend from April to December. Then looking it day by day in each month, the release of Appluimonia and Chlorodinie remains almost the same in the whole month (not changing too much), which the AGOC-3A and Methylosmolene have their specific pattern. The release patterns of AGOC-3A in three months are quite similar, which has been fluctuating during around 5-25 days. The release pattern of Methylosmolene remained consistent during the most days in whole month but there are some irrgular days fluctuated a lot.
                                 Release Pattern-1
  • Below gragh is to see the release pattern in persepctive of hour in a day. It’s obivious that the release pattern of Appluimonia and Chlorodinie remains constant through whole day. But the AGOC-3A release is mainly gathered from 5:00 – 0:00, while Methylosmolene is mainly collected from 21:00 to 6:00 in the next day.
                                 Release Pattern-2


Conclusion:

Release Pattern-2

Question 3

1) Defined Rules:

plot the wind covered area using the wind direction and wind speed detected by each monitor. Because the wind data is detected by each sensor, so here I reverse the wind coverage area to see if which factory is the source to release chemicals. If it is under the area, it means the chemical is released by this factory.

The wind triangle starts from each sensor, reverse the wind direction. Two sides are the distance calculated by speed, the angle is self-defined. Technically, the angle is related to the height of chimney, which is not available in this data set. So here just assume it is 30.

Then combine the wind plot and horizon graph to check those time stamps with high reading of chemicals belongs to which factory. The high reading value here is determined as the value higher than median+2.04, which is the 5th band in the horizon graph.

                the dashboard of detecting chemical release from which factory

Steps:

  • Highlight the value in 5th band first to find those corresponding time stamps.
  • See which factory is covered under the corresponding wind detected area.
  • Manually check one by one, record which factory is covered and its frequency.
  • Summarize the frequency of each factory related to release of each chemical.
                one example for how to detect

For example, highlight the value in 5th band, then use the dashboard action to filter the wind detected area in the above map at this specific time stamp. This graph shows the reading of monitor 3 in April-22 0:00 is 2,473 and the wind detected area cover factory Roadruuner and Kasio, which means these two factories probably are responsible for the release of chemical Appluimonia.

Result:

After repeating several times, will get the frequency of each factory being exposed in the wind coverage area under the high reading value of chemicals.

Recording: the Chinese character “正” stands for frequency 5.

Frequency Records=============>>>Summary Table

The table indicates the frequency of each factory showed up at the area of the wind under each chemical’s release. Yellow color highlights the highest frequency of being exposed in the wind area. Indigo has the highest frequency 74 times of the high release value both in AGOC-3A and Methylosmolene. Factory Radiance has the frequency 59 and 68 times releasing high amount of Applumonia and Chlorodinie respectively.

Conclusion:

  • Factory Indigo are mainly responsible for AGOC-3A and Methylosmolene release.
  • Factory Radiance are mainly responsible for Applumonia and Chlorodinie release.


2) The pattern of operation of Indigo and Radiance

Pattern-1

It has been identified factories Indigo and Radiance are mainly responsible for release the chemicals AGOC-3A & Methylosmolene and Applumonia & Chlorodinie. So we can identify the pattern of two factories’ operation through the release pattern of their representative chemicals. The aggregation reading of AGOC-3A & Methylosmolene can represent the operation of Indigo while aggregation reading of Applumonia & Chlorodinie can reflect the operation pattern of Radiance.

                                    Pattern by Month

See the operation month by month, both Indigo and Radiance has an increasing trend of the release amount of chemicals from April to December. It means both factories have an increasing pattern of operation.


Pattern-2

Take a closer look at the pattern day by day, Indigo is more fluctuated in 30 days of all three months compared to those patterns of Radiance with relative constant pace (with lower variance).

                                     Pattern by Day


Pattern-3

Lastly look at it hour by hour in a whole day. Radiance still has a very constant and consistent pattern in whole day of all three months. While the relative busy working hour in a day of three month varies. Indigo is operating more often at 0:00-14:00, 5:00-23:00 and 22:00-7:00(+1) in April, August and December respectively.

                                     Pattern by Hour

Assignment Tableau Workbook Link

Tableau work sheets:

https://public.tableau.com/profile/xiaoqing7362#!/vizhome/Assignment_Chen_Xiaoqing/monitorperformance?publish=yes