Difference between revisions of "ISSS608 2016-17 T3 Assign SUMALIKA KODUMURU"

From Visual Analytics and Applications
Jump to navigation Jump to search
 
(34 intermediate revisions by 2 users not shown)
Line 1: Line 1:
 +
<div style="background:#FFFFFF ; border:#001a66; padding-left:15px; text-align:center;">
 +
<font size = 5; color="#001a66"><span style="font-family:Century Gothic;">ISSS608: Visual Analytics and Applications</span></font>
 +
<br/>
 +
<font size = 5; color="#001a66"><span style="font-family:Century Gothic;">VAST CHALLENGE 2017 </span></font>
 +
<br/>
 +
<font size = 4; color="#001a66"><span style="font-family:Century Gothic;"> '''- SUMALIKA KODUMURU''' </span></font>
  
<div style=background:#2B3856 border:#0099ff>
 
[[Image:Blue.gif|250px]]
 
<font size = 5; color="#ffffff">ISSS608 Visual Analytics and Applications</font>
 
</div>
 
<br/>
 
 
<!--MAIN HEADER -->
 
<!--MAIN HEADER -->
{|style="background-color:#1B338F;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0"  |
+
{|style="background-color:#001a66;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0"  |  
| style="font-family:Century Gothic; font-size:100%; solid #000000; background:#2B3856; text-align:center;" width="20%" |  
+
 
 +
| style="font-family:Century Gothic; font-size:100%; solid #000000; background:#b30000; text-align:center;" width="16.6%" | 
 +
;
 +
[[ISSS608 2016-17 T3 Assign SUMALIKA KODUMURU | <font color="#FFFFFF">Assignment Overview</font>]]
 +
 
 +
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#001a66; text-align:center;" width="16.6%" |
 
;
 
;
[[About| <font color="#FFFFFF">About</font>]]
+
[[Data Overview| <font color="#FFFFFF"> Data Overview </font>]]
 +
 +
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#001a66; text-align:center;" width="16.6%" | 
 +
;
 +
[[Question1_SUMALIKA KODUMURU| <font color="#FFFFFF"> Sensor Performance </font>]]
  
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%" |  
+
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#001a66; text-align:center;" width="16.6%" |
 
;
 
;
[[ISSS608 2016-17 T3 Assign SUMALIKA KODUMURU| <font color="#FFFFFF">Q1</font>]]
+
[[Question 2_SUMALIKA KODUMURU| <font color="#FFFFFF"> Patterns of Chemical Release </font>]]
  
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%" |  
+
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#001a66; text-align:center;" width="16.6%" |
 
;
 
;
[[Q2| <font color="#FFFFFF">Q2</font>]]
+
[[Question 3_SUMALIKA KODUMURU| <font color="#FFFFFF"> Factories Responsible </font>]]
  
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%" |  
+
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#001a66; text-align:center;" width="16.6%" |
 
;
 
;
[[Q3| <font color="#FFFFFF">Q3</font>]]
+
[[References & Feedback | <font color="#FFFFFF"> References & Feedback </font>]]
  
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%" |  
+
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#001a66; text-align:center;" width="16.6%" |
 
;
 
;
 
 
|  &nbsp;
 
|  &nbsp;
 
|}
 
|}
 +
</div>
 
<br/>
 
<br/>
  
 
+
<div style="background:#FFFFFF ; border:#001a66; padding-left:15px; text-align:center;">
<font size="5">'''The Curious Case of Rose-Crested Blue Pipit'''</font>
+
<font size = 5; color="#001a66"><span style="font-family:Century Gothic;"> '''Mystery at the Wildlife Preserve'''</span></font>
 +
[[File:Vast1.jpg|600px|center|float]]
 +
</div>
 
<br/>
 
<br/>
=Question1=
+
<font size = 4; color="#001a66"><span style="font-family:Century Gothic;">'''The Challenge Overview:'''</span> </font>
Are they all working properly at all times? Can you detect any unexpected behaviors of the sensors through analyzing the readings they capture?Limit your response to no more than 9 images and 1000 words
 
 
<br/>
 
<br/>
=Solution 1=
+
Mistford is a mid-size city located to the southwest of a large nature preserve. The city has a small industrial area with four light-manufacturing endeavors.  Mitch Vogel is a post-doc student studying ornithology at Mistford College and has been discovering signs that the number of nesting pairs of the Rose-Crested Blue Pipit, a popular local bird due to its attractive plumage and pleasant songs, is decreasing! The decrease is sufficiently significant that the Pangera Ornithology Conservation Society is sponsoring Mitch to undertake additional studies to identify the possible reasons. Mitch is gaining access to several datasets that may help him in his work, and he has asked you (and your colleagues) as experts in visual analytics to help him analyze these datasets.
[[File:Sumalika Q1P1.PNG|600px|center]]
 
  
 
<br/>
 
<br/>
All the sensors need to capture the chemical readings at all times of the day for all the three months i.e. July, August and December.The "Calendar Plot" has dates highlighted in red indicating the days when sensors ( all 9) have failed to capture chemical readings. For example: On 2nd and 6th of April, 2nd, 4th and 7th of August and 2nd and 7th of December sensors have failed to capture the readings. Further investigation needs to be performed to make a deduction on sensor readings. At this point the sensors could have
+
<font size = 4; color="#001a66"><span style="font-family:Century Gothic;">'''The Mini Challenge 2 Overview:'''</span> </font>
 
<br/>
 
<br/>
- Incorrectly captured data
+
Ornithology student Mitch Vogel was immediately suspicious of the noxious gases just pouring out of the smokestacks from the four manufacturing factories south of the nature preserve. He was almost certain that all of these companies are contributing to the downfall of the poor Rose-crested Blue Pipit bird. But when he talked to company representatives and workers, they all seem to be nice people and actually pretty respectful of the environment.
 
<br/>
 
<br/>
- Missed capturing data
 
[[File:Sumalika Q1P2.PNG|600px|center|float]]
 
 
<br/>
 
<br/>
To identify the missing data, drill down to "Hour" level to note the monitors/sensors that are responsible for these readings:
+
In fact, Mitch was surprised to learn that the factories had recently taken steps to make their processes more environmentally friendly, even though it raised their cost of production. Mitch discovered that the state government has been monitoring the gaseous effluents from the factories through a set of sensors, distributed around the factories, and set between the smokestacks, the city of Mistford and the nature preserve. The state has given Mitch access to their air sampler data, meteorological data, and locations map. Mitch is very good in Excel, but he knows that there are better tools for data discovery, and he knows that you are very clever at visual analytics and would be able to help perform an analysis.
On 2nd April and 6th April, all the sensors recorded no readings at 12 AM. Either the factories were not working or have started late than their usual time and hence readings were not recorded. But when compared to the other days in this month, rest of the days have readings captured even at 12 AM and as there is no evident pattern in the operational hours of the factories,we can conclude that the sensors were not functioning well in capturing the readings on 2nd and 6th April.  
 
 
<br/>
 
<br/>
[[File:Sumalika_Q1P5.PNG|600px|center|float]]
 
 
<br/>
 
<br/>
Similar to the above case in April month, on the 4th and 7th August all the sensors recorded no readings at 12 AM. However on that day at 12 AM Monitor 3 recorded activity, but instead of 4 chemicals it captured the release of two chemicals. On further investigation it was observed that AGOG-3A and Methylosmolene were the chemicals that released from factories.  
+
The primary job for Mitch is to determine which (if any) of the factories may be contributing to the problems of the Rose-crested Blue Pipit. Often, air sampling analysis deals with a single chemical being emitted by a single factory. In this case, though, there are four factories, potentially each emitting four chemicals, being monitored by nine different sensors. Further, some chemicals being emitted are more hazardous than others. Your task, as supported by visual analytics that you apply, is to detangle the data to help Mitch determine where problems may be. Use visual analytics to analyze the available data and develop responses to the questions below. In addition, prepare a video that shows how you used visual analytics to solve this challenge. Novel visualizations and analysis approaches are especially interesting for this mini-challenge. Please do not use any other data in your work (including other Internet-based sources or other mini-challenge data)
 
<br/>
 
<br/>
This finding raises several questions such as - Were factories operating on 2nd August at 12 AM? If no then why do we see these readings?, did the sensor capture it incorrectly ? or were there any suspicious activities? Before making any conclusions, behavior of the readings needs to be analysed and hence December readings are also considered to see possible similar patterns.
 
 
<br/>
 
<br/>
[[File:Sumalika_Q1P6.PNG|600px|center|float]]
+
The four factories in the industrial area are subjected to higher-than-usual environmental assessment, due to their proximity to both the city and the preserve. Gaseous effluent data from several sampling stations has been collected over several months, along with meteorological data (wind speed and direction), that could help Mitch understand what impact these factories may be having on the Rose-Crested Blue Pipit. These factories are supposed to be quite compliant with recent years’ environmental regulations, but Mitch has his doubts that the actual data has been closely reviewed. Could visual analytics help him understand the real situation?
<br/>
+
 
Similar to the previous months, the sensors did not capture chemicals on all days at all times. On the 2nd and 7th of December, sensors missed to read chemicals at 12 AM. On further analysis it can be seen that Monitor No: 6,7,8 show readings but not all the four chemicals. Further analysis needs to be performed on the chemicals released on these particular days and identify a pattern. Overall view of how the sensors have captured the data on each day can be seen in Calendar Plot below.
 
<br/>
 
[[File:Sumalika_Q1P7.jpg|600px|center|float]]
 
<br/>
 
[[File:Sumalika_Q1P8.PNG|450px|right|float]]
 
<br/>
 
Now that we identified missing data we need to check if there were any incorrectly captured data. With the help of a Heat Map it can be evidently deduced that there has been excess release of AGOG-3A and less of Methylosmolene.  
 
 
<br/>
 
<br/>
Two questions arise at this point. Did the sensors capture the chemicals correctly? or were there excess release of a certain chemical from a factory.
+
<font size = 4; color="#001a66"><span style="font-family:Century Gothic;">''' Questions: '''</span> </font>
 
<br/>
 
<br/>
[[File:Sumalika_Q1P9_(2).jpg|1150px|center|float]]
+
1. Characterize the sensors’ performance and operation. Are they all working properly at all times? Can you detect any unexpected behaviors of the sensors through analyzing the readings they capture?Limit your response to no more than 9 images and 1000 words.
<br/>
 
 
 
 
<br/>
 
<br/>
 
+
2. Now turn your attention to the chemicals themselves. Which chemicals are being detected by the sensor group? What patterns of chemical releases do you see, as being reported in the data? Limit your response to no more than 6 images and 500 words.
 
<br/>
 
<br/>
 +
3. Which factories are responsible for which chemical releases? Carefully describe how you determined this using all the data you have available. For the factories you identified, describe any observed patterns of operation revealed in the data. Limit your response to no more than 8 images and 1000 words

Latest revision as of 10:02, 17 July 2017

ISSS608: Visual Analytics and Applications
VAST CHALLENGE 2017
- SUMALIKA KODUMURU

Assignment Overview

Data Overview

Sensor Performance

Patterns of Chemical Release

Factories Responsible

References & Feedback

 


Mystery at the Wildlife Preserve

float


The Challenge Overview:
Mistford is a mid-size city located to the southwest of a large nature preserve. The city has a small industrial area with four light-manufacturing endeavors. Mitch Vogel is a post-doc student studying ornithology at Mistford College and has been discovering signs that the number of nesting pairs of the Rose-Crested Blue Pipit, a popular local bird due to its attractive plumage and pleasant songs, is decreasing! The decrease is sufficiently significant that the Pangera Ornithology Conservation Society is sponsoring Mitch to undertake additional studies to identify the possible reasons. Mitch is gaining access to several datasets that may help him in his work, and he has asked you (and your colleagues) as experts in visual analytics to help him analyze these datasets.


The Mini Challenge 2 Overview:
Ornithology student Mitch Vogel was immediately suspicious of the noxious gases just pouring out of the smokestacks from the four manufacturing factories south of the nature preserve. He was almost certain that all of these companies are contributing to the downfall of the poor Rose-crested Blue Pipit bird. But when he talked to company representatives and workers, they all seem to be nice people and actually pretty respectful of the environment.

In fact, Mitch was surprised to learn that the factories had recently taken steps to make their processes more environmentally friendly, even though it raised their cost of production. Mitch discovered that the state government has been monitoring the gaseous effluents from the factories through a set of sensors, distributed around the factories, and set between the smokestacks, the city of Mistford and the nature preserve. The state has given Mitch access to their air sampler data, meteorological data, and locations map. Mitch is very good in Excel, but he knows that there are better tools for data discovery, and he knows that you are very clever at visual analytics and would be able to help perform an analysis.

The primary job for Mitch is to determine which (if any) of the factories may be contributing to the problems of the Rose-crested Blue Pipit. Often, air sampling analysis deals with a single chemical being emitted by a single factory. In this case, though, there are four factories, potentially each emitting four chemicals, being monitored by nine different sensors. Further, some chemicals being emitted are more hazardous than others. Your task, as supported by visual analytics that you apply, is to detangle the data to help Mitch determine where problems may be. Use visual analytics to analyze the available data and develop responses to the questions below. In addition, prepare a video that shows how you used visual analytics to solve this challenge. Novel visualizations and analysis approaches are especially interesting for this mini-challenge. Please do not use any other data in your work (including other Internet-based sources or other mini-challenge data)

The four factories in the industrial area are subjected to higher-than-usual environmental assessment, due to their proximity to both the city and the preserve. Gaseous effluent data from several sampling stations has been collected over several months, along with meteorological data (wind speed and direction), that could help Mitch understand what impact these factories may be having on the Rose-Crested Blue Pipit. These factories are supposed to be quite compliant with recent years’ environmental regulations, but Mitch has his doubts that the actual data has been closely reviewed. Could visual analytics help him understand the real situation?


Questions:
1. Characterize the sensors’ performance and operation. Are they all working properly at all times? Can you detect any unexpected behaviors of the sensors through analyzing the readings they capture?Limit your response to no more than 9 images and 1000 words.
2. Now turn your attention to the chemicals themselves. Which chemicals are being detected by the sensor group? What patterns of chemical releases do you see, as being reported in the data? Limit your response to no more than 6 images and 500 words.
3. Which factories are responsible for which chemical releases? Carefully describe how you determined this using all the data you have available. For the factories you identified, describe any observed patterns of operation revealed in the data. Limit your response to no more than 8 images and 1000 words