Difference between revisions of "ISSS608 2017-18 T3 Assign See Kwan Yen Visualization"

From Visual Analytics and Applications
Jump to navigation Jump to search
 
(15 intermediate revisions by the same user not shown)
Line 1: Line 1:
==Chemical Trends==
+
==Trends: Do you see any trends of possible interest in this investigation?==
 
<table border='1'>
 
<table border='1'>
 
<tr>
 
<tr>
Line 29: Line 29:
 
</tr>
 
</tr>
 
<tr>
 
<tr>
<td><b> 2. Measure Insight - Aluminum Contamination</b>
+
<td><b> 2. Chemicals by Locations</b>
<br>Water sensor readings have shown Aluminum to be present in 2008-2010 in 2 of the waterways: from Busarakha, Chai, to Kannika and from Somchair to Sakda. In all 5 of the locations, Aluminum value rose to over 400 ug/l in the month of April 2009. Aluminum values were observed to be the greatest in April 17, 18, and 25, all within the span of a week and occurring on the same day for pairs of the locations.</td>
+
<br>Since Iron has the highest level previously, figure 1.3 shows the trend of Iron across all locations.  Iron had been consistently low for Boonsri, Busarakhan, Chai, Kannika, Kohsoom, Sakda, and Somchair. Kohsoom has the highest level of Iron among all locations. It is almost non-existent in Achara, Decha and Tansanee.  
<td>[[File:1.2_rb.png|700px|center]]</td>
 
</tr>
 
<tr>
 
<td><b> 3. Measure Insight - Total Chlorodine and Hardness Groups </b>
 
Total Hardness and Chlorodinine have been observed to have very distinct group of values before and after a certain period across the locations in the waterway.
 
  
<br> Total Hardness is shown to have a pattern of:
+
However, a spike in the value of Iron is observed among 6 of the locations as shown in chart.
<br> (1) Average values (before 2010)
+
Investigating further on 2003 period shows that values Copper have also spiked during the same day for Kohsoom.
<br> (2) Period of missing or low values (2010-2014)
 
<br> (3) Higher values (2014 onwards)
 
  
<br> Total Chlorodinine is shown to have a pattern of:
+
While we do not know the cause, the correlation of such values seem to suggest a contamination incident around that period.</td>
<br> (1) Higher values (before 2016)
 
<br> (2) Lower values (2016 onward)
 
  
<br> Such consistent trends merit interest and are possible for further investigation as to the change in general average of the measures/properties of water.
+
<td>
 +
[[File:MapChemical2.png|600px|center]]
 
</td>
 
</td>
<td>[[File:1.3_rb.png|700px|center]]</td>
 
 
</tr>
 
</tr>
 
<tr>
 
<tr>
<td><b> 4. Measure Insight - Growing presence of Arsenic</b>
+
 
<br>In figure 1.4, Arsenic has been observed to have increasing value in the recent years across most of the locations in the reserve and as far is the data is concerned, has shown no signs of stopping. Further investigation as to the possible cause of such continuous growth would be advised as to keep it from getting into dangerous levels.</td>
+
<td><b> 3. Measure Insight - Increasing presence of Arsenic</b>
<td>[[File:1.4_rb.png|700px|center]]</td>
+
<br>In figure 1.4, Arsenic has seen an increasing trend across many locations. Further investigation as to the possible cause of such continuous growth would be advised as to prevent the ecosystem of the waterways and the habitat from being contaminated further.</td>
 +
<td>
 +
[[File:Arsenic.png|600px|center]]
 +
</td>
 
</tr>
 
</tr>
 
<tr>
 
<tr>
<td><b> 5. Location Insight - Kohsoom Fecal Fears</b>
+
<td><b> 4. Location Insight - Achara & Kohssom</b>
<br> While several of the locations have their share of fecal coliforms, possibly excrements of the wildlife in the area, Kohsoom has significantly more traces of fecal streptococci, peaking at 330mg/l in April 27, 2011, seen in figure 1.5. Streptococci is a strain of bacteria that can cause urinary tract infections, meningitis, and respiratory infections to animals. While the values for the measures have dropped ever since, it would be wise to further study the contamination event, as the bacteria present in the animal feces is indicative of the animal’s health and they may have indeed been infected at the time.</td>
+
<br> There is an elevated level of Mercury in Achara in 2010 to 2011. Prior to that, there is no data for Mercury in this location. Kohsoom also saw an increased level of mercury in 2006. </td>
<td>[[File:1.5_rb.png|700px|center]]</td>
+
<td>
 +
[[File:Mercury.png|600px|center]]
 +
</td>
 
</tr>
 
</tr>
 
<tr>
 
<tr>
<td><b> 6. Location Insight - November 22, 2014 in Tanasee</b>
+
</table>
As seen in the figure 1.6A, among the locations, Tanasee was observed to have a recent period of high ammonium that peaked with 15 mg/l on November 22, 2014.
 
  
After examining the rest of the data for Tanasee around that date, figure 1.6B shows that Chlorides, Nitrites, and Total Dissolved Salts have also been shown to have the same pattern of heightened increase. This suggests that contamination may have occurred prompting the increased values for the measures.
+
==Anomalies: What anomalies do you find in the waterway samples dataset? How do these affect your analysis of potential problems to the environment? Is the Hydrology Department collecting sufficient data to understand the comprehensive situation across the Preserve? What changes would you propose to make in the sampling approach to best understand the situation? ==
 +
 
 +
<table>
 +
<table border='1'>
 +
<th>Patterns</th>
 +
<th>Visualization</th>
 +
<tr>
 +
<td> <b> 1. Missing data for specific years and locations </b>
 +
<br> The figure 2.1 provides a clear view of missing samples of Sulphates and uneven sampling. This is a cause of major concern. We can clearly see that are missing values for samples in Achara, Decha and Tansanee. Plus a very low level of samples taken in those sites compared to the other locations. At the same time, we can appreciate a high level of samples in Boonsri, Chai, Kannika, Sakda specially in a range of years from 2002 to 2009. This scenario occurred for many other chemicals too.  </td>
 +
<td>
 +
[[File:Heatmapcpunt.png|600px|center]]
 
</td>
 
</td>
<td>[[File:1.6a_rb.png|600px|center]]
 
[[File:1.6b_rb.png|600px|center]]</td>
 
 
</tr>
 
</tr>
 
<tr>
 
<tr>
<td><b> 7. The incident of August 15, 2003</b>
+
<td> <b> 2.Methylosmoline reading started from 2014 onwards but inconsistent </b>
After earlier identifying Iron as having the top value of all the measures that were read, figure 1.8A was created to see the trend of Iron across all locations. With the exception of Achara, Decha, and Tanasee, locations that have just recently had traces of Iron, Iron had been consistently low for Boonsri, Busarakhan, Chai, Kannika, Kohsoom, Sakda, and Somchair.  
+
<br>Methylosmoline is the major contaminant chemical. However, there are only 3 years of samples available: 2014, 2015 and 2016. The sampling distribution of Methylosmoline varies across the locations. We can possibly traced it back to when Kasios started setting up its factories in the area and also when they started producing Methylosmoline. There is a high probability that if they coincide, then Kasios is highly probable to be found guilty of the production of Methylosmoline.
 +
 
 +
<td>[[File:Boxplot2.JPG|600px|center]]</td>
 +
</tr>
 +
<tr>
 +
<td> <b>3.Other chemicals have uneven distributions in different locations </b>
 +
<br> The figure shows Aluminium and Iron distributions of samples taken by locations, as we can see they are very differently distributed with outliers clearly marked in locations like Chai or Boonsri </td>
 +
<td>
 +
[[File:BoxAluminium.png|300px|left]]
 +
 
 +
 
 +
[[File:Boxcount.png|300px|right]]
 +
</td>
 +
</tr>
  
However, a spike in the value of Iron is observed among the latter group of locations. On average, without the spike, Iron values for these group of locations is around 1 mg/l. For Boonsri, the value had jumped to 30mg/l on October 15, 2002, 30 times more. For the rest of the locations, all having the spike on the same day, August 15, 2003, the value had reached 5 digits ranging from 17,970 in Chai to 37,959 in Kohsoom. Taking the sum of the values on the spike, the iron observed on the day accounts for 99.07% of total iron values collected from the reserve.
+
</table>
 +
==Causes of concern: do any of your findings cause particular concern for the Pipit or other wildlife? Would you suggest any changes in the sampling strategy to better understand the waterways situation in the Preserve? ==
  
Investigating further on August 15, 2003, figure 1.8C shows that values for Chromium and Copper have also spiked during the same day for the locations. This is especially true for Kohsoom, having the peak of all 3 measures on the same day.
+
<br>
 +
<table>
 +
<table border='1'>
 +
<tr>
 +
<th> Causes of Concern </th>
 +
<th>Visualization</th>
 +
</tr>
 +
<tr>
 +
<td>
 +
<br> A major cause of concern is the average values of Methylosmoline in locations nearby the dumping site. I have created a control chart to represent outliers in 1,2 or 3 standard deviations and show specific outliers in the data.
 +
<td>The figure below shows the outliers of 2 standard deviation of Methylosmoline in Kohsoom.
 +
[[File:Control chart.png|500px|center]]</td>
 +
</tr>
 +
<tr>
 +
<td> Suggestions for sampling strategy:
 +
<br>1. I suggest that we should regularly check and maintain the sensors in those locations, as some missing data could be caused by impairment of the hardware. <br> 2.Also, the number of times a reading is performed need to be consistent in the same river source, so that we can compare the data across the locations in that same area. <br> 3.We should focus on 3 groups: a main group comprising Area 4 - Boonsri - Kohsoom - Busarakhan - Chai and Kannika. A second group comprising joined locations in the river in Area 3 like Achara, Somchair,Sakda. A third group comprising:  Decha and Tansanee.  This way we could understand the impact of chemicals from one location flowing to other locations down the rivers.
 +
<td>
 +
[[File:Riversource.png|600px|center]]
 +
</td>
 +
</tr>
  
While we cannot say entirely for certain the cause, the correlation of such values seem to suggest a contamination incident around the said date.
+
<tr>
 +
<td> Levels of Methylosmoline detected across different days of a week
 +
<br> The cycle plot beside shows that the readings in Somchair usually hit a peak (darker green shade) on weekend, especially Sundays. Also noticed that Kohsoom has higher than usual amount of Methylosmoline on Thursday.  Interestingly, the readings in Somchair are close to zero on the same day. This pattern certainly raise suspicion that  Kasios is trying to avoid detection by relevant authorities on weekdays and only produce the harmful chemical on weekends in Somchair. However, there is a high chance that the waste was dumped at Kohsoom instead on Thursday.
 +
<td>
 +
[[File:Timing1.png|800px|center]]
 
</td>
 
</td>
<td>[[File:1.7a.png|700px|center]]
 
[[File:1.7b.png|700px|center]]</td>
 
 
</tr>
 
</tr>
 +
</table>
 +
 +
<table>
 +
<table border='1'>
 +
<tr>
 +
 +
</tr>
 +
<tr>
 +
 
</table>
 
</table>

Latest revision as of 01:13, 9 July 2018

Trends: Do you see any trends of possible interest in this investigation?

Patterns Visualization
1. Overview of chemical tracking across the Years

An overview of the water sensor readings across the years will give us a snapshot of the level of chemicals in the wwterway.


Figure 1.1 shows the average water sensor reading values by year of all the measures by units of mg/l and locations. It is apparent that iron had the highest value among all the measures in 2003. This is followed by high amount of Total Coliforms and Total Dissolved Salts.


Figure 1.2 shows the average water sensor reading values by year of the measure by units of microgram per liter.Aluminium has the highest amount of value in 2009, followed by Barium in 2009, Boron in 2012 and Zinc across multiple years.


Some observation on the data:
(a) Consistent readings
Measures such as those of Cadmium,Copper and Lead are consistent throughout the years in all the locations, though values may vary with different measures across time


(b) Present for a certain period of time
Measures such as those of PCB and Endrin are observed to be only present in a period of time or in certain location. Measures of these pattern may hint at contamination which have to be examined.


(c) Rare readings
Measures such as those of Barium and PAHS are rarely observed. Only a few instances were recorded by the water sensors. Measures of this pattern may still hint at a possible contamination though with such little data.

Overview of chemicals.png
Overview of microgram.png
2. Chemicals by Locations


Since Iron has the highest level previously, figure 1.3 shows the trend of Iron across all locations. Iron had been consistently low for Boonsri, Busarakhan, Chai, Kannika, Kohsoom, Sakda, and Somchair. Kohsoom has the highest level of Iron among all locations. It is almost non-existent in Achara, Decha and Tansanee.

However, a spike in the value of Iron is observed among 6 of the locations as shown in chart. Investigating further on 2003 period shows that values Copper have also spiked during the same day for Kohsoom.

While we do not know the cause, the correlation of such values seem to suggest a contamination incident around that period.
MapChemical2.png
3. Measure Insight - Increasing presence of Arsenic
In figure 1.4, Arsenic has seen an increasing trend across many locations. Further investigation as to the possible cause of such continuous growth would be advised as to prevent the ecosystem of the waterways and the habitat from being contaminated further.
Arsenic.png
4. Location Insight - Achara & Kohssom
There is an elevated level of Mercury in Achara in 2010 to 2011. Prior to that, there is no data for Mercury in this location. Kohsoom also saw an increased level of mercury in 2006.
Mercury.png

Anomalies: What anomalies do you find in the waterway samples dataset? How do these affect your analysis of potential problems to the environment? Is the Hydrology Department collecting sufficient data to understand the comprehensive situation across the Preserve? What changes would you propose to make in the sampling approach to best understand the situation?

Patterns Visualization
1. Missing data for specific years and locations
The figure 2.1 provides a clear view of missing samples of Sulphates and uneven sampling. This is a cause of major concern. We can clearly see that are missing values for samples in Achara, Decha and Tansanee. Plus a very low level of samples taken in those sites compared to the other locations. At the same time, we can appreciate a high level of samples in Boonsri, Chai, Kannika, Sakda specially in a range of years from 2002 to 2009. This scenario occurred for many other chemicals too.
Heatmapcpunt.png
2.Methylosmoline reading started from 2014 onwards but inconsistent


Methylosmoline is the major contaminant chemical. However, there are only 3 years of samples available: 2014, 2015 and 2016. The sampling distribution of Methylosmoline varies across the locations. We can possibly traced it back to when Kasios started setting up its factories in the area and also when they started producing Methylosmoline. There is a high probability that if they coincide, then Kasios is highly probable to be found guilty of the production of Methylosmoline.

Boxplot2.JPG
3.Other chemicals have uneven distributions in different locations
The figure shows Aluminium and Iron distributions of samples taken by locations, as we can see they are very differently distributed with outliers clearly marked in locations like Chai or Boonsri
BoxAluminium.png


Boxcount.png

Causes of concern: do any of your findings cause particular concern for the Pipit or other wildlife? Would you suggest any changes in the sampling strategy to better understand the waterways situation in the Preserve?


Causes of Concern Visualization


A major cause of concern is the average values of Methylosmoline in locations nearby the dumping site. I have created a control chart to represent outliers in 1,2 or 3 standard deviations and show specific outliers in the data.

The figure below shows the outliers of 2 standard deviation of Methylosmoline in Kohsoom.
Control chart.png
Suggestions for sampling strategy:


1. I suggest that we should regularly check and maintain the sensors in those locations, as some missing data could be caused by impairment of the hardware.
2.Also, the number of times a reading is performed need to be consistent in the same river source, so that we can compare the data across the locations in that same area.
3.We should focus on 3 groups: a main group comprising Area 4 - Boonsri - Kohsoom - Busarakhan - Chai and Kannika. A second group comprising joined locations in the river in Area 3 like Achara, Somchair,Sakda. A third group comprising: Decha and Tansanee. This way we could understand the impact of chemicals from one location flowing to other locations down the rivers.

Riversource.png
Levels of Methylosmoline detected across different days of a week


The cycle plot beside shows that the readings in Somchair usually hit a peak (darker green shade) on weekend, especially Sundays. Also noticed that Kohsoom has higher than usual amount of Methylosmoline on Thursday. Interestingly, the readings in Somchair are close to zero on the same day. This pattern certainly raise suspicion that Kasios is trying to avoid detection by relevant authorities on weekdays and only produce the harmful chemical on weekends in Somchair. However, there is a high chance that the waste was dumped at Kohsoom instead on Thursday.

Timing1.png