Difference between revisions of "ISSS608 2017-18 T3 Assign Gao Jiaoyang Conclusion"

From Visual Analytics and Applications
Jump to navigation Jump to search
Line 72: Line 72:
 
===<font size="3"><font color="#8B4513"> Question2</font></font>===
 
===<font size="3"><font color="#8B4513"> Question2</font></font>===
 
What anomalies do you find in the waterway samples dataset? How do these affect your analysis of potential problems to the environment? Is the Hydrology Department collecting sufficient data to understand the comprehensive situation across the Preserve? What changes would you propose to make in the sampling approach to best understand the situation? Your submission for this question should contain no more than 6 images and 500 words.
 
What anomalies do you find in the waterway samples dataset? How do these affect your analysis of potential problems to the environment? Is the Hydrology Department collecting sufficient data to understand the comprehensive situation across the Preserve? What changes would you propose to make in the sampling approach to best understand the situation? Your submission for this question should contain no more than 6 images and 500 words.
 +
 +
===<font size="2"><font color="#8B4513"> Data Issue</font></font>===
 +
 +
<table>
 +
<table border='1'>
 +
<tr>
 +
<th>Anomalies</th>
 +
<th>Details</th>
 +
</tr>
 +
<tr>
 +
<td><b> Incomplete Data  </b>
 +
</td>
 +
<td>Neither the test variables nor the date of the test seem to be random and chaotic. The date of the test is not continuous and there is no regularity, always suddenly record some measures and then stop without given any reasons.  Apart from that, the test variables are not uniform, and most test data on test days are incomplete. Also, test times are rarely the same at different locations. All of these are likely to result in errors in subsequent data analysis. Because, if different test locations are compared with data from different test dates and variables, the results obtained are of little significance.
 +
 +
[[Image:GJY measure record.jpg|800px]]
 +
 +
If we transform the raw dataset by split the measure into column, we can explore the missing pattern more directly.
 +
 +
</td>
 +
</tr>
 +
 +
<tr>
 +
<td><b> Improper Timestamp </b>
 +
</td>
 +
<td> The raw dataset includes multiple records of the same measure in the same date at the same location with different values. This may due to the different test time in that day, if so, the timestamp should also include the precise time in that day instead of only the date. This issue may confuse investigators and also result in wrong conclusion.
 +
</td>
 +
</tr>
 +
 +
<tr>
 +
<td><b> Insufficient Data Source </b>
 +
</td>
 +
<td> If we want to detect the water pollution, we need more information to define the situation not just the timeline records. First, a large number of objective conditions may affect data measurements, which makes the data less "objective." Second, if we want to make sure that water pollution is related to he Kasios Furniture Company, we also need emissions data from Kasios. Third, a detailed data description will be more helpful.
 +
</td>
 +
</tr>
 +
</table>
 +
 +
===<font size="2"><font color="#8B4513"> Proposal Change</font></font>===
 +
 +
 
===<font size="3"><font color="#8B4513"> Question3</font></font>===
 
===<font size="3"><font color="#8B4513"> Question3</font></font>===
 
After reviewing the data, do any of your findings cause particular concern for the Pipit or other wildlife? Would you suggest any changes in the sampling strategy to better understand the waterways situation in the Preserve? Your submission for this question should contain no more than 6 images and 500 words.
 
After reviewing the data, do any of your findings cause particular concern for the Pipit or other wildlife? Would you suggest any changes in the sampling strategy to better understand the waterways situation in the Preserve? Your submission for this question should contain no more than 6 images and 500 words.
  
 
==<font size="5"><font color="#8B4513">'''Reference'''</font></font>==
 
==<font size="5"><font color="#8B4513">'''Reference'''</font></font>==

Revision as of 18:08, 8 July 2018

Duck pic.jpg VAST Challenge 2018 MC2: Like a Duck to Water

Background

Data Description & Data Preparation

Visualization

Conclusion & Reference

 


Conclusion

Question1

Characterize the past and most recent situation with respect to chemical contamination in the Boonsong Lekagul waterways. Do you see any trends of possible interest in this investigation? Your submission for this questions should contain no more than 10 images and 1000 words.

Measure Details
Water Temperature The water temperature trends are looks similar among the 10 locations. And all of them have quite same up and down trend among these years. Even though 3 of the locations which are Achara, Decha, Tansanee are only be recorded from 2009, they still have the same trend with others. In additional, the whole trend of the water temperature in 10 locations are increasing over the 15 years.

GJY water change.jpg

Macrozoobenthos Now the records of Macrozoobenthos are not continuously as they were before, Busarakhan and Somchair did not record since 2008.

The values of Macrozoobenthos are not change too much from 1998 to 2013, but in 2014 it suddenly increased to 36 in Kannika and 37 in Sakda , then the values came back to around 2 in the next year. Since the higher value of Macrozoobenthos may indicate the better water quality, we may have a guess that the water quality had a improvement in 2014 at this two location.

GJY M.jpg
Chemical Contaminant When we visual the data, we found that there are 4 measures only be recorded in recent years. And this may caused by the change of the water quality.

As a whole, we can see that the Total dissolved phosphorus has the highest value compare to the other 3 measures in the three years, and this one has the most changeable trend. AGOC-3A and Chlorodinine have low and steady values except AGOC-3A suddenly increased sharp in Dec 2014.

GJY new.jpg

Through the map, we can see that the Kohsoom has the highest value of new chemicals and the Chai has the lowest value. Also the river route should be Kohsoom to Chai.

GJY map of chemicals.jpg

Question2

What anomalies do you find in the waterway samples dataset? How do these affect your analysis of potential problems to the environment? Is the Hydrology Department collecting sufficient data to understand the comprehensive situation across the Preserve? What changes would you propose to make in the sampling approach to best understand the situation? Your submission for this question should contain no more than 6 images and 500 words.

Data Issue

Anomalies Details
Incomplete Data Neither the test variables nor the date of the test seem to be random and chaotic. The date of the test is not continuous and there is no regularity, always suddenly record some measures and then stop without given any reasons. Apart from that, the test variables are not uniform, and most test data on test days are incomplete. Also, test times are rarely the same at different locations. All of these are likely to result in errors in subsequent data analysis. Because, if different test locations are compared with data from different test dates and variables, the results obtained are of little significance.

GJY measure record.jpg

If we transform the raw dataset by split the measure into column, we can explore the missing pattern more directly.

Improper Timestamp The raw dataset includes multiple records of the same measure in the same date at the same location with different values. This may due to the different test time in that day, if so, the timestamp should also include the precise time in that day instead of only the date. This issue may confuse investigators and also result in wrong conclusion.
Insufficient Data Source If we want to detect the water pollution, we need more information to define the situation not just the timeline records. First, a large number of objective conditions may affect data measurements, which makes the data less "objective." Second, if we want to make sure that water pollution is related to he Kasios Furniture Company, we also need emissions data from Kasios. Third, a detailed data description will be more helpful.

Proposal Change

Question3

After reviewing the data, do any of your findings cause particular concern for the Pipit or other wildlife? Would you suggest any changes in the sampling strategy to better understand the waterways situation in the Preserve? Your submission for this question should contain no more than 6 images and 500 words.

Reference