Difference between revisions of "Answer1"
Line 30: | Line 30: | ||
*(2) Duplicate values: | *(2) Duplicate values: | ||
− | The dataset has duplicate record for the same and | + | The dataset has duplicate record for the same location, same sample date and same chemicals. |
*(3) Outliers: | *(3) Outliers: | ||
− | + | The value of zinc at Decha are high at 2009 and that of lead are high during 2009-2015, but the values of other chemicals seem normal. The value of Nitrite of Tansanee at 2010 and 2014 are higher than that of other location. | |
− | |||
Kohsoom has many outliers than other location.From this plot, it is clear that value of chemicals at Kohsoom is always higher other that of location on the same stream. And the value of Busarakhan, Chai which are on the downstream of Kohsoom are also relatively higher. It can be assumed that the source of water pollution may locate near Kohsoom and Busarakhan, then the pollution goes down the stream and influences Chai. | Kohsoom has many outliers than other location.From this plot, it is clear that value of chemicals at Kohsoom is always higher other that of location on the same stream. And the value of Busarakhan, Chai which are on the downstream of Kohsoom are also relatively higher. It can be assumed that the source of water pollution may locate near Kohsoom and Busarakhan, then the pollution goes down the stream and influences Chai. |
Revision as of 10:00, 14 July 2018
Vast Chanllenge 2018 MC2
Like a duck to water
Data Preparation | Visualization Design | Answer | Application | Assignments | |
Contents
- 1 Question 1
- 2 Question 2
- 2.1 What anomalies do you find in the waterway samples dataset?
- 2.2 How do these affect your analysis of potential problems to the environment?
- 2.3 Is the Hydrology Department collecting sufficient data to understand the comprehensive situation across the Preserve?
- 2.4 What changes would you propose to make in the sampling approach to best understand the situation?
- 3 Question 3
Question 1
Characterize the past and most recent situation with respect to chemical contamination in the Boonsong Lekagul waterways.
From the trellis plot by year, Cadmium, Chromium, Lead, Copper, Zinc and Chemical Oxygen Demand (Cr) show relative more fluctuation during 1999-2016 and the rest of the chemicals keep more even at the same time.
Do you see any trends of possible interest in this investigation?
According to above table, the Cadmium, Chromium, Lead, Copper, Magnesium, Zinc have more change on the trend.
Question 2
What anomalies do you find in the waterway samples dataset?
- (1) Missing values:
The data is not complete because many chemicals have too many missing values and can not be used to analyze and lose completeness of time.
- (2) Duplicate values:
The dataset has duplicate record for the same location, same sample date and same chemicals.
- (3) Outliers:
The value of zinc at Decha are high at 2009 and that of lead are high during 2009-2015, but the values of other chemicals seem normal. The value of Nitrite of Tansanee at 2010 and 2014 are higher than that of other location.
Kohsoom has many outliers than other location.From this plot, it is clear that value of chemicals at Kohsoom is always higher other that of location on the same stream. And the value of Busarakhan, Chai which are on the downstream of Kohsoom are also relatively higher. It can be assumed that the source of water pollution may locate near Kohsoom and Busarakhan, then the pollution goes down the stream and influences Chai.