Difference between revisions of "ISSS608 2017 T3 Assign BI HE Question2"

From Visual Analytics and Applications
Jump to navigation Jump to search
Line 41: Line 41:
 
'''* Incorrect data (abnormal value)'''
 
'''* Incorrect data (abnormal value)'''
 
some values such as the data for iron in 2002 and 2003, and nitrates in some place are extremely high, and it is suspected to be incorrect.<br />
 
some values such as the data for iron in 2002 and 2003, and nitrates in some place are extremely high, and it is suspected to be incorrect.<br />
 +
<table width=90%>
 +
<tr>
 +
<td>
 +
[[File:bh32.png|500 px|left]]</td>
 +
<td>[[File:bh_8.png|500 px|left]]</td>
 +
</tr>
 +
</table>
 +
<table width=90%>
 +
<tr>
 +
<td>
 +
[[File:bh33.png|500 px|left]]</td>
 +
<td>[[File:bh_17.png|500 px|left]]</td>
 +
</tr>
 +
</table>
  
 
Hydrology Department is not collecting sufficient data to understand the comprehensive situation across the Preserve. I propose that Hydrology Department set up automatic sensors in each observation spot and get the regular and precise data for data analysts.
 
Hydrology Department is not collecting sufficient data to understand the comprehensive situation across the Preserve. I propose that Hydrology Department set up automatic sensors in each observation spot and get the regular and precise data for data analysts.

Revision as of 11:28, 10 July 2018



The Challenge

Data Preparation

Question 1

Question 2

Question 3 & Dashboard

 

Q2 What anomalies do you find in the waterway samples dataset? How do these affect your analysis of potential problems to the environment? Is the Hydrology Department collecting sufficient data to understand the comprehensive situation across the Preserve? What changes would you propose to make in the sampling approach to best understand the situation?

Anomalies for waterway samples data

* Redundant data The original dataset has more than one records for the same location in the same time, which are redundant, and may make noise in further research.

* Missing data Using the heat maps with count of records number as marks show an overview for the whole dataset. The data for many chemistry indicators are not continues and incomplete. it is impossible to find the correlation between chemistry indicators and find the pattern.

* Incorrect data (abnormal value) some values such as the data for iron in 2002 and 2003, and nitrates in some place are extremely high, and it is suspected to be incorrect.

Bh32.png
Bh 8.png
Bh33.png
Bh 17.png

Hydrology Department is not collecting sufficient data to understand the comprehensive situation across the Preserve. I propose that Hydrology Department set up automatic sensors in each observation spot and get the regular and precise data for data analysts.