Difference between revisions of "ISSS608 2017 T3 Assign BI HE DataPrep"

Latest revision as of 12:12, 10 July 2018

ISSS608 2017-18 T3 Assign Bi He MC2




Data Preparation

The original dataset contains 136,824 rows, but some combinations of time and location have more than one record, which may distort further analysis. Where duplicates exist, they are replaced by a single record holding their average value.
The “Summary” function in JMP can be used to generate the new table.


The new dataset contains 67,503 rows.
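
The averaging step described above was done with JMP's “Summary” function. For readers without JMP, the same idea can be sketched in plain Python. This is only an illustration under stated assumptions: the tuple layout `(timestamp, location, measure, value)`, the function name `average_duplicates`, and the sample rows are hypothetical, and grouping by measure in addition to time and location is an assumption not spelled out in the text.

```python
from collections import defaultdict
from statistics import mean


def average_duplicates(records):
    """Collapse records that share (timestamp, location, measure) into their mean.

    The grouping key is an assumption; the source only says duplicates share
    the same time and the same location.
    """
    groups = defaultdict(list)
    for timestamp, location, measure, value in records:
        groups[(timestamp, location, measure)].append(value)
    # One averaged value per key replaces all duplicates for that key.
    return {key: mean(values) for key, values in groups.items()}


# Hypothetical sample rows, not taken from the real dataset.
rows = [
    ("2016-01-05", "SiteA", "Iron", 2.0),
    ("2016-01-05", "SiteA", "Iron", 4.0),
    ("2016-01-05", "SiteB", "Iron", 1.5),
]
deduplicated = average_duplicates(rows)
```

Here the two `SiteA` rows collapse into one record, so three input rows become two output records.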

Measure Method

Waterway data is usually driven by the seasonal change of the river, the pulse of the water. Month-to-month comparison suits this situation because it can eliminate the seasonal effect.

[Figure: Bh24.png]


Monthly average


To observe changes across years, the monthly average can be used as a baseline. Create a calculated field and compute the monthly average using the formula below. This measure can be used to detect anomalies in the time series for a specific location.

[Figure: Bh25.png]


%vs monthly average





Diff from monthly avg.

Diff from monthly avg. = %vs monthly average - 1
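
The three monthly measures can be sketched together in plain Python. This is a rough illustration, not the calculated fields used in the original workbook, whose formulas were given as images. It assumes that the monthly average pools the same calendar month over all years for one location and measure, and that "%vs monthly average" is the ratio of a value to that average; only the relation Diff = %vs monthly average - 1 comes from the text. All names and the sample data are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def monthly_average(values):
    """Mean per (month, location, measure), pooled over all years (assumed)."""
    pooled = defaultdict(list)
    for (year, month, location, measure), value in values.items():
        pooled[(month, location, measure)].append(value)
    return {key: mean(vals) for key, vals in pooled.items()}


def versus_monthly_average(values):
    """Return {key: (ratio, diff)}, where diff = ratio - 1 as in the text."""
    baseline = monthly_average(values)
    result = {}
    for key, value in values.items():
        year, month, location, measure = key
        avg = baseline[(month, location, measure)]
        if avg == 0:
            continue  # ratio undefined for a zero baseline; skipped in this sketch
        ratio = value / avg
        result[key] = (ratio, ratio - 1)
    return result


# Hypothetical values keyed by (year, month, location, measure).
sample = {
    (2015, 1, "SiteA", "Iron"): 2.0,
    (2016, 1, "SiteA", "Iron"): 6.0,
}
measures = versus_monthly_average(sample)
```

With this sample the January baseline for `SiteA` is 4.0, so the 2016 reading sits 50% above it and the 2015 reading 50% below.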

[Figure: Bh26.png]


Location avg.

This measure compares data from different locations at the same time; it can be used to identify abnormal locations within a specific time period.





%vs location avg.

{FIXED [year],[month],[Measure]:AVG({FIXED [year],[month],[Measure],[Location]: SUM([Mean(value)])})}

Diff from location avg. = %vs location avg. - 1
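
The nested FIXED expression above can be read as two grouping passes: first sum the values per year, month, measure, and location, then average those sums across locations for each year, month, and measure. A plain-Python sketch of that reading follows. It is an illustration only: the record layout, function names, and sample data are hypothetical, and treating "%vs location avg." as the ratio of a location's total to the cross-location average is an assumption made by analogy with the monthly measure.

```python
from collections import defaultdict
from statistics import mean


def location_average(records):
    """Mirror the nested FIXED expression with two grouping passes."""
    # Inner level: SUM of values per (year, month, measure, location).
    per_location = defaultdict(float)
    for year, month, measure, location, value in records:
        per_location[(year, month, measure, location)] += value
    # Outer level: AVG of those sums across locations.
    across = defaultdict(list)
    for (year, month, measure, _location), total in per_location.items():
        across[(year, month, measure)].append(total)
    averages = {key: mean(totals) for key, totals in across.items()}
    return dict(per_location), averages


def versus_location_average(records):
    """Return {key: (ratio, diff)}; the ratio definition is assumed."""
    per_location, averages = location_average(records)
    result = {}
    for key, total in per_location.items():
        avg = averages[key[:3]]
        if avg == 0:
            continue  # ratio undefined for a zero average; skipped in this sketch
        ratio = total / avg
        result[key] = (ratio, ratio - 1)
    return result


# Hypothetical rows: (year, month, measure, location, value).
sample_rows = [
    (2016, 1, "Iron", "SiteA", 1.0),
    (2016, 1, "Iron", "SiteA", 2.0),
    (2016, 1, "Iron", "SiteB", 1.0),
]
location_measures = versus_location_average(sample_rows)
```

In this sample `SiteA` totals 3.0 and `SiteB` totals 1.0, giving a location average of 2.0, so `SiteA` stands out at 50% above the average for that month.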