Difference between revisions of "Huang Yiyun Data Preparation"

From Visual Analytics and Applications
Jump to navigation Jump to search
(Created page with "<div style=background:#2B3856 border:#A3BFB1> 300px <font size = 6; color="#FFFFFF">ISSS608 Assignment Huang Yiyun - MC2 </font> </div> <!--MAIN HEADER...")
 
 
Line 43: Line 43:
 
*There is no missing data in "Boonsong Lekagul waterways readings.csv" dataset, but there is only one missing value in "chemical units of measure.csv" dataset, I just deleted it.<br>[[File: missing1.png|600px|center]][[File: missing2.png|600px|center]]
 
*There is no missing data in "Boonsong Lekagul waterways readings.csv" dataset, but there is only one missing value in "chemical units of measure.csv" dataset, I just deleted it.<br>[[File: missing1.png|600px|center]][[File: missing2.png|600px|center]]
 
*There are some sample dates need to recode.I recoded 2098 to 1998 and 2099 t0 1999.<br>[[File: Sampledates.png|600px|center]]
 
*There are some sample dates need to recode.I recoded 2098 to 1998 and 2099 t0 1999.<br>[[File: Sampledates.png|600px|center]]
In "chemical units of measure.csv", units are different,such as mg/l and μg/l.  
+
In "chemical units of measure.csv", units are different,such as mg/l and μg/l, which need to be consistent.<br>[[File: units.png|600px|center]]
 
|-
 
|-
 
|  
 
|  
2.
+
Excel
 
||
 
||
Microsoft Excel<br>
 
(Data Wrangling)
 
||
 
*Use filter and pivot table function in Microsoft Excel extract out the data we need and rename them as revised value(μg/l).
 
*Given thr geographical analysis we also record the geo-code of background image in the data file.[[File:Zhimao5.png|200px]]
 
|-
 
|
 
3.
 
||
 
&emsp;&emsp;Tableau<br>
 
(Data Analysis)
 
||
 
*Import raw data into Tableau and inner join the geo-code.[[File:Zhimao6.png|500px]]
 
<br>
 
*From the view of trend change, we can oberserve there are two dramaticlly increase in both 2003 and 2009.
 
*In addition, 2 locations were taken into consideration of sampling extraction site since 2009. Especillay in Tansanee, it always occupies the largest value of chemical contamination in recent years.<br>
 
[[File:Zhimao7.png|800px|center]]
 
<br>
 
*To prove whether the Kasios Furniture Company caused environmental damage to the Boonsong Lekagul Wildlife Preserve, we select the sample from recent 3 years and combined them with the maps.<br>
 
[[File:Zhimao8.png|800px|center]]
 
<br>
 
*To be specific, we would like to view the measure performances in different locations in a treemap.
 
[[File:Zhimao9.png|800px|center]]
 
|-
 
|}
 
</div>
 

Latest revision as of 23:41, 19 August 2018

MC2 2018.jpg ISSS608 Assignment Huang Yiyun - MC2

Overview

Data Preparation

Visualization

Conclusion

 



Tool

 Approach

  Findings

JMP

Data Exploration

  • There is no missing data in "Boonsong Lekagul waterways readings.csv" dataset, but there is only one missing value in "chemical units of measure.csv" dataset, I just deleted it.
    Missing1.png
    Missing2.png
  • There are some sample dates need to recode.I recoded 2098 to 1998 and 2099 t0 1999.
    Sampledates.png
In "chemical units of measure.csv", units are different,such as mg/l and μg/l, which need to be consistent.
Units.png

Excel