Difference between revisions of "Methodology"

From Visual Analytics and Applications
Jump to navigation Jump to search
(Undo revision 7174 by Yanzhang.lu.2017 (talk))
Line 1: Line 1:
 
<div style=background:#2B3856 border:#A3BFB1>
 
<div style=background:#2B3856 border:#A3BFB1>
[[Image:MC3 2018.jpg|left|250px]]
+
[[Image:Duck.jpg|250px]]
<font size = 6; font face: "Arial";font color = #FFFFF0>VAST Challenge 2018 MC3: <br>  Who is involved in the hurt Eurasian Pipit? </font>     
+
<font size = 5; color="#FFFFFF">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Vast Mini Challenge 2: Like a Duck to Water</font>     
 
</div>
 
</div>
 
<!--MAIN HEADER -->
 
<!--MAIN HEADER -->
Line 7: Line 7:
 
| style="font-family:Century Gothic; font-size:100%; solid #000000; background:#2B3856; text-align:center;" width="20%" |  
 
| style="font-family:Century Gothic; font-size:100%; solid #000000; background:#2B3856; text-align:center;" width="20%" |  
 
;
 
;
[[Introduction| <font color="#FFFFFF">'''INTRODUCTION'''</font>]]
+
[[OVERVIEW| <font color="#FFFFFF">'''Overview'''</font>]]
  
 
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%" |  
 
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%" |  
 
;
 
;
[[Data Preparation| <font color="#FFFFFF">'''DATA PREPARATION'''</font>]]
 
  
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%"|  
+
[[Methodology| <font color="#FFFFFF">'''Methodology'''</font>]]
 +
 
 +
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%" |  
 
;
 
;
[[Methodology | <font color="#FFFFFF">'''METHODOLOGY'''</font>]]
+
 
 +
[[Observations| <font color="#FFFFFF">'''Observations'''</font>]]
  
 
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%" |  
 
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%" |  
 
;
 
;
[[Insights| <font color="#FFFFFF">'''OBSERVATION AND INSIGHTS'''</font>]]
+
[[Conclusion| <font color="#FFFFFF">'''Conclusion'''</font>]]
  
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#0b3d53; text-align:center;" width="25%" |   
+
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%" |
 +
;
 +
[[Assignment_Dropbox_G1| <font color="#FFFFFF">'''Back to Dropbox'''</font>]]
 +
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%" |
 +
;
 +
|  &nbsp;
 +
&nbsp;
 +
|}
 +
<br/>
 +
 
 +
<!--METHODOLOGY CONTENT -->
 +
==<font size="5"><font color="#000000">'''Overview'''</font></font>==
 +
 
 +
Two datasets are provided to answer the questions for this challenge; one is “Boonsong Lekagul waterways readings” and other is “Chemical units of measure”. The former file includes information related to 106 chemical indicators of water quality and their readings collected from 10 locations across the preserve. There are total of 136,824 records of readings collected from January 1998 to December 2016. Belos is the snippet of dataset.
 +
 
 +
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[[Image:Rup Data.jpg|600px]]
 +
 
 +
 
 +
==<font size="5"><font color="#000000">'''Data Preparation'''</font></font>==
 +
The above mentioned two data files are combined into one by matching measure name in both the files. Description of variables from the combined files is shown below.
 +
 
 +
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[[Image:Variables.JPG|600px]]
  
[[Assignment_Dropbox_G1| <font color="#FFFFFF">Back to Dropbox</font>]]
 
|  &nbsp;
 
|}
 
  
<center>
+
==<font size="5"><font color="#000000">'''Data Cleaning'''</font></font>==
{| style="background-color:#ffffff ; margin: 3px 10px 3px 10px;" width="80%"|
+
Initial data exploration of combined data files is performed in JMP. It is found that 9700 records of various chemical indicators have value of 0. Reading value of 0 is equivalent to missing value for any chemical measure. This can be problem if 0 is not removed from records and taken into tableau for visualization. Tableau treats 0 as value and plot these points in the visualization. To avoid that, records with value 0 is removed from dataset before loading into Tableau.
| style="font-family:Open Sans, Arial, sans-serif; font-size:15px; text-align: center; border-top:solid #f5f5f5; background-color: #f5f5f5" width="150px" |
 
[[Is this company growing(2015-2017)?|<font color="#3c3c3c"><strong>Is this company growing(2015-2017)?</strong></font>]]
 
  
| style="font-family:Open Sans, Arial, sans-serif; font-size:15px; text-align: center; border:solid 1px #f5f5f5; background-color: #fff" width="150px" | 
+
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[[Image:Data Cleaning.jpg|600px]]
[[Q2|<font color="#3c3c3c"><strong>Question 2</strong></font>]]
 
  
| style="font-family:Open Sans, Arial, sans-serif; font-size:15px; text-align: center; border:solid 1px #f5f5f5; background-color: #fff" width="150px" | 
 
[[Q3|<font color="#3c3c3c"><strong>Question 3</strong></font>]]
 
|}
 
</center>
 
  
<!--Sub Heading-->
+
==<font size="5"><font color="#000000">'''Geo – Location Mapping'''</font></font>==
 +
In addition to two data files, an image file “Waterways Final” with location of various water sensors across the preserve is also provided. This image file is added to tableau workspace as a background image to extract the coordinates of every location of sensors. New excel file is created as “Location.csv” with column names as location, X and Y. The point that represents each location of sensors in the map is annotated to get X and Y coordinates. The values of coordinates are then manually included into the excel file.

Revision as of 18:09, 8 July 2018

Duck.jpg        Vast Mini Challenge 2: Like a Duck to Water

Overview

Methodology

Observations

Conclusion

Back to Dropbox

   


Overview

Two datasets are provided to answer the questions for this challenge; one is “Boonsong Lekagul waterways readings” and other is “Chemical units of measure”. The former file includes information related to 106 chemical indicators of water quality and their readings collected from 10 locations across the preserve. There are total of 136,824 records of readings collected from January 1998 to December 2016. Belos is the snippet of dataset.

                          Rup Data.jpg


Data Preparation

The above mentioned two data files are combined into one by matching measure name in both the files. Description of variables from the combined files is shown below.

                          Variables.JPG


Data Cleaning

Initial data exploration of combined data files is performed in JMP. It is found that 9700 records of various chemical indicators have value of 0. Reading value of 0 is equivalent to missing value for any chemical measure. This can be problem if 0 is not removed from records and taken into tableau for visualization. Tableau treats 0 as value and plot these points in the visualization. To avoid that, records with value 0 is removed from dataset before loading into Tableau.

                          Data Cleaning.jpg


Geo – Location Mapping

In addition to two data files, an image file “Waterways Final” with location of various water sensors across the preserve is also provided. This image file is added to tableau workspace as a background image to extract the coordinates of every location of sensors. New excel file is created as “Location.csv” with column names as location, X and Y. The point that represents each location of sensors in the map is annotated to get X and Y coordinates. The values of coordinates are then manually included into the excel file.