ISSS608 2018-19 T3 Assign PARAM GADHAIYA MethodologynDataprep

From Visual Analytics and Applications
Jump to navigation Jump to search

Tenor.gif VAST MINI CHALLENGE 2: Like a Duck to Water


Abstract

Methodology and Data Preparation

Insights

Conclusion

 

The following datasets were used for creating the visualizations:

Datasets

Description

Boonsong Lekagul Waterways Readings

The .xlsx file contains details about the sample id, location from where the sample is taken, date when the sample is taken, type of measure and value of measure.

Chemical Units of Measure

The .xlsx file contains the name of the measure and the unit of measurement

Map of Waterways

The .jpg file contains the map which comprises of the several dumping grounds in Boonsong Lekagul waterways

Creating the Geolocation points

To find the X and the Y coordinates of every dumping ground in Boonsong Lekagul Waterways, the following steps were taken:
1. Load the Boonsong Lekagul Waterways excel file onto Tableau. Under the maps options, select background images. Select add an image and input the name, file and X and Y field as shown below.

1(2).png
2. Based on the scale of the map, Select the exact point and click on annotate to insert and obtain the X and Y coordinates. Input the coordinates obtained on to a new excel file. The excel sheet now has the following values:
2(2).png
3. Save the data sheet and left join the new data set to the existing dataset.
3(2).png
4. Click on new sheet and drag and drop X coordinate into columns and Y coordinates into rows. Add the background image and check the scale. The map of the dumping grounds appear as below:
4(2).png


Creating the Dataset

1. Go to Tableau and select import from Microsoft Excel.
2. Add the other files you would like to join using the “add connection” option in Tableau
3. Click on left join to join the datasets.
4. The new dataset is now created.
5(2).png


Missing Value Pattern Detection

The missing value pattern option in JMP is used to detect the missing value patterns in the dataset. Under the tables option, we select Missing value pattern. Based on our analysis, we observe that the dataset does not contain any missing values and hence no data cleaning or amendments would be made to the file.
6(2).png

Creating Calculated Fields