ISSS608 2018-19 T3 Assign PARAM GADHAIYA MethodologynDataprep
|
|
|
|
|
The following datasets were used for creating the visualizations:
|
Datasets |
Description |
|
Boonsong Lekagul Waterways Readings |
The .xlsx file contains details about the sample id, location from where the sample is taken, date when the sample is taken, type of measure and value of measure. |
|
Chemical Units of Measure |
The .xlsx file contains the name of the measure and the unit of measurement |
|
Map of Waterways |
The .jpg file contains the map which comprises of the several dumping grounds in Boonsong Lekagul waterways |
Contents
Creating the Geolocation points
To find the X and the Y coordinates of every dumping ground in Boonsong Lekagul Waterways, the following steps were taken:
1. Load the Boonsong Lekagul Waterways excel file onto Tableau. Under the maps options, select background images.
Select add an image and input the name, file and X and Y field as shown below.
2. Based on the scale of the map, Select the exact point and click on annotate to insert and obtain the X and Y coordinates. Input the coordinates obtained on to a new excel file. The excel sheet now has the following values:
3. Save the data sheet and left join the new data set to the existing dataset.
4. Click on new sheet and drag and drop X coordinate into columns and Y coordinates into rows. Add the background image and check the scale. The map of the dumping grounds appear as below:
Creating the Dataset
1. Go to Tableau and select import from Microsoft Excel.
2. Add the other files you would like to join using the “add connection” option in Tableau
3. Click on left join to join the datasets.
4. The new dataset is now created.
Missing Value Pattern Detection
The missing value pattern option in JMP is used to detect the missing value patterns in the dataset. Under the tables option, we select Missing value pattern. Based on our analysis, we observe that the dataset does not contain any missing values and hence no data cleaning or amendments would be made to the file.
Creating Calculated Fields