ISSS608 2017-18 T3 Assign Kalai Selvi Data Preparation

From Visual Analytics and Applications
Revision as of 20:44, 7 July 2018 by Kalaisr.2017 (talk | contribs)
Jump to navigation Jump to search

MC2 2018.jpg VAST Challenge 2018 MC2: Like a Duck to Water

Problem Background

Data Preparation

Methodology and Dashboard Design

Insights

Conclusion

 

Check for Missing Records

The Boonsong Lekagul waterways readings.csv data consists of the 5 fields: id, value, location, sample and measure. Vlookup function is used in the Boonsong Lekagul waterways readings to add the ‘unit’ from the chemical units of measure.csv data. Hence, now the waterway readings data now also consists of the unit field.

1.jpg


The Boonsong Lekagul waterways readings data was imported into JMP to check for missing records. There were 1,273 records with missing units and these are for the Macrozoobenthos chemical. Macrozoobenthos is a type of animal species and a unit of measurement is not required for this Macrozoobenthos measure. 2.jpg


Check for Chemical Measures with 0 values

There are 9,700 measures with 0 values. Records with 0 chemical reading value are deleted from the data as these are treated as incorrect entries. There were 127,124 records in the Final Boonsong Lekagul waterways readings after deleting the 9,700 records.

3.jpg


Create Locations.csv data and find x,y coordinates

A new excel file, Locations.csv is created and the fields are the Locations, X and Y coordinates.

The Location data was imported into Tableau. Using the Background Image tab, Waterways Final.jpg image was imported. After editing the background image, X is dragged into Column shelf and Y is dragged into Rows shelf as shown below.

4.jpg


Each location in the image is annotated as a point to find the (x,y) coordinate respectively. An example of the annotation for Boonsri is as below. After finding all the (x,y) coordinates for each location, the X and Y coordinates are updated in the Locations data.

5.jpg


Join Tables

Boonsong Lekagul waterways readings.csv and Locations.csv data are imported into Tableau and joined using the common key, location in both datasets.

6.jpg