ISSS608 2017-18 T3 Assign Vaishnavi Praveen Agarwal DataPrep

From Visual Analytics and Applications
Jump to navigation Jump to search

Pic.jpg   VAST Challenge: Mini Challenge 2

The Challenge

Data Preparation

Visualizations

Insights and Conclusion

Feedback and Comments

All Assignments

 


Data Description

File Name

Variables

Boonsong Lekagul waterways readings (.csv file)

The given data file has reading of 106 chemicals at 10 different locations from 1998 - 2016.

i. id (numeric , unique)
ii. value (numeric)
iii. location (string)
iv. sample date (string)
v. measure (string)

chemical units of measure (.csv file)

The given data file has the unit of each measure (106 measures) in which the readings were taken.

i. measure (string, unique)
ii. unit (string)

Waterways Final (.jpg file)

The Waterways Final is a map image that shows the location of dumping site and the waterways.

Waterways Final.jpg


Data Preparation

1. Grouping Measures

There are a total of 106 measures given in the data file, and it is difficult to view all the measures at a glance. In order to make the visualization simple I have grouped the measure based on their Chemical Composition and Behavior.

- A new column was added in the file chemical units of measure.csv
- Name the column as groups
- Chemical properties of each measure was studied and they were divided in 12 groups.
- Manually assigned the relevant group to each measure.

The 12 groups formed are:

Groups

Measures

Field measurements

Anionic active surfactants, Fecal coliforms, Fecal streptococci, Total coliforms, Total extractable matter, Total hardness

Heavy metals

Arsenic, Cadmium, Chromium, Copper, Iron, Lead, Manganese, Mercury, Nickel, Zinc

Hydrocarbon

Acenaphthene, Acenaphthylene, Anthracene, Benzo(a)anthracene, Benzo(a)pyrene, Benzo(b)fluoranthene, Benzo(g,h,i)perylene, Benzo(k)fluoranthene, Chrysene, Fluoranthene, Fluorene, Indeno(1,2,3-c,d)pyrene, Naphthalene, PAHs, Pentachlorobenzene, Petroleum hydrocarbons, Phenanthrene, Pyrene, Tetrachloromethane

Insecticides, Pesticides and Herbicides

Alachlor, Aldrin, Atrazine, Endosulfan (alpha), Endosulfan (beta), gamma-Hexachlorocyclohexane, Isodrin, Methoxychlor, Metolachlor, p,p-DDD, p,p-DDE, p,p-DDT, Simazine, Trifluralin

Metal

Aluminium, Barium, Berilium, Boron, Calcium, Cesium, Magnesium, Potassium, Selenium, Sodium

Mineral and Nutrients

Dissolved silicates, Orthophosphate-phosphorus, Silica (SiO2), Total dissolved phosphorus, Total nitrogen, Total phosphorus

Nitrogen

Inorganic nitrogen, Organic nitrogen, Total organic carbon

Organic

alpha-Hexachlorocyclohexane, AOX, beta-Hexaxchlorocyclohexane, Dieldrin, Endrin, Heptachlor, Heptachloroepoxide, Hexachlorobenzene, PCB 101, PCB 118, PCB 138, PCB 153, PCB 180, PCB 28, PCB 52

Others

1,2,3-Trichlorobenzene, 1,2,4-Trichlorobenzene, AGOC-3A, Macrozoobenthos, Methylosmoline

Oxygen

Biochemical Oxygen, Chemical Oxygen Demand (Cr), Chemical Oxygen Demand (Mn), Dissolved organic carbon, Dissolved oxygen, Oxygen saturation

Salt

Ammonium, Bicarbonates, Carbonates, Chlorides, Chlorodinine, Cyanides, Nitrates, Nitrites, Sulfides, Sulphates, Total dissolved salts

Water temperature

Water temperature

2. Coordinate Plotting

Waterways Final is a Map image provided in the data, but the data files does not contain Latitude and Longitude for the Places shown on the Map. Therefore, to use the Map as an interactive sheet, I will create the Latitude and Longitude manually in the following way:

Steps

Image

Adding Data and Background Image for Map

  • Open Boonsong Lekagul waterways readings (.csv file) in Tableau, right click on Location Dimension, click on Geographic Role and select ZIP Code / Postcode. This will generate Latitude and Longitude in Measures. Add the generated fields to Columns and Rows respectively.
  • Go to Map options and click on Background Images. In the dialogue box, browse the Waterways Final (.jpg file). In the X Field set Right value to 249 and in the Y Field set the Top value to 249. Go to options and select Always show entire image option and Apply.
Vaishnavi 1.png

Annotate Point to generate Coordinates

  • Create a new excel file Locations, add a column named Location and manually enter the places. Add other two column named X and Y.
  • Right click on the tip of the arrow on map, click on Annotate and select Point.
  • In the dialog box, insert fields Latitude(generated) and Longitude(generated).
  • Manually copy paste the annotated values in the Location.xlsx file, X and Y columns.
  • Do this for all the locations.
Vaishnavi 2.png

3. Combining Files

After cleaning and preparing the data, we have total 3 files: Boonsong Lekagul waterways readings (.csv file), chemical units of measure (.csv file) and Location (.xlsx file). We will combine these file using the common columns as Join clause and then do the Visualization.

Steps

Image

Combining Boonsong Lekagul waterways readings (.csv file) and chemical units of measure (.csv file)

  • Open the Data Source, add a new Text File Boonsong Lekagul waterways readings (.csv file).
  • Again click on Add in Connections, click on Text File and add chemical units of measure (.csv file).
  • Click on Join and select Left Join.
Vaishnavi 3.png
  • Use Measure as the Join clause between the two files.
Vaishnavi 4.png


Combining Boonsong Lekagul waterways readings (.csv file) and Location (.xlsx file)

  • Open the Data Source, add a new Microsoft Excel File Location (.xlsx file).
  • Click on Join and select Left Join.
Vaishnavi 5.png
  • Use Location as the Join clause between the two files.
Vaishnavi 6.png


4. Final Dataset

Vaishnavi 7.png