Difference between revisions of "Data Preparation Q3 Sumalika"

From Visual Analytics and Applications
 
(2 intermediate revisions by the same user not shown)
Line 1: Line 1:
 +
<div style="background:#FFFFFF ; border:#001a66; padding-left:15px; text-align:center;">
 +
<font size = 5; color="#001a66"><span style="font-family:Century Gothic;">ISSS608: Visual Analytics and Applications</span></font>
 +
<br/>
 +
<font size = 5; color="#001a66"><span style="font-family:Century Gothic;">VAST CHALLENGE 2017 </span></font>
 +
<br/>
 +
<font size = 4; color="#001a66"><span style="font-family:Century Gothic;"> '''- SUMALIKA KODUMURU''' </span></font>
 +
 +
<!--MAIN HEADER -->
 +
{|style="background-color:#001a66;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0"  |
 +
 +
| style="font-family:Century Gothic; font-size:100%; solid #000000; background:#001a66; text-align:center;" width="16.6%" | 
 +
;
 +
[[ISSS608 2016-17 T3 Assign SUMALIKA KODUMURU | <font color="#FFFFFF">Assignment Overview</font>]]
 +
 +
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#001a66; text-align:center;" width="16.6%" | 
 +
;
 +
[[Data Overview| <font color="#FFFFFF"> Data Overview </font>]]
 +
 +
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#001a66; text-align:center;" width="16.6%" | 
 +
;
 +
[[Question1_SUMALIKA KODUMURU| <font color="#FFFFFF"> Sensor Performance </font>]]
 +
 +
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#001a66; text-align:center;" width="16.6%" | 
 +
;
 +
[[Question 2_SUMALIKA KODUMURU| <font color="#FFFFFF"> Patterns of Chemical Release </font>]]
 +
 +
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#b30000; text-align:center;" width="16.6%" | 
 +
;
 +
[[Question 3_SUMALIKA KODUMURU| <font color="#FFFFFF"> Factories Responsible </font>]]
 +
 +
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#001a66; text-align:center;" width="16.6%" | 
 +
;
 +
[[References & Feedback | <font color="#FFFFFF"> References & Feedback </font>]]
 +
 +
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#001a66; text-align:center;" width="16.6%" | 
 +
;
 +
|  &nbsp;
 +
|}
 +
</div>
 +
<br/>
 +
<!--MAIN HEADER -->
 +
{|style="background-color:#FFFFFF;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0"  |
 +
 +
| style="font-family:Century Gothic; font-size:100%; solid #000000; background:#FFFFFF; text-align:center;" width="16.6%" | 
 +
;
 +
[[Question 3_SUMALIKA KODUMURU| <font color="#b30000"> '''Go back to Analysis'''</font>]]
 +
 +
| style="font-family:Century Gothic; font size = 5; solid #1B338F; background:#FFFFFF; text-align:center;" width="16.6%" | 
 +
;
 +
|  &nbsp;
 +
|}
 +
 
''' Data Pre-Processing: '''
 
''' Data Pre-Processing: '''
  
Line 109: Line 161:
 
<br/>
 
<br/>
  
''' Step 6: ''' The above transformed data is used to make cox comb plot on tableau ( refer to the Visualization tab under Factories section)
+
''' Step 6: ''' The above transformed data is used to make cox comb plot on tableau ( refer to the Visualization tab under Factories Responsible section for tableau sheet explanation)
 
<br/>
 
<br/>

Latest revision as of 12:53, 16 July 2017

ISSS608: Visual Analytics and Applications
VAST CHALLENGE 2017
- SUMALIKA KODUMURU


Data Pre-Processing:


Dataset: Meteorological Data
Tools & Techniques:
1. JMP
2. Tableau
3. Excel


1. Check for Missing Values in the given data set:
The meteorological data set provided by the weather department contains a few missing values, which can be excluded from the analysis. The two rows containing missing values were identified using JMP; a screenshot is provided.
[Screenshot: JMP output showing the two rows with missing values]

2. Check for Duplicate Values:
Because the data set has no unique identifier, duplicate values in the other fields are not a concern.
Each date value appears only once, so there are no repeated dates.

3. Analyse variable distributions:

Date: The date field covers the months of April, August and December.

[Figure: distribution of Date]


Wind Direction: Wind direction ranges from 0.1 to 359.1 degrees, with a mean of 236 degrees.

[Figure: distribution of Wind Direction]


Wind Speed: Wind speed ranges from 0.1 to 6.8 m/s.

[Figure: distribution of Wind Speed]


4. Data Transformation:


- For Wind Speed Trend Analysis: Join the sensor data with the meteorological data to obtain the trends in wind speed with respect to the chemicals.
Step 1: For each date point in the weather data, compute the average wind speed using Pivot Tables.
Step 2: For each date point in the sensor data, compute the sum of the readings for the 4 chemicals using a Pivot Table and map it to the respective date.
Step 3: The final data source for the wind speed trend analysis is formed by joining the transformed tables from Step 1 and Step 2. The table is as follows:
[Table: average wind speed and summed chemical readings by date]
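
The three pivot-and-join steps above were carried out with Pivot Tables; as a rough equivalent, they could be sketched in Python with pandas. The column names (`date`, `wind_speed`, `chemical`, `reading`), the sample rows, and the choice to aggregate by calendar day are all assumptions made for illustration and are not taken from the VAST files.

```python
import pandas as pd

# Hypothetical sample rows; column names and values are illustrative only.
weather = pd.DataFrame({
    "date": pd.to_datetime(["2016-04-01 00:00", "2016-04-01 03:00", "2016-04-02 00:00"]),
    "wind_speed": [2.0, 4.0, 3.0],
})
sensor = pd.DataFrame({
    "date": pd.to_datetime(["2016-04-01 00:00", "2016-04-01 01:00", "2016-04-02 00:00"]),
    "chemical": ["A", "B", "A"],
    "reading": [1.5, 0.5, 2.0],
})


def daily_wind_and_readings(weather: pd.DataFrame, sensor: pd.DataFrame) -> pd.DataFrame:
    """Join average wind speed per day with the summed chemical readings per day."""
    # Step 1: average wind speed for each calendar day (assumed granularity).
    avg_wind = (
        weather.assign(day=weather["date"].dt.date)
        .groupby("day", as_index=False)["wind_speed"].mean()
        .rename(columns={"wind_speed": "avg_wind_speed"})
    )
    # Step 2: sum of readings across all chemicals for each calendar day.
    total = (
        sensor.assign(day=sensor["date"].dt.date)
        .groupby("day", as_index=False)["reading"].sum()
        .rename(columns={"reading": "total_reading"})
    )
    # Step 3: join the two transformed tables on the day.
    return avg_wind.merge(total, on="day", how="inner")


trend = daily_wind_and_readings(weather, sensor)
```

An inner join keeps only the days present in both tables, which matches the idea of excluding rows with missing weather values.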


- For Cox Comb Plot: Join the sensor data with the meteorological data to build the Cox Comb plot, which helps identify the factories with high chemical release while taking wind speed and wind direction into account.
Step 1: Convert the wind speed from m/s to MPH and then to grid units, because the given nature park map is divided into a 200 X 200 grid in which 200 grid units correspond to 12 miles (200/12 grid units per mile).
Convert the wind speed values using the formulas:
Wind Speed (MPH) = Wind Speed (m/s) * 2.23
Wind Speed (Grids) = Wind Speed (MPH) * (200/12)
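
As a minimal sketch, the two conversion formulas above can be written as a small Python function. The function and constant names are hypothetical; the factor 2.23 is the one used in the text (the exact m/s to MPH factor is about 2.237).

```python
MPS_TO_MPH = 2.23          # factor used in the text; the exact value is about 2.237
GRIDS_PER_MILE = 200 / 12  # from the formula above: 200 grid units per 12 miles


def wind_speed_to_grids(speed_mps: float) -> float:
    """Convert a wind speed in m/s to grid units per hour on the 200 x 200 map."""
    speed_mph = speed_mps * MPS_TO_MPH
    return speed_mph * GRIDS_PER_MILE


# The maximum observed wind speed of 6.8 m/s, expressed in grid units per hour.
max_speed_grids = wind_speed_to_grids(6.8)
```

Keeping the two factors as named constants makes it easy to swap in the more precise conversion factor without touching the rest of the calculation.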
Step 2: Map each sensor to its location and name the coordinates X and Y. Refer to the table below for clarity. The Location column, provided by VAST, is for information purposes and is used to obtain the X and Y values.

[Table: sensors with their Location, X and Y values]


Step 3: Map the meteorological data to the location data. The data must be transformed so that every day and its weather data are mapped to the respective sensors and chemicals. The transformed data is as follows.
Step 4: To represent the wind direction in Tableau, the Cox Comb plot works on the concept of a plume model.

[Figure: plume model triangle]
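
The mapping described in Step 3 could be sketched in pandas as a cross join between the weather rows and the sensor locations, followed by a join to the readings. This is only an illustration: the column names, the sensor coordinates, and the sample values below are hypothetical and are not the VAST values.

```python
import pandas as pd

# Hypothetical sensor coordinates on the 200 x 200 grid (not the VAST values).
sensors = pd.DataFrame({"monitor": [1, 2], "x": [60, 70], "y": [20, 30]})

# Hypothetical weather rows; column names are assumptions.
weather = pd.DataFrame({
    "date": pd.to_datetime(["2016-04-01", "2016-04-02"]),
    "wind_direction": [190.5, 236.0],
    "wind_speed": [4.0, 3.1],
})

# Hypothetical chemical readings per sensor and date.
readings = pd.DataFrame({
    "date": pd.to_datetime(["2016-04-01", "2016-04-01", "2016-04-02"]),
    "monitor": [1, 2, 1],
    "chemical": ["A", "A", "B"],
    "reading": [1.2, 0.4, 2.5],
})


def map_weather_to_sensors(weather, sensors, readings):
    """Attach every weather row to every sensor, then add that sensor's readings."""
    # Pair each weather row with each sensor location (requires pandas >= 1.2).
    grid = weather.merge(sensors, how="cross")
    # Add the chemical readings recorded by that sensor on that date, if any.
    return grid.merge(readings, on=["date", "monitor"], how="left")


mapped = map_weather_to_sensors(weather, sensors, readings)
```

A left join is used in the second step so that sensor and date combinations without a reading remain visible as missing values instead of silently disappearing.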


Step 5: Assign path IDs 1, 2 and 3 to calculate the other coordinates of the triangle shown above.

The transformed table is as follows:

[Table: transformed data with path IDs]

Applying the above formula to our table variables:

[Figure: formula applied to the table variables]
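
The author's formula for the triangle is contained in a figure and is not reproduced in the text, so the following Python sketch is not that formula. It is one plausible construction under stated assumptions: the wind direction is a bearing in degrees measured clockwise from north, path ID 1 is the sensor location, and path IDs 2 and 3 lie at the wind speed distance (in grid units) along that bearing, spread by a hypothetical half-angle.

```python
import math


def plume_triangle(x, y, wind_direction_deg, length, half_angle_deg=10.0):
    """Return (path_id, x, y) tuples for a plume triangle anchored at a sensor.

    Assumptions (not from the source): bearings are clockwise from north,
    and half_angle_deg is an arbitrary spread chosen for illustration.
    """
    vertices = [(1, float(x), float(y))]  # path ID 1: the sensor itself
    for path_id, offset in ((2, -half_angle_deg), (3, half_angle_deg)):
        bearing = math.radians(wind_direction_deg + offset)
        # For a bearing measured from north, x uses sin and y uses cos.
        vx = x + length * math.sin(bearing)
        vy = y + length * math.cos(bearing)
        vertices.append((path_id, vx, vy))
    return vertices


# Hypothetical sensor at (60, 20), wind bearing 236 degrees, plume length 50 grid units.
triangle = plume_triangle(60, 20, 236.0, 50.0)
```

Each vertex carries its path ID so that a polygon mark in Tableau can connect the points in order.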

Step 6: The transformed data above is used to build the Cox Comb plot in Tableau (refer to the Visualization tab under the Factories Responsible section for an explanation of the Tableau sheet).