Data Preparation Q2 Sumalika
Data Pre-Processing:
Dataset: Sensor Data
Tools & Techniques:
1. JMP
2. Tableau
3. Excel
1. Check for Missing Values in the given data set:
The Sensor data set given by VAST and it consists of no missing values.
2. Check for Duplicate Values:
Since there is no unique identifier, no issue of duplicate values.
The date field has 1 value for each, hence no repetitions in date value.
3. Analyse variable distributions:
Date: Refers to date fields for Months April, August and December
Chemical: Almost equal in number with a maximum count of AGOG-3A and minimum count of Methylosmolene
Monitor: A total of 9 sensors are located around the Factories.
Data Transformation
To identify the correlations between each chemical transform the data using Pivot function in Excel. The data for this plot is transformed as below: