Difference between revisions of "ISSS608 2016-17 T3 Assign APARAJITA SHUKLA DataPrep"
(dataprep) |
|||
(2 intermediate revisions by the same user not shown) | |||
Line 48: | Line 48: | ||
Where two files are .csv and one is .txt. | Where two files are .csv and one is .txt. | ||
+ | [[Image:dp1.png|700px]] | ||
− | The first file, Boonsong Lekagul waterways readings.csv contains 5 columns namely Id, value, location, sample date, measure and have 136924 rows. | + | * The first file, Boonsong Lekagul waterways readings.csv contains 5 columns namely '''''Id, value, location, sample date, measure''''' and have 136924 rows. |
− | The second file, chemical units of measure.csv has two columns namely, measure | + | * The second file, chemical units of measure.csv has two columns namely, '''''measure and unit'''''. |
− | The third .txt file basically contains all the information about the data present in the other files. | + | * The third .txt file basically contains all the information about the data present in the other files. |
+ | |||
+ | * The last .jpg file contains the map of Boonsong Lekagul waterways. | ||
+ | |||
− | |||
<div style=background:#438787 border:#A3BFB1> | <div style=background:#438787 border:#A3BFB1> | ||
Line 63: | Line 66: | ||
To check the missing values, I have used JMP. | To check the missing values, I have used JMP. | ||
+ | |||
+ | [[Image:picture2_apa.png|700px]] | ||
As can be seen from the image above, there were no missing value present in given dataset. | As can be seen from the image above, there were no missing value present in given dataset. | ||
+ | |||
<div style=background:#438787 border:#A3BFB1> | <div style=background:#438787 border:#A3BFB1> | ||
<font size = 3; color="#FFFFFF">Getting the Map</font> | <font size = 3; color="#FFFFFF">Getting the Map</font> | ||
</div> | </div> | ||
+ | |||
After importing data file into tableau, we first import the image of the map and using annotate using points, I marked all the locations in the map and manually created a separate .xslx file: | After importing data file into tableau, we first import the image of the map and using annotate using points, I marked all the locations in the map and manually created a separate .xslx file: | ||
− | + | [[Image:picture3_apa.png|700px]] | |
Line 85: | Line 92: | ||
As we needed to join Location table, measure.csv and Boonsong Lekagul waterways readings.csv tables together, the tool used for this is Tableau: | As we needed to join Location table, measure.csv and Boonsong Lekagul waterways readings.csv tables together, the tool used for this is Tableau: | ||
− | + | [[Image:picture4_apa.png|700px]] | |
− | |||
− | |||
− |
Latest revision as of 20:35, 7 July 2018
VAST MINI CHALLENGE 2:
Like a Duck to Water
|
|
|
Methodology
We have been provided with three files along with a map of the Boonsong Lekagul preserve area. Where two files are .csv and one is .txt.
- The first file, Boonsong Lekagul waterways readings.csv contains 5 columns namely Id, value, location, sample date, measure and have 136924 rows.
- The second file, chemical units of measure.csv has two columns namely, measure and unit.
- The third .txt file basically contains all the information about the data present in the other files.
- The last .jpg file contains the map of Boonsong Lekagul waterways.
Checking Missing Values
To check the missing values, I have used JMP.
As can be seen from the image above, there were no missing value present in given dataset.
Getting the Map
After importing data file into tableau, we first import the image of the map and using annotate using points, I marked all the locations in the map and manually created a separate .xslx file:
And then create map by dragging and dropping the coordinates on the worksheet.
Joining the Tables
As we needed to join Location table, measure.csv and Boonsong Lekagul waterways readings.csv tables together, the tool used for this is Tableau: