Difference between revisions of "ISSS608 2016-17 T3 Assign Chan En Ying Grace Methodology"

From Visual Analytics and Applications
Jump to navigation Jump to search
Line 65: Line 65:
  
 
<br/>
 
<br/>
 +
 +
==<font size="5"><font color="#000000">'''Approach Taken'''</font></font>==
 +
 +
The following outlines the approach used for the analysis.
 +
 +
<div style="margin:0px; padding: 2px; background: #E6E6FA; font-family: Arial; border-radius: 1px; text-align:left">
 +
{| class="wikitable" style="background-color:#FFFFFF;" width="100%"
 +
|-
 +
|
 +
<b>Step</b>
 +
||
 +
<b>Approach</b>
 +
||
 +
<b>Description</b>
 +
 +
|-
 +
|
 +
1.
 +
||
 +
<b>Data Understanding</b>
 +
||
 +
<b>i. Read in Raster Layer (Lekagul Roadways Map)</b>
 +
<br> - It is a single layer raster file. 200x200.
 +
<br> class      : RasterLayer
 +
<br> dimensions  : 200, 200, 40000  (nrow, ncol, ncell)
 +
<br> resolution  : 1, 1  (x, y)
 +
<br> extent      : 0, 200, 0, 200  (xmin, xmax, ymin, ymax)
 +
<br> coord. ref. : NA
 +
<br> names      : Lekagul_Roadways_2018
 +
<br> values      : 0, 255  (min, max)
 +
 +
 +
<b>ii. Find out structure of Raster Layer</b>
 +
<br> Extent          : 40000
 +
<br> CRS arguments  : NA
 +
<br> File Size      : 41078
 +
<br> Object Size    : 14376 bytes
 +
<br> Layer          : 1
 +
|-
 +
|
 +
 +
2.
 +
||
 +
<b>Data Cleaning</b>
 +
||
 +
<b>i. Import two CSV Files (Birds)</b>
 +
* 2081 Training Birds (Metadata)
 +
* 15 Test Birds (Provided by Kasios)
 +
 +
 +
<b>ii. Fix Data Quality Issues</b>
 +
* Change File ID from numeric to character
 +
* Change coordinates to numeric
 +
* Change Date from Character to Date
 +
* Omit the two NA values for the Y coordinate.
 +
* Clean the Dates (All standardise to m/d/y. For missing month/year, I will replace with NA. For missing day, I will impute as 1st day of the month.)
 +
* Clean the Timing (Standardise all to 24 hour formatting. Use “.” instead of ":")
 +
 +
|}
 +
</div>
 +
<br>

Revision as of 20:49, 22 June 2018

Rose Pipits.png “Mine dear rose pipits, whence did do thou vanish?”

Background

Methodology

Did Rose Pipit kicketh the bucket?

Which song belongs to thee?

Conclusion

 


Tools

R is the primary tool used in this analysis. The following lists the packages used for the project’s scope - for data cleaning, data visualisation, geospatial analysis and audio processing.

  • R libraries
    • sp
    • rgdal
    • sf
    • raster
    • spatstat
    • maptools
    • gplots
    • ggplot2
    • ggmap
    • rasterVis
    • lattice
    • latticeExtra
    • tidyverse
    • zoo
    • tmap
    • reshape2
    • quantmod
    • ggTimeSeries
    • viridis
    • rlang
    • soundgen
    • tuneR
    • phonTools
    • seewave


Approach Taken

The following outlines the approach used for the analysis.

Step

Approach

Description

1.

Data Understanding

i. Read in Raster Layer (Lekagul Roadways Map)
- It is a single layer raster file. 200x200.
class : RasterLayer
dimensions : 200, 200, 40000 (nrow, ncol, ncell)
resolution : 1, 1 (x, y)
extent : 0, 200, 0, 200 (xmin, xmax, ymin, ymax)
coord. ref. : NA
names : Lekagul_Roadways_2018
values : 0, 255 (min, max)


ii. Find out structure of Raster Layer
Extent : 40000
CRS arguments : NA
File Size : 41078
Object Size : 14376 bytes
Layer : 1

2.

Data Cleaning

i. Import two CSV Files (Birds)

  • 2081 Training Birds (Metadata)
  • 15 Test Birds (Provided by Kasios)


ii. Fix Data Quality Issues

  • Change File ID from numeric to character
  • Change coordinates to numeric
  • Change Date from Character to Date
  • Omit the two NA values for the Y coordinate.
  • Clean the Dates (All standardise to m/d/y. For missing month/year, I will replace with NA. For missing day, I will impute as 1st day of the month.)
  • Clean the Timing (Standardise all to 24 hour formatting. Use “.” instead of ":")