Difference between revisions of "G1-Group01 Datacleaning"
(2 intermediate revisions by the same user not shown) | |||
Line 34: | Line 34: | ||
== Data Cleaning with R == | == Data Cleaning with R == | ||
− | As mentioned above, our team used R to clean up data for our analysis. Thus, we use one of age group as example. | + | As mentioned above, our team used R to clean up data for our analysis. Thus, we use one of age group as example.<br/> |
− | [[File:1.loading.jpg| | + | [[File:1.loading.jpg|600px]] <br/> |
− | [[File:2..jpg| | + | [[File:2.data process.jpg|600px]]<br/> |
− | [[File:Outputa1.jpg| | + | '''Output of data frames: a1/b1/c1/d1/e1, please refer to the pictures below'''<br/> |
− | [[File:Outputb1.jpg| | + | |
− | [[File:Outputc1.jpg| | + | [[File:Outputa1.jpg|600px]] <br/> |
− | [[File:Outputd1.jpg| | + | '''Data Frame: a1''' <br/> |
− | [[File:Outpute1.jpg| | + | [[File:Outputb1.jpg|600px]] <br/> |
+ | '''Data Frame: b1''' <br/> | ||
+ | [[File:Outputc1.jpg|600px]] <br/> | ||
+ | '''Data Frame: c1''' <br/> | ||
+ | [[File:Outputd1.jpg|400px]] <br/> | ||
+ | '''Data Frame d1''' <br/> | ||
+ | [[File:Outpute1.1.jpg|600px]]<br/> | ||
+ | '''Data Frame e1''' <br/> |
Latest revision as of 21:11, 23 November 2019
Due to some of files were not in the proper format for our analysis.
Thus, our team used R and tableau for our data cleaning and data Visualization. However, in this section, we will only share how we use the R code to "Geocode" and "data clean".
Geocode
Our group used R package "ggmap" and the function "geocode". And the coordinate it has generated will be base on projection 4326. Thus we need to export the file again to become projection 3414 for our further analysis. Requirement of using ggmap:
1. "ggmap" must be installed in your R studio
2. Must have google API in order to require information from google
Data Cleaning with R
As mentioned above, our team used R to clean up data for our analysis. Thus, we use one of age group as example.
Output of data frames: a1/b1/c1/d1/e1, please refer to the pictures below
Data Frame: a1
Data Frame: b1
Data Frame: c1
Data Frame d1
Data Frame e1