Difference between revisions of "ISSS608 2017-18 T1 Assign ZHANG Lidan Data Preparation"
Jump to navigation
Jump to search
Line 24: | Line 24: | ||
<br/> | <br/> | ||
==Data Preparation== | ==Data Preparation== | ||
− | + | I import the microblog data set into the JMP at first. | |
− | + | Firstly, I exclude 48 rows of missing text. | |
− | + | Next, I separate the location into longitude and latitude through Word function. | |
− | + | Then, because these locations are at the western hemisphere, I change the longitude coordinates into negative value by Num function. | |
+ | In addition, to exclude the irrelevant information, I create the subset dataset which consists of main flulike symptoms, such as chill, flu, fever, sweat, pain, fatigue, ache, cough, breath, nausea, vomit, diarrhea. Here, I use the Text Explorer in JMP to generate these new columns. | ||
+ | |||
[[File:1.png|600px|center]] | [[File:1.png|600px|center]] |
Revision as of 16:01, 15 October 2017
|
|
|
|
Data Preparation
I import the microblog data set into the JMP at first. Firstly, I exclude 48 rows of missing text. Next, I separate the location into longitude and latitude through Word function. Then, because these locations are at the western hemisphere, I change the longitude coordinates into negative value by Num function. In addition, to exclude the irrelevant information, I create the subset dataset which consists of main flulike symptoms, such as chill, flu, fever, sweat, pain, fatigue, ache, cough, breath, nausea, vomit, diarrhea. Here, I use the Text Explorer in JMP to generate these new columns.