Difference between revisions of "ISSS608 2017-18 T1 Assign ZHANG Lidan Data Preparation"
Jump to navigation
Jump to search
Line 33: | Line 33: | ||
* In addition, to exclude the irrelevant information, I create the subset dataset which consists of main flulike symptoms, such as '''''chill, flu, fever, sweat, pain, fatigue, ache, cough, breath, nausea, vomit, diarrhea'''''. Here, I use the Text Explorer in JMP to generate these new columns. | * In addition, to exclude the irrelevant information, I create the subset dataset which consists of main flulike symptoms, such as '''''chill, flu, fever, sweat, pain, fatigue, ache, cough, breath, nausea, vomit, diarrhea'''''. Here, I use the Text Explorer in JMP to generate these new columns. | ||
[[File:wordget.PNG|500px]]<br/><br/> | [[File:wordget.PNG|500px]]<br/><br/> | ||
− | [[File: | + | [[File:Symptoms.PNG|600px]] |
Latest revision as of 15:25, 16 October 2017
|
|
|
|
Data Preparation
- Import the microblog data set into the JMP 13
- Exclude 48 rows of missing text
- Separate the location into longitude and latitude through Word function
- Because these locations are at the western hemisphere, change the longitude coordinates into negative value by Num function
- In addition, to exclude the irrelevant information, I create the subset dataset which consists of main flulike symptoms, such as chill, flu, fever, sweat, pain, fatigue, ache, cough, breath, nausea, vomit, diarrhea. Here, I use the Text Explorer in JMP to generate these new columns.