Difference between revisions of "ISSS608 2017-18 T1 Assign ZHANG Lidan Data Preparation"

From Visual Analytics and Applications
Jump to navigation Jump to search
Line 24: Line 24:
 
<br/>
 
<br/>
 
==Data Preparation==
 
==Data Preparation==
I import the microblog data set into the JMP at first.
+
* Import the microblog data set into the JMP 13<br/>
Firstly, I exclude 48 rows of missing text.
+
* Exclude 48 rows of missing text<br/>
Next, I separate the location into longitude and latitude through Word function.  
+
[[File:Missingdata.PNG|400px]]
Then, because these locations are at the western hemisphere, I change the longitude coordinates into negative value by Num function.
+
* Separate the location into longitude and latitude through Word function<br/>
In addition, to exclude the irrelevant information, I create the subset dataset which consists of main flulike symptoms, such as chill, flu, fever, sweat, pain, fatigue, ache, cough, breath, nausea, vomit, diarrhea. Here, I use the Text Explorer in JMP to generate these new columns.
+
[[File:Location.PNG|300px]]
 +
* Because these locations are at the western hemisphere, change the longitude coordinates into negative value by Num function<br/>
 +
* In addition, to exclude the irrelevant information, I create the subset dataset which consists of main flulike symptoms, such as '''''chill, flu, fever, sweat, pain, fatigue, ache, cough, breath, nausea, vomit, diarrhea'''''. Here, I use the Text Explorer in JMP to generate these new columns.
  
 
[[File:1.png|600px|center]]
 
[[File:1.png|600px|center]]

Revision as of 17:51, 15 October 2017

Title momo.png Epidemic Spread in Smartpolis

Background

Data Preparation

Data Visualization

Conclusion

 


Data Preparation

  • Import the microblog data set into the JMP 13
  • Exclude 48 rows of missing text

Missingdata.PNG

  • Separate the location into longitude and latitude through Word function

Location.PNG

  • Because these locations are at the western hemisphere, change the longitude coordinates into negative value by Num function
  • In addition, to exclude the irrelevant information, I create the subset dataset which consists of main flulike symptoms, such as chill, flu, fever, sweat, pain, fatigue, ache, cough, breath, nausea, vomit, diarrhea. Here, I use the Text Explorer in JMP to generate these new columns.
1.png