Difference between revisions of "ISSS608 2017-18 T1 Assign XING SIYUAN Data Preparation"
(One intermediate revision by the same user not shown) | |||
Line 34: | Line 34: | ||
</tr> | </tr> | ||
<tr> | <tr> | ||
− | <td><b> 1.Data Cleaning</b> | + | <td width=40%><b> 1.Data Cleaning</b> |
<br>Tools: JMP | <br>Tools: JMP | ||
<br>Method: | <br>Method: | ||
Line 42: | Line 42: | ||
<br>4. Export table into CSV format. | <br>4. Export table into CSV format. | ||
</td> | </td> | ||
− | <td>[[File:SY_Clean_data.png|500px|center]]</td> | + | <td width=60%>[[File:SY_Clean_data.png|500px|center]]</td> |
</tr> | </tr> | ||
Line 125: | Line 125: | ||
<br>Use Tableau to filter the macroblogs down to those posted on 17 May; there seems to be an unusual density change at the intersection of the red road and the Vast River. Select the points around that area and create a set called event. Export the user IDs of the event set. | <br>Use Tableau to filter the macroblogs down to those posted on 17 May; there seems to be an unusual density change at the intersection of the red road and the Vast River. Select the points around that area and create a set called event. Export the user IDs of the event set. | |
<br> | <br> | ||
− | <br>In JMP, join the event table with macroblogs table. Filter out the row that are being posted on 17 May. From the output of text explorer, it seems that a terrible truck accident happened on 17 May around the intersection of Westside, Northville, Downtown and Plainville and a music festival happened in | + | <br>In JMP, join the event table with the macroblogs table. Keep only the rows posted on 17 May. From the output of Text Explorer, it seems that a terrible truck accident happened on 17 May around the intersection of Westside, Northville, Downtown and Plainville. |
− | + | <br>To further explore the major events that happened in Smartpolis, we ran a text analysis on the macroblogs posted in each week (as shown on the left). On 6 May, a car accident happened along the boundary of Riverside and Suburbia. On 7 May, a music festival was held in the Downtown area. On 17 May, a truck accident happened at the intersection of road 610 and the Vast River. The first two events took place well before the outbreak of the epidemic, so they should have had no effect on its spread. |
</td> | </td> | ||
<td> | <td> | ||
− | [[File:SY_18.png| | + | |
− | [[File: | + | |
+ | Event detection: | ||
+ | {| class="wikitable" | ||
+ | |- | ||
+ | ! Group !! Text Explorer | ||
+ | |- | ||
+ | | [[File:SY_18.png|300px|center]] || [[File:SY_17.png|300px|center]] | ||
+ | |} | ||
+ | |||
+ | Weekly keyword exploration: | ||
+ | {| class="wikitable" | ||
+ | |- | ||
+ | ! Week 1 !! Week 2 !! Week 3 | ||
+ | |- | ||
+ | | [[File:week1.png|200px|center]] || [[File:week2.png|200px|center]] || [[File:week3.png|200px|center]] | ||
+ | |} | ||
+ | |||
+ | Key Events | ||
+ | {| class="wikitable" | ||
+ | |- | ||
+ | ! Music Festival !! Car Accident !! Truck Accident | ||
+ | |- | ||
+ | | 7 May || 6 May || 17 May | ||
+ | |- | ||
+ | | [[File:SY_26.png|200px|center]] || [[File:SY_27.png|200px|center]] || [[File:SY_28.png|200px|center]] | ||
+ | |} | ||
</td> | </td> | ||
</tr> | </tr> |
Latest revision as of 20:53, 16 October 2017
Data Preparation
Description | Illustration | |||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1.Data Cleaning
|
||||||||||||||||||||
2.Identify infected patients
|
Heatmap of number of macroblogs by day: Macroblogs distribution on the last day: |
|||||||||||||||||||
3.Identify Symptom of Infected Patients - Data Preparation
|
Word detection:
Words table:
Check Symptoms:
|
|||||||||||||||||||
4.Identify Origin of the Epidemic - Data Visualization
|
Visualization:
|
|||||||||||||||||||
5.Identify Major Events in Smartpolis
|
Weekly keyword exploration:
Key Events
|
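The event-detection steps in section 5 were carried out in Tableau and JMP. For readers who want to reproduce the same logic in a script, the following is a minimal Python sketch, not the method actually used. It assumes the cleaned export has `user_id`, `created_at` and `text` fields; those column names, the sample rows, the year 2011 and the stopword list are all hypothetical placeholders. It joins the exported event user IDs to the macroblogs, keeps the posts from one day, and counts the most frequent words per week.

```python
import re
from collections import Counter, defaultdict
from datetime import date, datetime

# Hypothetical sample rows; the real data would come from the cleaned CSV export.
# The field names and the year are assumptions made for illustration only.
MACROBLOGS = [
    {"user_id": "u1", "created_at": "2011-05-17 09:10", "text": "Truck accident near the river, traffic stopped"},
    {"user_id": "u2", "created_at": "2011-05-17 10:00", "text": "Lovely weather today"},
    {"user_id": "u3", "created_at": "2011-05-07 20:30", "text": "Great music festival downtown"},
    {"user_id": "u1", "created_at": "2011-05-06 08:00", "text": "Car accident this morning"},
]


def parse_time(row):
    """Parse the (assumed) 'YYYY-MM-DD HH:MM' timestamp of a macroblog row."""
    return datetime.strptime(row["created_at"], "%Y-%m-%d %H:%M")


def posts_by_event_users(macroblogs, event_user_ids, day):
    """Inner-join macroblogs with the event user IDs and keep posts from one day."""
    return [
        row for row in macroblogs
        if row["user_id"] in event_user_ids and parse_time(row).date() == day
    ]


def top_words_by_week(macroblogs, stopwords, n=10):
    """Count words per ISO week and return the n most frequent words for each week."""
    counts = defaultdict(Counter)
    for row in macroblogs:
        week = parse_time(row).isocalendar()[1]  # ISO week number
        words = re.findall(r"[a-z']+", row["text"].lower())
        counts[week].update(w for w in words if w not in stopwords)
    return {week: counter.most_common(n) for week, counter in counts.items()}


if __name__ == "__main__":
    event_users = {"u1", "u3"}  # stands in for the user IDs exported from the event set
    for row in posts_by_event_users(MACROBLOGS, event_users, date(2011, 5, 17)):
        print(row["user_id"], row["text"])
    stopwords = {"the", "near", "this", "today"}
    for week, words in sorted(top_words_by_week(MACROBLOGS, stopwords).items()):
        print(week, words)
```

A plain word count is a much cruder tool than JMP's Text Explorer, so the weekly lists it produces are only a rough starting point for spotting events such as the accidents and the music festival.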