ISSS608 2017-18 T1 Assign XU YANRU


ISS608_2017-18_T1_Assign_XU YANRU

Mini Challenge: Illness in Smartpolis

Background

Over the past few days, health professionals at Smartpolis hospitals have noticed a significant increase in reported illnesses. To assess whether an epidemic is developing in this major metropolitan area, city officials provided several datasets: collected microblog messages, a satellite map, and weather and population data for Smartpolis.

Data Preparation

1. Coordinate detail

Each microblog record contains the latitude and longitude of where the message was posted. However, the two values are stored in a single column, "Location". In JMP, this column is split into two numeric columns named "Latitude" and "Longitude", and the original "Location" column is hidden to avoid confusion.
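
The split itself was done in JMP. For readers working outside JMP, the same step might be sketched in Python as below. The separator (whitespace or a comma), the function name split_location, and the sample coordinate are assumptions made for illustration; they are not taken from the dataset.

```python
from typing import Tuple


def split_location(location: str) -> Tuple[float, float]:
    """Split a combined 'latitude longitude' string into two floats.

    Assumption: the two values are separated by whitespace and/or a comma.
    """
    parts = location.replace(",", " ").split()
    if len(parts) != 2:
        raise ValueError(f"expected two values, got: {location!r}")
    latitude, longitude = (float(p) for p in parts)
    return latitude, longitude


# Made-up sample value, only to show the call.
latitude, longitude = split_location("42.22717 93.33772")
print(latitude, longitude)
```

Raising an error on malformed values, rather than silently returning blanks, makes it easier to spot records whose location field needs manual cleaning.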

2. Symptom

The posts in the microblog are unstructured text. The Text Explorer platform in JMP is used to extract and analyze terms from these messages.

Text Explorer reveals the most frequently repeated terms and phrases. From this list, the phrases that clearly indicate illness are picked out and recorded in a new column, "Symptom".

  • Aching Muscles
  • Breathing (Including "shortness of breath")
  • Caught a fever
  • Caught a Pneumonia
  • Chill
  • Declining Health
  • Dry Cough
  • Hurt to Move
  • Running Nose
  • Sore Throat
  • Sick Sucks
  • Medicine Medicine
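
The tagging above was done in JMP. As a rough illustration of the idea, the following Python sketch assigns one of the listed phrases to a message. It assumes simple case-insensitive substring matching and returns the first phrase found; the names SYMPTOMS, ALIASES, and tag_symptom are hypothetical, and this is not the procedure JMP applies internally.

```python
from typing import Dict, List, Optional

# Symptom labels taken from the list above, in the same order.
SYMPTOMS: List[str] = [
    "Aching Muscles", "Breathing", "Caught a fever", "Caught a Pneumonia",
    "Chill", "Declining Health", "Dry Cough", "Hurt to Move",
    "Running Nose", "Sore Throat", "Sick Sucks", "Medicine Medicine",
]

# Extra phrases counted under a label ("Breathing" includes
# "shortness of breath", as noted in the list).
ALIASES: Dict[str, List[str]] = {"Breathing": ["shortness of breath"]}


def tag_symptom(message: str) -> Optional[str]:
    """Return the first symptom label whose phrase occurs in the message.

    Assumption: plain case-insensitive substring matching is good enough
    for a first pass; messages without any listed phrase return None.
    """
    text = message.lower()
    for label in SYMPTOMS:
        phrases = [label.lower()] + ALIASES.get(label, [])
        if any(phrase in text for phrase in phrases):
            return label
    return None


# Made-up messages, only to show the call.
print(tag_symptom("I have a sore throat today"))
print(tag_symptom("Nice weather in the park"))
```

Because matching is by substring, short phrases such as "chill" also match longer words like "chilling", so a manual review of the tagged rows would still be needed.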