Wyz-Visualization & Insights

From Visual Analytics and Applications
Revision as of 16:37, 14 October 2017 by Yizhou.wang.2016 (talk | contribs)
Jump to navigation Jump to search

Infectious disease.jpg Vast2011 MC1: Characterization of an Epidemic Spread

Background

Methodology

Visualization & Insights

Feedback

 


Visualization & Insights

Origin and Epidemic Spread

Identify approximately where the outbreak started on the map (ground zero location). Outline the affected area. Explain how you arrived at your conclusion.
After a quick browse around the dataset, we can find that the content of over 1M microblog messages contains all aspects of life. So first, we start by extracting flu-related microblog messages. A list of keywords was selected to filter raw data. The list consists of keywords including flu, chill, fever, sweat, ache, pain, fatigue, cough, breath, nausea, vomit, diarrhea, and lymph, which are from observed symptoms and human judgement. After filtering, there are 71939 flu-related microblog messages left.

Visualization Insights
Wyz flu-related microblog messages day.png
Infer the disease outbreak date


The line chart is to show daily total flu-related microblog messages trend. It can be seen from the graph that there is a significant rise on May 17th.

Wyz -map of flu-related microblog messages17.png
Map of flu-related microblog messages on May 17


The few reports were scattered over the map.

Wyz -map of flu-related microblog messages18.png
Map of flu-related microblog messages on May 18


From the geographical map, it is clear to see that the disease broke out in the Downtown and then spread to Eastside.

Wyz -map of flu-related microblog messages19.png
Map of flu-related microblog messages on May 17


On May 19th, the disease had spread throughout the downstream of the Vast river.

Wyz -magnification of ground zero location.png
Magnification of ground zero location, May 18


If we zoom in, we can deduce from this picture that first 'ground zero' was in Downtown, around the Dome, City Hospital, and Convention Center.

Epidemic Spread

Mode of transmission

Present a hypothesis on how the infection is being transmitted. For example, is the method of transmission person-­to­-person, airborne, waterborne, or something else? Identify the trends that support your hypothesis.

Person-­to­-person

Considering that most of the flu is spread by person-­to­-person, we will focus on this route of transmission first.

Visualization Insights
Wyz flu-related microblog messages hour.png
Number of flu-related microblog messages breakdown by hour, May 18 2011


As above, we assumed that the disease broke out on the May 18th. On that day, most of conversations started from 7 AM and suddenly rose at 5 PM while reaching a peak at 6 PM. The period just fit the work time.

Wyz-map of flu-related microblog messages 18 work status.png
Map of flu-related microblog messages by work status


As is shown in the diagram, blue points stand for people location during work time from 7 am to 5 pm while yellow points represent the rest of the day. There is an intensive blue part in the downtown area during work time, which facilitates the spread of the disease.

Airborne

Visualization Insights
Wyz-changes in human traffic.png
Changes in human traffic during the day and night


However, by comparing the changes in human traffic during the day and night among 13 city zones. There is no significant difference on change of human traffic between Eastside which is hard hit and other zones like Suburbia, Plainville, Southville. The weather information that it was west wind also helped diffusion of pathogen.

Waterborne

Visualization Insights
Wyz-flu-related microblog messages symptom.png
Changes in human traffic during the day and night


What is also noticeable is that the areas affected spread to the downstream of Vast river on May 19th. What’s more, we discovered that the posts related to gastrointestinal problems increased gradually. The bar chart gives a breakdown of the different symptoms reported in microblog from May 17th to May 20th. It is apparently manifest from the graph that the posts related to gastrointestinal issues experienced a dramatic increase over the period from May 19th to May 20th. From the map, the reports are dense in the lower reaches of the Vast river.

Wyz-wordcloud.png
Wordcloud by term frequency


The word cloud shows the flu-related words with high frequency in microblogs of 19th and 20th. Highlighted words "stomach" and "diarrhea" which describe abdominal problems hadn't appeared until 19th.

Wyz-diarrheatext.png
Wordcloud by term frequency


To go further about this issue, the messages containing abdominal problems related words are extracted, which is shown in the screenshot to see the details.