Difference between revisions of "ISSS608 2017-18 T1 Assign WU YUQING Analysis & Solutions"

From Visual Analytics and Applications
Jump to navigation Jump to search
Line 45: Line 45:
 
=<font color=6A5ACD><font size=3><b>Question 2: Epidemic Spread</b></font></font>=
 
=<font color=6A5ACD><font size=3><b>Question 2: Epidemic Spread</b></font></font>=
 
<font color=6A5ACD>#2.1 Present a hypothesis on how the infection is being transmitted. For example, is the method of transmission person-to-person, airborne, waterborne, or something else? Identify the trends that support your hypothesis. (Please limit your answer to ten images and 1000 words.)</font><br>
 
<font color=6A5ACD>#2.1 Present a hypothesis on how the infection is being transmitted. For example, is the method of transmission person-to-person, airborne, waterborne, or something else? Identify the trends that support your hypothesis. (Please limit your answer to ten images and 1000 words.)</font><br>
 +
'''Hypothesis: The method of transmission is airborne, person-to-person and waterborne respectively at different outbreaks/periods, which is based on the following reasoning.'''<br>
 +
From previous analysis, we already know that the epidemic outbroke from May 18th onwards. Thus, firstly, based on the TOP three percentage of change of number of flu-related messages on an hourly basis from May 18th to May 20th as shown below, three relatively significant outbreaks can be detected. Then, these three outbreaks will be analysed respectively in the following sections to observe the spread of the epidemic and the transmission method.<br>
 +
[[Image:yqwu_3_outbreaks.png|500px]]<br>
 +
<font size=2><b>''Hourly Number of Flu-related Messages and Hourly Percentage of Change From May 18th to May 20th''</b></font><br>
 +
 +
From the graph above, we can find that these three outbreaks took place at the following three point of time:<br>
 +
• 08:00-09:00 May 18th, 2011<br>
 +
• 18:00-19:00 May 18th, 2011<br>
 +
• 02:00-03:00 May 19th, 2011<br>
 +
 +
'''# 1st outbreak: 08:00-09:00 May 18th, 2011:'''<br>
 +
[[Image:yqwu_1st_outbreak.png|border|600px]]<br>
 +
From the weather dataset, we know that the wind direction on May 18th is west. From the graph above, we can see that the flu-related messages suddenly increased significantly from 7am(to 8am) to 8am(to 9am) and the affected area expanded from Downtown to Eastside, which is completely consistent with the wind direction. Thus, we can infer that the method of transmission is airborne in this outbreak.<br>
 +
 +
'''#2nd outbreak: 18:00-19:00 May 18th, 2011:'''<br>
 +
[[Image:yqwu_2nd_outbreak.png|border|600px]]<br>
 +
From the graph above, we can see that the epidemic still concentrated on Downtown and Eastside at 5PM on May 18th but the epidemic suddenly outbroke again and has roughly spread to all directions at 6PM, which is not consistent with the wind direction. Obviously, in this outbreak, the transmission of the epidemic is not by wind.
 +
From the population statistics, we can know that Downtown is the most densely populated area in the day. And the time 5PM to 7PM is almost the time for people to get out of office to go back home. Thus, the movement of the population is very likely to prompt the epidemic spread to all directions. I suppose that the transmission is person-to-person in this outbreak.
 +
 +
'''#3rd outbreak: 02:00-03:00 May 19th, 2011:'''<br>
 +
[[Image:yqwu_3rd_outbreak.png|border|600px]]<br>
 +
From the graph above, we can see that the epidemic still concentrated on Downtown 1AM on May 19th but the epidemic spread to the river sides in Southern Westside and Plainville at 2AM, which is also obviously not consistent with the wind direction (WNW) on May 19th, 2011. We can easily see that the flu-related messages concentrated along the Vast River at 2AM.
 +
In addition, from the additional information, we can also know that the residents and businesses get their drinking water by pumping water from nearby reservoirs or rivers.
 +
Besides, from the word cloud of the messages as shown below, the word ‘stomach’ becomes very frequent on May 19th while its frequency is very low on May 18th (See the distribution below). From the further check in the text, ‘stomach’ indicated the ‘stomach ache’ in the messages (See the sample text below). And the word like ‘diarrhea’, ‘pneumonia’, ‘vomit’ has already started on May 19th but they are widely used on May 20th, that’s why they didn’t appear on the word  until May 20th(See the word cloud below).
 +
 
 +
 +
 +
From the following reasonsing, we can infer that people near the river sides drink the pathogen-polluted water from the river so that people along the riverside collectively had the stomach-related symptoms mentioned above. Thus, the infection in the third outbreak is transmitted by water.
 +
Overall, we can conclude that the infection is transmitted by wind, by water and person-to-person from the reasoning of all these three outbreaks above. However, there may be many other factors directly affecting the observed patterns mentioned above, which can lead to the wrong conclusions.
 +
 +
 +
 +
<font color=6A5ACD>#2.2 Is the outbreak contained? Is it necessary for emergency management personnel to deploy treatment resources outside the affected area? Explain your reasoning. (Please limit your answer to ten images and 1000 words. )</font><br>

Revision as of 12:08, 15 October 2017

Yqwu pic.jpg   Vast Challenge 2011 MC1: Characterization of an Epidemic Spread

Background

Data Description

Data Preparation

Analysis & Solutions

Feedback

 

Analysis & Solutions  Tool: JMP Pro & Tableau

Yqwu solutions.jpg


Question 1: Origin and Epidemic Spread

Identify approximately where the outbreak started on the map (ground zero location). Outline the affected area. Explain how you arrived at your conclusion. (Please limit your answer to six images and 500 words.)
Firstly, from the ‘preprocess.txt’, the messages containing symptoms are used to build the distribution of the number of flu-related messages by date as shown below. From the distribution, we can see that the number of flu-related messages on May 18th, May 19th, May 20th are significantly larger than the previous date. The number of flu-related messages peaked on May 19th. Thus, we can easily conclude that the epidemic outbroke on May 18th.
Yqwu flu-related Message Distribution.png
Distribution of Flu-related Messages By Date
To further identify the origin and the specific time of the outbreak, we can further check the geographical distribution of flu-related messages on May 18th by hour as shown below.
Yqwu May18 7AM.png
Yqwu May18 8AM.png
Yqwu May18 9AM.png
From the geographical distribution of flu-related messages (One yellow point stands for one flu-related message from one user) from 7am to 10am on May 18th, we can see that the number of flu-related microblogs suddenly increased significantly between 8am to 9am on May 18th and kept stable between 9am to 10am and these messages mainly concentrated in the zone of Downtown, Uptown and Eastside, especially the Downtown and southern Uptown. In addition, these messages seem that they concentrated near the Vastopolis Dome (one stadium), Vastopolis City Hospital and Convention Centre.
After the outbreak at 8am, we can find that the epidemic suddenly expanded to the Vast River’s both sides mainly in the zone of Southern Westside and Plainville at 2AM on May 19th, 2011 as shown below. This is another serious outbreak.
Yqwu May19 2AM.png
In this epidemic, the main affected areas have been highlighted in the red rectangle on May 18th and in the white rectangle on May 19th respectively above. The most affected area is Downtown and the river sides in Westside and Plainville.

Question 2: Epidemic Spread

#2.1 Present a hypothesis on how the infection is being transmitted. For example, is the method of transmission person-to-person, airborne, waterborne, or something else? Identify the trends that support your hypothesis. (Please limit your answer to ten images and 1000 words.)
Hypothesis: The method of transmission is airborne, person-to-person and waterborne respectively at different outbreaks/periods, which is based on the following reasoning.
From previous analysis, we already know that the epidemic outbroke from May 18th onwards. Thus, firstly, based on the TOP three percentage of change of number of flu-related messages on an hourly basis from May 18th to May 20th as shown below, three relatively significant outbreaks can be detected. Then, these three outbreaks will be analysed respectively in the following sections to observe the spread of the epidemic and the transmission method.
Yqwu 3 outbreaks.png
Hourly Number of Flu-related Messages and Hourly Percentage of Change From May 18th to May 20th

From the graph above, we can find that these three outbreaks took place at the following three point of time:
• 08:00-09:00 May 18th, 2011
• 18:00-19:00 May 18th, 2011
• 02:00-03:00 May 19th, 2011

# 1st outbreak: 08:00-09:00 May 18th, 2011:
Yqwu 1st outbreak.png
From the weather dataset, we know that the wind direction on May 18th is west. From the graph above, we can see that the flu-related messages suddenly increased significantly from 7am(to 8am) to 8am(to 9am) and the affected area expanded from Downtown to Eastside, which is completely consistent with the wind direction. Thus, we can infer that the method of transmission is airborne in this outbreak.

#2nd outbreak: 18:00-19:00 May 18th, 2011:
Yqwu 2nd outbreak.png
From the graph above, we can see that the epidemic still concentrated on Downtown and Eastside at 5PM on May 18th but the epidemic suddenly outbroke again and has roughly spread to all directions at 6PM, which is not consistent with the wind direction. Obviously, in this outbreak, the transmission of the epidemic is not by wind. From the population statistics, we can know that Downtown is the most densely populated area in the day. And the time 5PM to 7PM is almost the time for people to get out of office to go back home. Thus, the movement of the population is very likely to prompt the epidemic spread to all directions. I suppose that the transmission is person-to-person in this outbreak.

#3rd outbreak: 02:00-03:00 May 19th, 2011:
Yqwu 3rd outbreak.png
From the graph above, we can see that the epidemic still concentrated on Downtown 1AM on May 19th but the epidemic spread to the river sides in Southern Westside and Plainville at 2AM, which is also obviously not consistent with the wind direction (WNW) on May 19th, 2011. We can easily see that the flu-related messages concentrated along the Vast River at 2AM. In addition, from the additional information, we can also know that the residents and businesses get their drinking water by pumping water from nearby reservoirs or rivers. Besides, from the word cloud of the messages as shown below, the word ‘stomach’ becomes very frequent on May 19th while its frequency is very low on May 18th (See the distribution below). From the further check in the text, ‘stomach’ indicated the ‘stomach ache’ in the messages (See the sample text below). And the word like ‘diarrhea’, ‘pneumonia’, ‘vomit’ has already started on May 19th but they are widely used on May 20th, that’s why they didn’t appear on the word until May 20th(See the word cloud below).


From the following reasonsing, we can infer that people near the river sides drink the pathogen-polluted water from the river so that people along the riverside collectively had the stomach-related symptoms mentioned above. Thus, the infection in the third outbreak is transmitted by water. Overall, we can conclude that the infection is transmitted by wind, by water and person-to-person from the reasoning of all these three outbreaks above. However, there may be many other factors directly affecting the observed patterns mentioned above, which can lead to the wrong conclusions.


#2.2 Is the outbreak contained? Is it necessary for emergency management personnel to deploy treatment resources outside the affected area? Explain your reasoning. (Please limit your answer to ten images and 1000 words. )