Difference between revisions of "Group15 proposal"

From ISSS608-Visual Analytics and Applications
Jump to navigation Jump to search
 
(8 intermediate revisions by 2 users not shown)
Line 2: Line 2:
 
<!--MAIN HEADER -->
 
<!--MAIN HEADER -->
 
<div style="background-color:#212d41; text-align:center">
 
<div style="background-color:#212d41; text-align:center">
[[File:Group15 banner.png|650px|frameless|center]]
+
[[File:Group15 banner.png|1600px|frameless|center]]
<font size = 5; color = #ffffff><span style="font-family:Helvetica;"><b>COVID-19 Visualization for China</b><br></span></font>
 
 
<br><br>
 
<br><br>
 
</div>
 
</div>
Line 19: Line 18:
 
| style="font-family:Helvetica; font-size:100%; solid #103080; background:#a8a8a8; text-align:center;" width="20%" |  
 
| style="font-family:Helvetica; font-size:100%; solid #103080; background:#a8a8a8; text-align:center;" width="20%" |  
 
;
 
;
[[Group15_application| <font color="#FFFFFF">Application</font>]]
+
[[Group15_application&user-guide| <font color="#FFFFFF">Application & User Guide</font>]]
  
 
| style="font-family:Helvetica; font-size:100%; solid #103080; background:#a8a8a8; text-align:center;" width="20%" |  
 
| style="font-family:Helvetica; font-size:100%; solid #103080; background:#a8a8a8; text-align:center;" width="20%" |  
Line 35: Line 34:
 
== <big>Overview</big> ==
 
== <big>Overview</big> ==
  
Since December 2019, new cases of COVID-19 have been discovered in Wuhan, it has spread to all over China and other countries. As of 2019-03-01, a total of 79,971 cases have been diagnosed and a total of 2,873 deaths have been confirmed in China. The situation in the rest of the world is not optimistic,  7,277 cases have been confirmed and total 18 deaths occurred in other countries. Its widespread and rapid speed is alarming. The 2019 nCov has caused a certain impact on the economy of China and even countries around the world, even has been taken seriously by WHO. In order to study trends in number of diagnoses, number of suspected patients, and deaths over time for different areas, we use the 2019 nCov data for further analysis and visualisation.
+
Since December 2019, new cases of COVID-19 have been discovered in Wuhan, it has spread to all over China and other countries. As of 2019-03-01, a total of 79,971 cases have been diagnosed and a total of 2,873 deaths have been confirmed in China. The situation in the rest of the world is not optimistic,  7,277 cases have been confirmed and the total 18 deaths occurred in other countries. Its widespread and rapid speed is alarming. The 2019 nCov has caused a certain impact on the economy of China and even countries around the world, even has been taken seriously by WHO. In order to study trends in the number of diagnoses, the number of suspected patients, and deaths over time for different areas, we use the 2019 nCov data for further analysis and visualization.
  
 
== <big>Project Objectives</big> ==
 
== <big>Project Objectives</big> ==
Line 44: Line 43:
 
2)Analyze trends in new diagnoses and mortality rates across the country and provinces.
 
2)Analyze trends in new diagnoses and mortality rates across the country and provinces.
  
3)Using multiple measurement methods to display epidemic data maps-based on nuclear probability density, based on clustering,etc.
+
3)Using multiple measurement methods to display epidemic data maps-based on nuclear probability density, based on clustering, etc.
  
 
== <big>Data Source</big> ==
 
== <big>Data Source</big> ==
Since the outbreak of the new coronavirus pneumonia, many agencies / portals (Clove Garden, Tencent, Sina, ...) have provided pages for "real-time tracking of the epidemic", which mainly displays the number and area of suspected, confirmed, cured, and dead cases distributed, the data used in this project is collected from the "Clove Garden "website.
+
Since the outbreak of the new coronavirus pneumonia, many agencies/portals (Clove Garden, Tencent, Sina, ...) have provided pages for "real-time tracking of the epidemic", which mainly displays the number and area of suspected, confirmed, cured, and dead cases distributed. At the same time, some people collect data from various websites for consolidation to facilitate further analysis by others.
  
The data is from the website below. It includes Province name(Chinese and English), City name(Chinese and English), zipcode, province confirmed count, province suspected count, province cured count, province dead count, city confirmed count, city suspected count, city cured count, city dead count and update time.
+
The data we used in this project is collected from the Github website. The link is as shown below.
  
https://ncov.dxy.cn/ncovh5/view/pneumonia
+
https://github.com/GuangchuangYu/nCov2019
  
 
== <big>Methodology</big> ==
 
== <big>Methodology</big> ==
  
(1)Data Exploration and Visualization: We will make full use of the dataset to visualize the data in different ways. Maps is the most intuitive way to show the spread and control of the epidemic. Line charts are the best way to show data trends and forecast data.
+
(1)Data Exploration and Visualization: We will make full use of the dataset to visualize the data in different ways. Maps are the most intuitive way to show the spread and control of the epidemic. Line charts are the best way to show data trends and forecast data.
  
(2)Clustering: Cluster the city and pations according to the daily addtion rate, cured rate and dead rate.
+
(2)Clustering: Cluster the city and cases according to the daily addition rate, cured rate, and death rate.
  
 
(3)Time Series Analytics: Based on the current data to predict the future trend for every single city.
 
(3)Time Series Analytics: Based on the current data to predict the future trend for every single city.
Line 64: Line 63:
 
Tools: Tableau; R-shiny
 
Tools: Tableau; R-shiny
  
Packages:
+
Packages: magick, dplyr, ggplot2, shinydashboard, forcats, mapproj, etc.

Latest revision as of 23:55, 26 April 2020

Group15 banner.png



Proposal

Poster

Application & User Guide

Research Paper

Back to Main


Overview

Since December 2019, new cases of COVID-19 have been discovered in Wuhan, it has spread to all over China and other countries. As of 2019-03-01, a total of 79,971 cases have been diagnosed and a total of 2,873 deaths have been confirmed in China. The situation in the rest of the world is not optimistic, 7,277 cases have been confirmed and the total 18 deaths occurred in other countries. Its widespread and rapid speed is alarming. The 2019 nCov has caused a certain impact on the economy of China and even countries around the world, even has been taken seriously by WHO. In order to study trends in the number of diagnoses, the number of suspected patients, and deaths over time for different areas, we use the 2019 nCov data for further analysis and visualization.

Project Objectives

We plan to use the 2019 nCov data to:

1)Analyze trends in cumulative cure rates and mortality across the country and provinces.

2)Analyze trends in new diagnoses and mortality rates across the country and provinces.

3)Using multiple measurement methods to display epidemic data maps-based on nuclear probability density, based on clustering, etc.

Data Source

Since the outbreak of the new coronavirus pneumonia, many agencies/portals (Clove Garden, Tencent, Sina, ...) have provided pages for "real-time tracking of the epidemic", which mainly displays the number and area of suspected, confirmed, cured, and dead cases distributed. At the same time, some people collect data from various websites for consolidation to facilitate further analysis by others.

The data we used in this project is collected from the Github website. The link is as shown below.

https://github.com/GuangchuangYu/nCov2019

Methodology

(1)Data Exploration and Visualization: We will make full use of the dataset to visualize the data in different ways. Maps are the most intuitive way to show the spread and control of the epidemic. Line charts are the best way to show data trends and forecast data.

(2)Clustering: Cluster the city and cases according to the daily addition rate, cured rate, and death rate.

(3)Time Series Analytics: Based on the current data to predict the future trend for every single city.

Tools&Packages

Tools: Tableau; R-shiny

Packages: magick, dplyr, ggplot2, shinydashboard, forcats, mapproj, etc.