Difference between revisions of "ISSS608 2016-17 T1 Project Team 8"
| Line 24: | Line 24: | ||
| * value - The observation indicated for the specific 'report_date', 'location', 'data_field' and when appropriate, 'time_period'. | * value - The observation indicated for the specific 'report_date', 'location', 'data_field' and when appropriate, 'time_period'. | ||
| * unit - The unit of measurement for the 'data_field'. This should conform to the 'data_field' unit options as described in the country-specific data guide. | * unit - The unit of measurement for the 'data_field'. This should conform to the 'data_field' unit options as described in the country-specific data guide. | ||
| + | |||
| + |  https://www.cdc.gov/zika/geo/index.html | ||
| + | |||
| + | In addition, the data source above can provide local zika cases in Singapore. | ||
| == Group Member == | == Group Member == | ||
Revision as of 17:22, 17 October 2016
|  |  |  |  | 
Motivation
Zika virus was first discovered in 1947 and is named after the Zika Forest in Uganda. In 1952, the first human cases of Zika were detected and since then, outbreaks of Zika have been reported in tropical Africa, Southeast Asia, and the Pacific Islands. Zika outbreaks have probably occurred in many locations. Before 2007, at least 14 cases of Zika had been documented, although other cases were likely to have occurred and were not reported. Because the symptoms of Zika are similar to those of many other diseases, many cases may not have been recognized.
In Singapore, the first case was found in August 2016. Within 2 months, there are more than 400 cases identified locally. In this project, we will exam the spread pattern of the Zika virus, all over the world, to check out whether there is any correlation between the weather or geolocation and the spread of this virus leveraging data visualization. After that, we would also dig deeper in terms of the local cases in Singapore to figure out how can we take some appropriate action to prevent the spread of Zika in Singapore.
Data-set
https://www.kaggle.com/cdc/zika-virus-epidemic
This dataset shares publicly available data related to the ongoing Zika epidemic. It is being provided as a resource to the scientific community engaged in the public health response
The dataset includes the following fields:
- report_date - The report date is the date that the report was published. The date should be specified in standard ISO format (YYYY-MM-DD).
- location - A location is specified for each observation following the specific names specified in the country place name database. This may be any place with a 'location_type' as listed below, e.g. city, state, country, etc. It should be specified at up to three hierarchical levels in the following format: [country]-[state/province]-[county/municipality/city], always beginning with the country name. If the data is for a particular city, e.g. Salvador, it should be specified: Brazil-Bahia-Salvador.
- location_type - A location code is included indicating: city, district, municipality, county, state, province, or country. If there is need for an additional 'location_type', open an Issue to create a new 'location_type'.
- data_field - The data field is a short description of what data is represented in the row and is related to a specific definition defined by the report from which it comes.
- data_field_code - This code is defined in the country data guide. It includes a two letter country code (ISO-3166 alpha-2, list), followed by a 4-digit number corresponding to a specific report type and data type.
- time_period - Optional. If the data pertains to a specific period of time, for example an epidemiological week, that number should be indicated here and the type of time period in the 'time_period_type', otherwise it should be NA.
- time_period_type - Required only if 'time_period' is specified. Types will also be specified in the country data guide. Otherwise should be NA.
- value - The observation indicated for the specific 'report_date', 'location', 'data_field' and when appropriate, 'time_period'.
- unit - The unit of measurement for the 'data_field'. This should conform to the 'data_field' unit options as described in the country-specific data guide.
https://www.cdc.gov/zika/geo/index.html
In addition, the data source above can provide local zika cases in Singapore.
Group Member
YE JIATAO jiatao.ye.2015@mitb.smu.edu.sg YANG YUWEI yuwei.yang.2015@mitb.smu.edu.sg CHEN YIFAN yifan.chen.2015@mitb.smu.edu.sg


