Dangy Proposal

From Geospatial Analytics and Applications
Jump to navigation Jump to search


HOME

PROPOSAL

POSTER

APPLICATION

RESEARCH PAPER


Introduction

Dengue fever has for centuries been a prominent epidemic disease that plagued humanity. While normal dengue infections take approximately a week to recover, complications like dengue haemorrhagic fever and dengue shock syndrome can be extremely severe, causing death. Today, even with our advanced healthcare and technology, there remains no proper cure or vaccine to combat the disease. This has allowed dengue fever to stay rampant in both developed and developing countries.

Consider Taiwan, a country that has proven itself through its evident economic growth and development. Even with healthcare services that meets international standards, Taiwan continue to stay susceptible to dengue fever. In fact, One of Taiwan’s health catastrophe was the 2015 dengue outbreak. From 2015 to 2016, there were 15,732 DF cases reported. Then, amongst the reported cases, 136 of them resulted in dengue haemorrhagic fever (DHF), of which 20 patients died.

We can see that dengue fever cannot be underestimated. Like any other epidemical diseases, it can have a ripple‐effect on its transmission resulting to an exponential increase of cases. Maintenance and prevention should always be done in order to reduce the possibility of its spread. This is especially so, considering how there are no vaccine or specific therapy for dengue fever. This leaves us with implementing effective control measures to combat the disease, which is what this study is about: to find potential areas where control measures can be implemented.

Project Objective

Our project goal is to study the possible spreading pattern of dengue fever and offer potential countermeasures to contain its spread. Previous researches on dengue fever generally studied factors contributing to the breeding of Aedes mosquitoes or the disease’s outbreak. To achieve this, most researchers have delved into spatial analysis, utilizing models such as Geographical Weighted Regression (GWR), Moran I and Geary C statistics. To enhance their spatial analyses, some have also combined their studies with temporal analysis to identify the patterns of dengue fever’s outbreak. However, these researches conducted their temporal analysis at a large timeframe, which provided an overview of the distribution of dengue cases in a region and not how the disease spread. Though managing the disease's origin is important, it is also important for us to learn more about how to contain the spread of the disease when it occurs.

In the previous section, we mentioned briefly about Taiwan. Data about dengue fever in Taiwan is readily available for analysis. Also, considering how Taiwan has a good mix of different settlements and terrains, it is a good case study for us to conduct our research. To scope our research, we will be focusing on the major dengue outbreak that has happened in 2015.


Data Sources
Data Source Remarks
Taiwan Main Island (level I administrative boundaries) data.gov.tw Municipality, county and city level I administrative boundaries of the main Taiwan island retrieved from the Ministry of Interior Land Surveying and Mapping Center.
Taiwan Boundaries (level III administrative boundaries) Map Store Municipality, county and city level III administrative boundaries of the greater Taiwan region retrieved from the National Land Surveying and Mapping Center (Ministry of the Interior)
Taiwan's Daily confirmed dengue cases Data Portal Dengue daily confirmed cases since 1998 retrieved from the Taiwan CDC (Centers for Disease Control)
Academic References


Data preparation

Data extracted directly from the various sources is mostly in CSV and GEOJSON format. One key challenge to data manipulation was the translation of chinese characters and also its accuracy.

Translation of Chinese Characters
We took the initial step to translate the JSON files directly with Google Translate. However, we found that this would alter the original structure of GEOJSON where there was missing parenthesis. Hence we took an alternative approach of using existing python library such as googletrans. Unfortunately, we encountered limitations such as character limit of 15,000.

We finalised with a safer approach using writing our own python script. We utilised selenium module to automate the process of inputting raw content directly into google translation engine and outputting them into proper JSON or CSV data structures.

Accuracy of Translation
Google translation engine does not offer translation for every word in our JSON data files. Our teams encounter a few words without translation after running the script. Hence manual translation is necessary.

Taiwan geographical data we sourced have slightly different county namings from the google translations we received. For example, Google Translation offers translation of “Taipei City” while Taiwan geographical data contains only “Taipei”. Hence further data transformation is required to standardise the county namings. Our team creates a dictionary to store words which involve translation discrepancies and replace the word using vlookup in Excel.


Project Prototype
Paper Prototype 1.jpg