Difference between revisions of "IS428 AY2018-19T1 Gokarn Malika Nitin"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
Line 168: Line 168:
 
=<div style="background: #581845; padding: 15px; line-height: 0.3em; text-indent: 15px; font-size:18px; font-family:Open Sans, Arial, sans-serif"><font color= #ffffff>Task 2: Spatio-temporal Analysis of Citizen Science Air Quality Measurements</font></div>=
 
=<div style="background: #581845; padding: 15px; line-height: 0.3em; text-indent: 15px; font-size:18px; font-family:Open Sans, Arial, sans-serif"><font color= #ffffff>Task 2: Spatio-temporal Analysis of Citizen Science Air Quality Measurements</font></div>=
  
<!--
 
 
==<div style="font-family:Open Sans, Arial, sans-serif;font-size:17px">Sensor's Coverage</div>==
 
==<div style="font-family:Open Sans, Arial, sans-serif;font-size:17px">Sensor's Coverage</div>==
I brought in the sensors for the Sofia region only, and use the Density Marks feature of Tableau 10.3 to plot the coverage of the sensors. I find that the sensors are generally placed in central Sophia, with lack of coverage in South Eastern and North Eastern regions. The number of sensors increases from 2017 to 2018, however, this is mainly concentrated in the central region, with little focus on the areas that lack coverage. Below is the coverage for the year 2018.
+
I brought in the sensors for the Sofia region only, and use the Density Marks feature of Tableau 10.3 to plot the coverage of the sensors. I find that the sensors are generally placed in central Sophia, with lack of coverage in South Eastern and North Eastern regions. The number of sensors increases from 2017 to 2018, however, this is mainly concentrated in the central region, with little focus on the areas that lack coverage. <br/>
{| class="wikitable" style="background-color:#FFFFFF;" width="100%"
+
Below is the coverage for the year 2017.
|- [[File:Coverage 2017.jpg|500px|left]] [[File:Coverage.jpg|500px|right]]
+
[[File:Coverage 2017.jpg|550px|center]]  
|}
+
Below is the coverage for the year 2018.
 +
[[File:Coverage.jpg|550px|center]]
  
 
<br/>
 
 
==<div style="font-family:Open Sans, Arial, sans-serif;font-size:17px">Sensors' Stable Operations</div>==
 
==<div style="font-family:Open Sans, Arial, sans-serif;font-size:17px">Sensors' Stable Operations</div>==
 
To understand whether the sensors are operating properly at all times, I looked into the time series data of the number of records being captured by the sensors over the hours. This shows that indeed the number of records and thereby sensors has increased over time. However, they are not working properly all the time, as shown by the sudden steep drop in the number of records collected and a sudden spike. Perhaps this is due to sensors suddenly not taking as many or taking too many records as compared to their configuration due to malfunction. A closer look at the time axis shows that these dips started in 2018. This could be due to the new sensors that were added in 2018. Perhaps certain members of the new group of sensors are not working correctly.
 
To understand whether the sensors are operating properly at all times, I looked into the time series data of the number of records being captured by the sensors over the hours. This shows that indeed the number of records and thereby sensors has increased over time. However, they are not working properly all the time, as shown by the sudden steep drop in the number of records collected and a sudden spike. Perhaps this is due to sensors suddenly not taking as many or taking too many records as compared to their configuration due to malfunction. A closer look at the time axis shows that these dips started in 2018. This could be due to the new sensors that were added in 2018. Perhaps certain members of the new group of sensors are not working correctly.
Line 182: Line 180:
  
 
==<div style="font-family:Open Sans, Arial, sans-serif;font-size:17px">Sensors' Performance Accuracy</div>==
 
==<div style="font-family:Open Sans, Arial, sans-serif;font-size:17px">Sensors' Performance Accuracy</div>==
 +
Checking the performance accuracy of the sensors requires some basic knowledge of the measures that are being acquired by the sensors. The three measures I am looking into here are temperature, pressure and humidity.
 +
# <b>Temperature in Bulgaria: </b> The highest ever [https://www.kashkaval-tourist.com/weather-in-bulgaria-all-year-guide/ temperature] recorded is 45.2 degrees Celsius, while the lowest ever recorded is -38.3 degrees Celsius. When the sensors recorded temperatures nearing 400 degrees Celsius and -5K Celsius, they are obviously not behaving accuractely.
 +
# <b> Pressure across the world: </b>
 +
# <b> Humidity across the world: </b>  [https://en.wikipedia.org/wiki/Humidity Relative humidity] is always recorded as a percentage value, therefore values of 898 and -999 are just not possible or anywhere near accurate. Additionally we see some humidity values of around 110 as well.
 +
 +
 +
[[File:Performance Accuracy.jpg|550px|center]]
  
  
* Characterize the sensors’ coverage, performance and operation. Are they well distributed over the entire city? Are they all working properly at all times? Can you detect any unexpected behaviours of the sensors by analyzing the readings they capture? Limit your response to no more than 4 images and 600 words.
 
 
* Now turn your attention to the air pollution measurements themselves.  Which part of the city shows relatively higher readings than others?  Are these differences time-dependent? Limit your response to no more than 6 images and 800 words.
 
* Now turn your attention to the air pollution measurements themselves.  Which part of the city shows relatively higher readings than others?  Are these differences time-dependent? Limit your response to no more than 6 images and 800 words.
  
 
==<div style="font-family:Open Sans, Arial, sans-serif;font-size:17px">References</div>==
 
==<div style="font-family:Open Sans, Arial, sans-serif;font-size:17px">References</div>==
# Air quality index. (2018, May 04). Retrieved from https://www.eea.europa.eu/themes/air/air-quality-index/index#tab-based-on-data
+
# Bozhinov, T. (2018, August 15). The weather in Bulgaria: An all-year guide. Retrieved from https://www.kashkaval-tourist.com/weather-in-bulgaria-all-year-guide/
# Air Quality Standards. (n.d.). Retrieved from http://ec.europa.eu/environment/air/quality/standards.htm
+
# Humidity. (2018, October 30). Retrieved from https://en.wikipedia.org/wiki/Humidity
# Bulgaria Celebrates with Christmas Eve Traditions. (n.d.). Retrieved from https://www.novinite.com/articles/135151/Bulgaria Celebrates with Christmas Eve Traditions
 
#
 
# Climate - Bulgaria. (n.d.). Retrieved from https://www.climatestotravel.com/climate/Bulgaria
 
# Public Holidays in Bulgaria in 2018. (n.d.). Retrieved from https://www.officeholidays.com/countries/bulgaria/index.php
 
# Bulgaria: Kresna Gorge forest fires lead to more evacuations. (2017, August 30). Retrieved from https://sofiaglobe.com/2017/08/28/bulgaria-kresna-gorge-forestfires-lead-to-more-evacuations/
 
 
 
-->
 
  
 
=Task 3=
 
=Task 3=

Revision as of 18:16, 11 November 2018

Problem and Motivation

Dataset Analysis and Transformation Process

Task 1: Spatio-temporal Analysis of Official Air Quality

Task 2: Spatio-temporal Analysis of Citizen Science Air Quality Measurements

Sensor's Coverage

I brought in the sensors for the Sofia region only, and use the Density Marks feature of Tableau 10.3 to plot the coverage of the sensors. I find that the sensors are generally placed in central Sophia, with lack of coverage in South Eastern and North Eastern regions. The number of sensors increases from 2017 to 2018, however, this is mainly concentrated in the central region, with little focus on the areas that lack coverage.
Below is the coverage for the year 2017.

Coverage 2017.jpg

Below is the coverage for the year 2018.

Coverage.jpg

Sensors' Stable Operations

To understand whether the sensors are operating properly at all times, I looked into the time series data of the number of records being captured by the sensors over the hours. This shows that indeed the number of records and thereby sensors has increased over time. However, they are not working properly all the time, as shown by the sudden steep drop in the number of records collected and a sudden spike. Perhaps this is due to sensors suddenly not taking as many or taking too many records as compared to their configuration due to malfunction. A closer look at the time axis shows that these dips started in 2018. This could be due to the new sensors that were added in 2018. Perhaps certain members of the new group of sensors are not working correctly.

Stability Operation.jpg

Sensors' Performance Accuracy

Checking the performance accuracy of the sensors requires some basic knowledge of the measures that are being acquired by the sensors. The three measures I am looking into here are temperature, pressure and humidity.

  1. Temperature in Bulgaria: The highest ever temperature recorded is 45.2 degrees Celsius, while the lowest ever recorded is -38.3 degrees Celsius. When the sensors recorded temperatures nearing 400 degrees Celsius and -5K Celsius, they are obviously not behaving accuractely.
  2. Pressure across the world:
  3. Humidity across the world: Relative humidity is always recorded as a percentage value, therefore values of 898 and -999 are just not possible or anywhere near accurate. Additionally we see some humidity values of around 110 as well.


Performance Accuracy.jpg


  • Now turn your attention to the air pollution measurements themselves. Which part of the city shows relatively higher readings than others? Are these differences time-dependent? Limit your response to no more than 6 images and 800 words.

References

  1. Bozhinov, T. (2018, August 15). The weather in Bulgaria: An all-year guide. Retrieved from https://www.kashkaval-tourist.com/weather-in-bulgaria-all-year-guide/
  2. Humidity. (2018, October 30). Retrieved from https://en.wikipedia.org/wiki/Humidity

Task 3

Urban air pollution is a complex issue. There are many factors affecting the air quality of a city. Some of the possible causes are:

  • Local energy sources. For example, according to Unmask My City, a global initiative by doctors, nurses, public health practitioners, and allied health professionals dedicated to improving air quality and reducing emissions in our cities, Bulgaria’s main sources of PM10, and fine particle pollution PM2.5 (particles 2.5 microns or smaller) are household burning of fossil fuels or biomass, and transport.
  • Local meteorology such as temperature, pressure, rainfall, humidity, wind etc
  • Local topography
  • Complex interactions between local topography and meteorological characteristics.
  • Transboundary pollution, for example, the haze that intruded into Singapore from our neighbours.

In this third task, you are required to reveal the relationships between the factors mentioned above and the air quality measure detected in Task 1 and Task 2. Limit your response to no more than 5 images and 600 words.

Software

  • Tableau - for visualization of the various tasks
  • Python - for geocoding

References