ISSS608 2018-19 T1 Assign Debbie Siah Mei Ping Task 2

From Visual Analytics and Applications
Jump to navigation Jump to search

Sofia City - Air Quality Analysis

Background

Data Preparation

Task 1

Task 2

Task 3

 


Task 2: Spatio-temporal Analysis of Citizen Science Air Quality Measurements

Using appropriate data visualisation, you are required will be asked to answer the following types of questions:

  • Characterize the sensors’ coverage, performance and operation. Are they well distributed over the entire city? Are they all working properly at all times? Can you detect any unexpected behaviors of the sensors through analyzing the readings they capture?
  • Now turn your attention to the air pollution measurements themselves. Which part of the city shows relatively higher readings than others? Are these differences time dependent?

Click here to access the Tableau Dashboard for Task 2.

Coverage of Sensors

Sofia City is well covered with citizen sensors.

Task2-Map.png

Performance & Operation of Sensors

Although the coverage of the citizen sensors is much better than the official air quality stations, however, it will be necessary to show the robustness of the performance and operation of the sensors before the data collected can be trusted.

Based on the overall time series trend of the 2 major pollutant and 3 major atmospheric measurements, it can be seen that there are 2 dips in the pressure readings that went to 0. This is illogical and may be a sign of failure at the receiving end of the station as it is not statistically possible for all the transmitting sensors to be faulty at the same time, resulting in 0. This will be further examined below in the Pressure section.

Task2-EnvironmentPreCleaning.png


Pressure

The initial plotting of the pressure readings collected by the citizen sensors shows that the readings range from -20k to 260k. This is illogical and may be an indication of faulty sensors or receivers.

Task2-Pressure.png

Upon closer examination, it is found that there is a large number of readings (more than 400,000) that equate to 0. This is a sign that enforces the hypothesis made earlier on about a fault in the receiving end for pressure related readings. Besides these, there are also various outliers which should be removed as they are out of the logical range.

Task2-PressureIncludeOutliers.png

Humidity

The initial plotting of the humidity readings collected by the citizen sensors shows that the readings range from -1000 to 1000. This is illogical and may be an indication of faulty sensors or receivers.

Task2-Humidity.png

Upon closer examination, it is found that there are a small number of readings beyond the logical range of 0 - 100 for humidity. These outliers should be removed prior to any analysis.

Task2-HumidityIncludeOutliers.png

Temperature

The initial plotting of the temperature readings collected by the citizen sensors shows that the readings range from -6000 to 1000. This is illogical and may be an indication of faulty sensors or receivers.

Task2-Temperature.png

Upon closer examination, it is found that there are a small number of readings beyond the logical range of -20 to 50 for general temperature on Earth. These outliers should be removed prior to any analysis.

Task2-TemperatureOutliers.png

Air Pollutants as Detected by Citizen Sensors

It is noted that there are 2 particular areas where the P1 readings are much higher than usual. They are boxed out in the map below. The highest polluted areas appear to be near the airport, as well as the stretch along roads.

Task2-MapAreaCircled.png

It appears that the pollutants are the highest during the colder months of the year.

Task2-EnvironmentRemoveOutliers.png

The maximumm P1 reading in the year occured on 27th January. Upon closer examination, the pollutant level is higher during the night where temperature is the lowest and humidity is the highest (indicative that it could be raining or snowing).

Task2-Environment27Jan.png