Task 2: Citizen Science Air Quality
|
|
|
|
|
|
Understanding Citizen Science Air Quality Data
Contents
Characteristics of sensors’ coverage, performance and operation
Distribution of sensors
Congregation of sensors at City Center
Initial exploratory analysis in JMP Pro revealed that there are 1265 unique geohashed location of sensors. By plotting the location of sensors onto the map of the city, we observed that these sensors are mostly located at the city center with sparse representation at the extremities of Sofia City. Hex binning is performed to prevent overlap in pointers when plotting the sensors onto the Tableau online map.
Performance of sensors
Extreme sensor values recorded for period of April-Aug18
Using descriptive analysis function in JMP Pro 14.0, we were able to perform exploratory visualization of the dataset. Initial investigation shows that the distribution of P1 and P1 hovers around range of 0-300. Upon closer inspection, we noticed that there are 3262 counts of value 2000 recorded for P1 (PM10) and 3314 counts of value 1000 recorded for P2 (PM2.5). Incidentally, these values coincide at similar period of recording.
To preserve the integrity of the dataset, these values has been retained in data preparation steps but kept in consideration as outliers in later visualization steps in Tableau.
Distribution of air pollutant in Sofia City
Which part of the city shows higher concentration in air pollutant?
Concentration of pollutant is noticeably higher at a few locations in Sofia City
The dotplot of average PM10 and PM2.5 allow user to visually discern areas with higher concentration as represented by larger circle on the map. From the plot we noticed 2 main larger circles situated at geohashed locations "sx8d8xv4bbd" and "sx8d7hyt4xs". With reference to the map of Sophia City below, these address of these areas are at ul. "Pladnishte" 26 1614 VZ Gorna Bania, Sofia, Bulgaria and 1434 Sofia Skay, Sofia, Bulgaria respectively.
Is concentration time dependent?
Concentration is highest in first 4 weeks of the year i.e. January
Period of high concentration of pollutant is represented with warmer shading of coloration on the Calendar graph. For this visualization, we look at changes to level of concentration of PM10 and PM2.5 across all 52 weeks of the year. The results coincides with our earlier discovery in Task 1.
Concentration of pollutant increases in the evening
The density plot is used to represent the hourly concentration of pollutants across Sofia City where area of higher concentration is reflected with darker/warmer shade of colors. We have prepared a gif to allow user to have a better visual understanding of changes in concentration of PM10 and PM2.5 throughout the day. We find that concentration is generally lower throughout in the night, especially period of 0000hr-0600hr. Concentration of pollutants increases significant as the day turns to night.