Difference between revisions of "ISSS608 2018-19 T1 Assign Hou Xuelin"

From Visual Analytics and Applications
Jump to navigation Jump to search
Line 22: Line 22:
 
[[Image:Task1-3.png|600px]]
 
[[Image:Task1-3.png|600px]]
  
=== anomalies of official data ===
+
=== Anomalies of Official Data ===
What anomalies do you find in the official air quality dataset? How do these affect your analysis of potential problems to the environment?
+
* Only Nov/Dec data is recorded in 2017 and the rest months data is all missing.This may not be representative for 2017 annual data.
 +
* the sampling frequency `AveagingTime` is inconsistent throughout the data, it ranges from day, hour and var. This may introduce some bias, when we aggregate the data.
  
 
== Task2 Exploration of Sensor Data ==
 
== Task2 Exploration of Sensor Data ==

Revision as of 09:54, 13 November 2018


Xuelin banner.jpg

Task1 Exploration of Official Data

Situation of Air Quality

The annual average PM10 concentration is around 45 from 2013 to 2018.
Druzhba is improving its air condition in recent two years, but Nadezhda showed uplift in 2017.
PM10 concentration in the rest of areas are gradually declining.

Task1-4.png

The PM10 trend in Sofia is highly periodic and the peaks are always fall on winters (Jan/Dec).
This may be due to domestic heating in winters.

Task1-1.png

Task1-2.png

A typical PM10 trend within day remains average around 30, and declines to around 20 between 10am - 5pm, when most of people are out for working.

Task1-3.png

Anomalies of Official Data

  • Only Nov/Dec data is recorded in 2017 and the rest months data is all missing.This may not be representative for 2017 annual data.
  • the sampling frequency `AveagingTime` is inconsistent throughout the data, it ranges from day, hour and var. This may introduce some bias, when we aggregate the data.

Task2 Exploration of Sensor Data

exploratory of sensor data

Characterize the sensors’ coverage, performance and operation. Are they well distributed over the entire city? Are they all working properly at all times?

anomalies of sensor data

Can you detect any unexpected behaviors of the sensors through analyzing the readings they capture? Limit your response to no more than 4 images and 600 words.

air pollution correlation

Now turn your attention to the air pollution measurements themselves. Which part of the city shows relatively higher readings than others? Are these differences time dependent? Limit your response to no more than 6 images and 800 words.


Task3 Factors Affect Sofia Air Pollution

Urban air pollution is a complex issue. There are many factors affecting the air quality of a city. Some of the possible causes are:

Local energy sources. For example, according to Unmask My City, a global initiative by doctors, nurses, public health practitioners, and allied health professionals dedicated to improving air quality and reducing emissions in our cities, Bulgaria’s main sources of PM10, and fine particle pollution PM2.5 (particles 2.5 microns or smaller) are household burning of fossil fuels or biomass, and transport. Local meteorology such as temperature, pressure, rainfall, humidity, wind etc Local topography Complex interactions between local topography and meteorological characteristics. Transboundary pollution for example the haze that intruded into Singapore from our neighbours. In this third task, you are required to reveal the relationships between the factors mentioned above and the air quality measure detected in Task 1 and Task 2. Limit your response to no more than 5 images and 600 words.

Data Source

Methodology

Application Libraries & Packages

Package Name Descriptions
xlsx R package for Excel file manipulation.

References