IS428 AY2018-19T1 Chua Sing Rue
Contents
Problem and Motivation
Data Preparation
Task 1: Spatio-temporal Analysis of Official Air Quality
EEA Dataset
Name of Air Quality Station | Local ID | Data available |
---|---|---|
| ||
| ||
| ||
| ||
| ||
|
Anomalies in the official air quality dataset
Issue 1: The official air quality measurements in the EEA Data folder contains PM10 measurements from air quality stations in Sofia city for the years 2013 to 2018. However, the dataset is not complete. Air quality station Mladost has only 2018 data available. Additionally, air quality station Orlov Most is missing data for the years 2016, 2017 and 2018.
Solution: For the purposes of comparison, air quality station Mladost and Orlov Most will be excluded.
Issue 2: PM10 measurements are reported as an averaged daily value for the years 2013 to 2016, but reported as an averaged hourly value for the years 2017 to 2018.
Solution: When comparing across the periods 2013 to 2016 and 2017 to 2018, the hourly PM10 average values will be recalculated as a daily average in order to get a common basis for comparison. At the same time, averaged hourly PM10 values will still be used to drill down into within-day trends for the more recent information.
Issue 3: For 2017 data, PM10 values for 1 Jan 2017 to 26 Nov 2017 are missing. Additionally, 2018 data is recorded up til September 14 only.
Solution: 2017 and 2018 data should not be used for yearly trends.