Difference between revisions of "Assignments"

From Visual Analytics for Business Intelligence

Revision as of 23:21, 16 October 2018

IS428 Visual Analytics for Business Intelligence and Applications


To be a Visual Detective

The assignments require you to apply the concepts, methods and techniques you have learned in class to solve real-world problems using visual analytics techniques. You should also use the assignments to gain hands-on experience with the data visualisation toolkits I have shared with you.

Overview

The Task

General task

The four factories in the industrial area are subjected to higher-than-usual environmental assessment, due to their proximity to both the city and the preserve. Gaseous effluent data from several sampling stations has been collected over several months, along with meteorological data (wind speed and direction), that could help Mitch understand what impact these factories may be having on the Rose-Crested Blue Pipit. These factories are supposed to be quite compliant with recent years’ environmental regulations, but Mitch has his doubts that the actual data has been closely reviewed. Could visual analytics help him understand the real situation?

The primary job for Mitch is to determine which (if any) of the factories may be contributing to the problems of the Rose-Crested Blue Pipit. Often, air sampling analysis deals with a single chemical being emitted by a single factory. In this case, though, there are four factories, potentially each emitting four chemicals, being monitored by nine different sensors. Further, some of the chemicals being emitted are more hazardous than others. Your task, supported by the visual analytics you apply, is to untangle the data to help Mitch determine where problems may be. Use visual analytics to analyze the available data and develop responses to the questions below.

The specific tasks

  • Characterize the sensors’ performance and operation. Are they all working properly at all times? Can you detect any unexpected behaviors of the sensors through analyzing the readings they capture? Limit your response to no more than 9 images and 1000 words.
  • Now turn your attention to the chemicals themselves. Which chemicals are being detected by the sensor group? What patterns of chemical releases do you see, as being reported in the data? Limit your response to no more than 6 images and 500 words.
  • Which factories are responsible for which chemical releases? Carefully describe how you determined this using all the data you have available. For the factories you identified, describe any observed patterns of operation revealed in the data. Limit your response to no more than 8 images and 1000 words.

The Data Sets

The major data sets provided for this assignment are:

  • Official air quality measurements (5 stations in the city) – as per EU guidelines on air quality monitoring. See the data description at https://drive.google.com/file/d/1v5yCL-LdriDwa65qXPbFL7b0tydylDlb/view
  • Citizen science air quality measurements, incl. temperature, humidity and pressure (many stations) and topography (gridded data).
  • Meteorological measurements (1 station): Temperature; Humidity; Wind speed; Pressure; Rainfall; Visibility
  • Topography data

They can be downloaded from https://storage.cloud.google.com/global-datathon-2018/sofia-air/air-sofia.zip.


Visualisation Software

To perform the visual analysis, students are encouraged to explore any one or a combination of the following software:

  • Tableau
  • JMP Pro
  • Qlik Sense
  • Microsoft Power BI

One of the goals of this assignment is for you to learn to use and evaluate the effectiveness of these visual analytics tools.


Submission details

This is an individual assignment. You are required to work on the assignment and prepare your submission individually. Your completed assignment is due on 8th October 2017, by 11.59pm.

You need to edit your assignment in the appropriate wiki page of the Assignment Dropbox. The title of the wiki page should be in the form of: IS428_2017-18_T1_Assign_FullName.

The assignment wiki page should include the URL link to the web-based interactive data visualization system you have prepared.


Reference


Assignment Q&A

If you need more clarification, please feel free to pen down your questions.