Difference between revisions of "Assignment 2"

From Visual Analytics and Applications
Jump to navigation Jump to search
(Created page with "{|style="background-color:#556B2F; color:#000000 padding: 5px 0 0 0;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0" | | style="padding:0.4em; font-size...")
 
Line 24: Line 24:
 
We have provided the following data sets and would like to encourage you to use one of them in order to get started quickly and therefore have more time to explore the data and develop your analysis questions.  The data sets were extracted from [http://www.bankofengland.co.uk/research/Pages/onebank/datasets.aspx#3 Bank of England Data Visualisation Competition].   
 
We have provided the following data sets and would like to encourage you to use one of them in order to get started quickly and therefore have more time to explore the data and develop your analysis questions.  The data sets were extracted from [http://www.bankofengland.co.uk/research/Pages/onebank/datasets.aspx#3 Bank of England Data Visualisation Competition].   
  
==Bank of England/NMG household survey data==
 
  
<font size = 3>[http://www.bankofengland.co.uk/research/Documents/onebank/nmgface.xlsx Bank Of England NMG Household Survey face-to-face survey data]</font>
 
 
The Bank of England/NMG survey is an annual survey of households conducted by NMG Consulting on behalf of the Bank. It includes questions on households’ balance sheets and spending. The survey has been conducted annually since 2004, during September. Between 2004 and 2011 the survey was carried out face-to-face. The survey has been fully online since 2012. Results from each of the surveys are summarised in an annual article published each year in the Q4 Quarterly Bulletin. See the most recent QB article here [http://www.bankofengland.co.uk/publications/Documents/quarterlybulletin/2014/qb14q405.pdf].
 
 
The Bank of England/NMG survey data provide a more timely update of developments in households’ finances than other surveys, which are typically published with a longer lag. They also can be used to address the following One Bank Research Agenda questions: 
 
* What determines marginal propensities to consume across households and how they differ across household types? (Theme 4)
 
* What determines the distribution of household indebtedness and mortgage arrears? (Theme 4)
 
* How is the reliability of household datasets affected by survey methodology? (Theme 4)
 
 
'''Note:''' Focus on the attributes and their relationships, this assignment does not require you to detect and analyse the temporal patterns.
 
 
==Inflation attitudes survey==
 
 
<font size = 3>[http://www.bankofengland.co.uk/research/Documents/onebank/nop.xlsx Inflation attitudes survey data]</font>
 
 
This dataset contains the individual level responses to the Inflation Attitudes Survey, a survey of households’ attitudes about inflation and other key economic variables. The survey is conducted by GfK NOP on behalf of the Bank of England and covers the period from 2003 to 2014 on a quarterly basis, with annual data for 2001 and 2002. The first worksheet contains a list of questions asked on the survey and all possible responses. The second worksheet contains descriptions for all other variables in the dataset. The final worksheet is the full dataset; the numbers in the dataset refer back to the previous spreadsheets.
 
 
In an inflation targeting framework, inflation expectations are a key measure of central bank credibility and can give an indication of wider economic developments. This micro-level dataset provides greater scope for investigating inflation attitudes, including inflation perceptions and expectations. The Bank of England has previously published summary statistics including medians for these data at a macro-level (for more information see the [http://www.bankofengland.co.uk/publications/Pages/other/nop.aspx Bank of England/NOP Inflation Attitudes Survey] page). Sample sizes are around 2,000 per quarter, except in the February quarter when around 4,000 people are surveyed. The responses are subject to sampling error as one sample of people will differ from the characteristics of the population as a whole and that will cause small variations in summary results. This new version of the data allows users to further analyse responses split by demographic variables such as age, region and gender.
 
 
Some of the One Bank Research Agenda questions which this data may help answer include:
 
* How do public communications and disclosure policies affect behaviour and incentives? (Theme 3)
 
* How is the reliability of household and corporate data sets affected by survey methodology? (Theme 4)
 
* How do demographics and the distribution of wealth and income in society affect the monetary transmission mechanism? (Theme 5)
 
 
'''Note:''' Focus on the attributes and their relationships, this assignment does not require you to detect and analyse the temporal patterns.  You can focus on the survey done on 2013 and/or 2014.
 
  
  

Revision as of 10:09, 12 September 2016

Assignment 1   Assignment 2   Assignment 3   Assignment Dropbox


Data Discovery in High-dimensional Data: Visual Analytics Techniques and Methods


Overview

In this digital economy age, massive and complex data have been captured and stored in organization databases and/or data warehouses. By and large, these data contain a large amount of variables of a particular product, customer or activity. Due to limitations in perceptual and screen space, graphical techniques available in traditional business intelligence systems tend to confine to univariate and bivariate data such as bar chart, pie chart and scatterplot. As a result, many important relationships that live in these data remain undiscovered. In this assignment, you are required to apply interactive data exploration and analysis techniques to discovery patterns in multivariate data. The goal of this assignment is not to develop a new visualization tool, but to apply the interactive data exploration and analysis techniques you have learned by using commercial-of-the-shelf and opensource software. It also aims to allow you to gain hands-on experiences on using the visualization tool and at the same time, to evaluate the pros and cons of the tool in real world applications.


Data Sets

We have provided the following data sets and would like to encourage you to use one of them in order to get started quickly and therefore have more time to explore the data and develop your analysis questions. The data sets were extracted from Bank of England Data Visualisation Competition.



Visual Analytics Application Design Process

Step 1: Identify a theme of interest

Each of the dataset provides a wide range of parameters that can be used for many different purposes. Hence, it is very important for you to identify a theme clearly before you start your investigation. For example, you might want to focus on issues related to business competitiveness.


Step 2: Define questions for investigation

After you have identified a theme, you should now formulate questions for investigation. For example: Is there a relationship between sales revenue and marketing expenditure? Are the growth of GPD per capita and the growth of productivity correlated? Are there different patterns of energy consumption in different regions of the world?


Step 3: Find appropriate data attributes

Extract and download the datasets in convenient formats such as Excel or a CSV file. The online database contains a lot of tabulated data. In some cases, you will need to convert the data to a format you can use. Format conversion is a big part of visualization research so it is worth learning techniques for doing such conversions.

You will need to iterate through these steps a few times. It may be challenging to find interesting questions and a dataset that has the information that you need to answer those questions.


Recommended Best Practice

After you have the initial question and the appropriate datasets, construct a visualization that provides an answer to your question. As you construct the visualization, you will find that your question evolves – very often, it will become more specific. Keep track of this evolution and the other questions that occur to you along the way. Once you have answered all the questions to your satisfaction, think of a way to present the data and the answers as clearly as possible. The presentation must be in the form of interactive visualization.

Before starting, write down the initial question clearly. And, as you go, maintain a wiki notebook of what you have to do to construct the visualizations and how the questions evolved. Include in the notebook where you get the data from, and documentation about the format of the dataset. Describe any transformations or rearrangements of the dataset that you need to perform; in particular, describe how you get the data into the format needed by the visualization system. Keep copies of any intermediate visualizations that have helped you refine your question. After you have constructed the final visualization for presenting your answer, write a caption and a paragraph describing the visualization, and how it answers the question you posed. Think of the figure, the caption and the text as materials you might include in a research paper.

You should maintain a session on the assignment wiki documents all the questions you asked and the steps you performed from start to the end .


Data Visualization Software

To perform the analysis, you are encouraged to explore the following visual analytics toolkit and API or their equivalent besides Tableau, JMP, Tibco Spotfire, DataWatch Designer, and Qlik Sense:

  • Mondrian
  • High-D
  • GeoViz Toolkit

and specialised visual analytics techniques

  • Parallel Sets
  • TableLens
  • Treemap
  • Sunburst

One of the goals of this assignment is for you to learn to use and evaluate the effectiveness of these visualization tools. Please do not hesitate to consult me if you encounter problems in using the tool.


Useful resources


Submission details

This is an individual assignment. You are required to work on the assignment and prepare submission individually. Your completed assignment is due on 23rd October 2015, by 11.59pm mid-night.

You need to edit your assignment in the appropriate wiki page of the Assignment Dropbox. The title of the wiki page should be in the form of: ISSS608_2015-16_Term1_Assign2_FullName.

The assignment 2 wiki page should include the URL link to the web-based interactive data visualization system prepared.


Assignment 2 Q&A

Need more clarification, please go here