Difference between revisions of "Group 8 Overview"

From Visual Analytics and Applications
Jump to navigation Jump to search
Line 24: Line 24:
  
 
=== Time series Explorer ===
 
=== Time series Explorer ===
 +
 +
Time-series data has always played an important role in understanding and evaluating past behaviours. The usage of time-series information has allowed companies and organizations to tune their operations instead of simply improving by trial and error. It helps to describe and explore the hidden data patterns that have occurred over time, and with these insights, predict the future likelihoods of revenues and profits. At a country level, we can explore living standards of citizens by understanding the factors affecting their day-to-day lives.
 +
 +
One of the important example measures that can allow us to get a glimpse would be the Consumer Price Index (CPI).
  
 
=== Use Case ===
 
=== Use Case ===
 
  
 
The Consumer Price Index (CPI), is a critical indicator to assess the consumer price inflation. To profile the weighted average price changes for households' cost of living, Singapore's CPI adopts a fixed basket of residents' commonly consumption goods and services. About 6,600 brands/varieties from 4,200 outlets are selected in the 2014-based CPI (the dataset for our study). In basket level, the composition of goods and services can be categorized into 10 major divisions, which are listed below:
 
The Consumer Price Index (CPI), is a critical indicator to assess the consumer price inflation. To profile the weighted average price changes for households' cost of living, Singapore's CPI adopts a fixed basket of residents' commonly consumption goods and services. About 6,600 brands/varieties from 4,200 outlets are selected in the 2014-based CPI (the dataset for our study). In basket level, the composition of goods and services can be categorized into 10 major divisions, which are listed below:
Line 36: Line 39:
 
To construct CPI, two main types of data are required - the price data of a sample of goods and services, and the weighting data to represent the shares of different divisions' expenditure. The price data is gathered through a combination of data collection modes, while the frequency of price collection depends on price behavior of the item. The weighting data is derived from the expenditure values collected in the Household Expenditure Survey,  the latest one came from HES 2012/13, updated to 2014 values by taking into account price changes between 2012/13 and 2014.
 
To construct CPI, two main types of data are required - the price data of a sample of goods and services, and the weighting data to represent the shares of different divisions' expenditure. The price data is gathered through a combination of data collection modes, while the frequency of price collection depends on price behavior of the item. The weighting data is derived from the expenditure values collected in the Household Expenditure Survey,  the latest one came from HES 2012/13, updated to 2014 values by taking into account price changes between 2012/13 and 2014.
  
As a Price index, CPI can also be affected by other types of costs in Singapore. Indicators like COE Bidding Price, Import and Export price index, Exchange Rates, are assumed to have interesting relationships with CPI which are worthy of further study. These data sources are also acceptable within the proposed system within certain format restrictions.
+
As a Price index, CPI can also be affected by other types of costs in Singapore. Indicators like COE Bidding Price, Import and Export price index, Exchange Rates, are assumed to have interesting relationships with CPI which are worthy of further study. These data sources are also acceptable within the proposed system design within certain format restrictions.
  
 +
== Motivation ==
  
== Motivation ==
+
With the background knowledge in mind, our team would like to create exploratory and predictive models, which showcase these complex time-related trends of Singapore's CPI throughout the years (1990-2017) for different categories. We would like to use time-series visualization techniques such as tables and line charts representing Trend, Seasonality and Random to investigate any insights and to display potential forecasts to the audience.
  
With the background knowledge in mind, our team would like to create an exploratory model that showcases these complex relationships of Singapore's CPI, exchange rates, import and export pricing throughout the years (1990-2017) visualized on the world map. Time-series visualization techniques would also be adopted to look for hidden data trends for analysis.
+
We would also apply system design principles to make the proposed system accept any form of generic time-series data. This is to allow flexibility of the system and expands its scope of usage.
  
 
There are several reasons why we found this project interesting:
 
There are several reasons why we found this project interesting:
  
* Closer to our daily standards of living, we wanted to understand what are the existing factors that can impact a country's CPI and subsequently its impact on its citizens.
+
* General exploration of time-series data. The project allows us to learn and re-learn time-series analysis techniques and concepts. This gives us the opportunity to let us put our theoretical knowledge into practical use.
* One of the potential factors that impact CPI, we wanted to look at how exchange rates work in Singapore's context and how they are used to Singapore's advantage when trading with the country's trading partners.
+
* Closer to our daily standards of living, we wanted to understand what are the current categories that make up a country's CPI index and subsequently its impact on its citizens.
* Since Singapore is heavily dependent on its imported consumer goods, we also wanted to investigate how import and export pricing works as a whole, as well as its involvement as a moving part of a country's full economic system.
+
* We wanted to explore whether different periods of time would indicate the different price index values in a cyclical manner. This would also allow us to understand the overall direction prices are taking for every goods & services category.
* As an overall conclusion, we know that the above economic indicators are highly related and would like to combine them to receive any data insights in terms of time period trends.
 
  
Even though Singapore is a relatively young country, it is able to provide rich data to help us explore these interesting observations.
+
Even though Singapore is a relatively young country, it is able to provide rich data to help us explore these interesting observations. This is also in part due to the government initiative of Smart Nation Singapore.
  
 
== Objectives ==
 
== Objectives ==
1. Provide interactive platform to illustrate the relationship between Singapore's exchange rate information, Consumer Price Index and Import and Export markets.
+
1. Provide interactive platform to illustrate the trends and seasonalities within given time-series data (i.e. Singapore's CPI).
  
 
2. Discover data insights using visualization and interactivity that cannot be easily represented using raw data.
 
2. Discover data insights using visualization and interactivity that cannot be easily represented using raw data.
  
3. Make use of freely available Singapore economic data to arouse the interests of potential viewers and increase their curiosity on our trading relationships with the country's important trading partners.
+
3. Make use of freely available Singapore economic data to arouse the interests of potential viewers and increase their curiosity on our Singapore state of consumer goods and services.
  
 
== Data Source ==
 
== Data Source ==
  
The consumer price index data is extracted from data.gov.sg in a monthly format which reveals the figures from January, 1961 to August,2017, while the index reference period is 2014. The data has overall index represents changes in the price level of the whole basket with all items considered, and can also be drilled down to sub-indices and sub-sub-indices for different categories and sub-categories of goods and services.
+
The Consumer Price Index (CPI) data is extracted from data.gov.sg in a monthly format which reveals the figures from January 1961 to August 2017, while the index reference period is 2014. The data has an overall index representing changes in the price level of the whole basket with all items considered, and can also be drilled down to sub-indices and sub-sub-indices for different categories and sub-categories of goods and services. For our system analysis, we plan to use filtered data from 1990 onwards.
 
 
The COE data can be gained from a third-party website which contains the COE bidding price since 1990, and showcases the premiums in each month by different categories of vehicles.
 
 
 
Singapore MAS Exchange Rate Data can be retrieved from Singapore MAS website, and showcases denominations of S$ per 100 of selected foreign currencies. The data set can be obtained in weekly and monthly format from the year 1988-2017.
 
 
 
Singapore import/export statistic dataset is downloaded from CEIC database and sourced by International Enterprise Singapore & Department of Statistics. The dataset contains monthly import & export value by commodity section from 1964 to 2017.
 
 
 
In order to align with other datasets used in this project, the data before 1990 will be excluded from the list.
 
  
 
== Analysis Methods ==
 
== Analysis Methods ==
Line 78: Line 73:
  
 
=== Explanatory Analysis ===
 
=== Explanatory Analysis ===
Relationships between our data will be explained based on our understanding of possible real-world events or causes. For example, we might be able to explain the difference in import and export prices between the months of June and December due to the holiday seasons causing an increase of demand for turkey in December.
+
Relationships between our data will be explained based on our understanding of possible real-world events or causes. For example, we might be able to explain the difference in CPI between the months of June and December due to the holiday seasons causing an increase of demand for clothing in December.
  
 
=== Predictive Analysis ===
 
=== Predictive Analysis ===
Due to the repeating nature of cyclic and seasonal time-series data, we can use the identified trends to predict the various CPI index values for future dates.
+
Due to the repeating nature of cyclic and seasonal time-series data, we can use the identified trends to predict the various CPI index values for future dates. Potential analytics techniques include different models of exponential smoothing and ARIMA.
  
 
== Application ==
 
== Application ==

Revision as of 14:44, 28 November 2017

width="100%"

Proposal

Poster

Application

Report


Background

Time series Explorer

Time-series data has always played an important role in understanding and evaluating past behaviours. The usage of time-series information has allowed companies and organizations to tune their operations instead of simply improving by trial and error. It helps to describe and explore the hidden data patterns that have occurred over time, and with these insights, predict the future likelihoods of revenues and profits. At a country level, we can explore living standards of citizens by understanding the factors affecting their day-to-day lives.

One of the important example measures that can allow us to get a glimpse would be the Consumer Price Index (CPI).

Use Case

The Consumer Price Index (CPI), is a critical indicator to assess the consumer price inflation. To profile the weighted average price changes for households' cost of living, Singapore's CPI adopts a fixed basket of residents' commonly consumption goods and services. About 6,600 brands/varieties from 4,200 outlets are selected in the 2014-based CPI (the dataset for our study). In basket level, the composition of goods and services can be categorized into 10 major divisions, which are listed below:

Food, Clothing & Footwear, Housing & Utilities, Household Durables And Services, Health Care, 
Transport, Communication, Recreation & Culture, Education, Miscellaneous Goods & Services
width="100%"

To construct CPI, two main types of data are required - the price data of a sample of goods and services, and the weighting data to represent the shares of different divisions' expenditure. The price data is gathered through a combination of data collection modes, while the frequency of price collection depends on price behavior of the item. The weighting data is derived from the expenditure values collected in the Household Expenditure Survey, the latest one came from HES 2012/13, updated to 2014 values by taking into account price changes between 2012/13 and 2014.

As a Price index, CPI can also be affected by other types of costs in Singapore. Indicators like COE Bidding Price, Import and Export price index, Exchange Rates, are assumed to have interesting relationships with CPI which are worthy of further study. These data sources are also acceptable within the proposed system design within certain format restrictions.

Motivation

With the background knowledge in mind, our team would like to create exploratory and predictive models, which showcase these complex time-related trends of Singapore's CPI throughout the years (1990-2017) for different categories. We would like to use time-series visualization techniques such as tables and line charts representing Trend, Seasonality and Random to investigate any insights and to display potential forecasts to the audience.

We would also apply system design principles to make the proposed system accept any form of generic time-series data. This is to allow flexibility of the system and expands its scope of usage.

There are several reasons why we found this project interesting:

  • General exploration of time-series data. The project allows us to learn and re-learn time-series analysis techniques and concepts. This gives us the opportunity to let us put our theoretical knowledge into practical use.
  • Closer to our daily standards of living, we wanted to understand what are the current categories that make up a country's CPI index and subsequently its impact on its citizens.
  • We wanted to explore whether different periods of time would indicate the different price index values in a cyclical manner. This would also allow us to understand the overall direction prices are taking for every goods & services category.

Even though Singapore is a relatively young country, it is able to provide rich data to help us explore these interesting observations. This is also in part due to the government initiative of Smart Nation Singapore.

Objectives

1. Provide interactive platform to illustrate the trends and seasonalities within given time-series data (i.e. Singapore's CPI).

2. Discover data insights using visualization and interactivity that cannot be easily represented using raw data.

3. Make use of freely available Singapore economic data to arouse the interests of potential viewers and increase their curiosity on our Singapore state of consumer goods and services.

Data Source

The Consumer Price Index (CPI) data is extracted from data.gov.sg in a monthly format which reveals the figures from January 1961 to August 2017, while the index reference period is 2014. The data has an overall index representing changes in the price level of the whole basket with all items considered, and can also be drilled down to sub-indices and sub-sub-indices for different categories and sub-categories of goods and services. For our system analysis, we plan to use filtered data from 1990 onwards.

Analysis Methods

Exploratory Analysis

We will explore the different trends of time-series data provided by the various economic data sets (Period cyclicity and seasonality). Different interactions of identified attributes might provide certain data insights that we can use for our analysis.

Explanatory Analysis

Relationships between our data will be explained based on our understanding of possible real-world events or causes. For example, we might be able to explain the difference in CPI between the months of June and December due to the holiday seasons causing an increase of demand for clothing in December.

Predictive Analysis

Due to the repeating nature of cyclic and seasonal time-series data, we can use the identified trends to predict the various CPI index values for future dates. Potential analytics techniques include different models of exponential smoothing and ARIMA.

Application

References

1. https://data.gov.sg/dataset/consumer-price-index-monthly?view_id=0063aa5a-c5de-4c74-94be-b9ec443878be&resource_id=67d08d6b-2efa-4825-8bdb-667d23b7285e

2. https://secure.mas.gov.sg/msb/ExchangeRates.aspx

3. https://insights-ceicdata-com.libproxy.smu.edu.sg/Untitled-insight/views

4. http://tralvex.com/pub/cars/coe.htm