Difference between revisions of "File:Group3ProjectBanner.PNG"

From Visual Analytics and Applications
Jump to navigation Jump to search
Line 36: Line 36:
  
 
Through this project, we hope that the tourism industry business, especially the small and medium business (check with the SME contribution to Singapore economy or tourism industry) can make optimal marketing solutions and business decision. We attempt to create the platform that assist the business owners and analysts to detect some useful insights from the relationship between travelling revenue and expenditure to promote the economic growth.
 
Through this project, we hope that the tourism industry business, especially the small and medium business (check with the SME contribution to Singapore economy or tourism industry) can make optimal marketing solutions and business decision. We attempt to create the platform that assist the business owners and analysts to detect some useful insights from the relationship between travelling revenue and expenditure to promote the economic growth.
 
 
 
 
* The platform can give us the overview on the visitors’ arrivals pattern by country, age and different transportation methods.  
 
* The platform can give us the overview on the visitors’ arrivals pattern by country, age and different transportation methods.  
Line 47: Line 46:
 
== Data Source ==
 
== Data Source ==
  
The Consumer Price Index (CPI) data is extracted from ''data.gov.sg<ref name="datagov">Government of Singapore. [https://data.gov.sg/dataset/consumer-price-index-monthly?view_id=0063aa5a-c5de-4c74-94be-b9ec443878be&resource_id=67d08d6b-2efa-4825-8bdb-667d23b7285e], Last Updated on 30 November 2017, Retrieved on 30 November 2017</ref>'' in a monthly format which reveals the figures from January 1961 to August 2017, while the index reference period is 2014. The data has an overall index representing changes in the price level of the whole basket with all items considered, and can also be drilled down to sub-indices and sub-sub-indices for different categories and sub-categories of goods and services. For our system analysis, we plan to use filtered data from 1990 onwards.
+
The Singapore Tourism Sector data is extracted from CEIC database which is available at:<br>  
 +
https://insights-ceicdata-com.libproxy.smu.edu.sg<br><br>
 +
From our perspective, we have selected the five datasets -- which are on the topics of tourism arrivals by country, age, transport, length of stay and the tourism revenue and expenditure. The datasets are in a monthly format. For our system analysis, we plan to use filtered data from 2007 onwards.
  
 
== Methodology ==
 
== Methodology ==
Line 53: Line 54:
 
==== Exploratory Analysis ====
 
==== Exploratory Analysis ====
  
We will explore the different trends of time-series data provided by the various economic data sets (Period cyclicity and seasonality). Different interactions of identified attributes might provide certain data insights that we can use for our analysis.
+
We will explore the different trends of time-series data provided by the various tourism data sets (Period cyclicity and seasonality). Different interactions of identified attributes might provide certain data insights that we can use for our analysis. Visualize the time series in the following ways:
 +
 
 +
* Geographic heat map: Visualize the time series by displaying the geographic heat map on the density of visitor arrivals based on the selection of the specific calendar month. 
 +
* Slopegraphs; This visualization technique can provide maximum information with “minimum ink”. It could help us to detect how the number of the visitor changed over the years.
 +
* Waterfall: Rather than the values itself, a waterfall plot tries to bring out the changes in the values. It could provide the overview of the time series line chart along with on how large the difference is between two data points.  
  
 
==== Explanatory Analysis ====
 
==== Explanatory Analysis ====
Relationships between our data will be explained based on our understanding of possible real-world events or causes. Using our CPI use-case as an example, the difference in CPI between the months of June and December can be explained as a result of the holiday seasons causing an increase of demand for clothing in December.
+
* Decompose time-series information into its constituent parts: Observation, Seasonal, Trend, Random (Noise). From the separate parts, users can understand the different time-series patterns and derive insights.
 +
* We have many variables(columns) in our dataset, so it is obvious that dimensionality is too high to make effective analysis, and the curse of dimensionality can happen. For this reason, it is important to reduce dimensionality in some way. One of the best approaches is to use time series representations in order to reduce dimensionality, reduce noise and emphasize the main characteristics of time series. In this stage, we would like to do the clustering time series analysis to group the countries with the similar pattern.
  
 
==== Predictive Analysis ====
 
==== Predictive Analysis ====
We can use analytics techniques such as Exponential Smoothing and ARIMA to predict future trends of our time-series data, due to the data's cyclical and seasonal nature.
+
Time series forecasting is the use of a model to predict future values based on previously observed values.in this case, we would like to use forecasting techniques such as seasonal exponential smoothing and ARIMA to perform prediction. After forecasting analysis, we must compare predicted tourism to real tourism to help us understand the accuracy of our forecasts. Meanwhile, the standard error and other mathematical statistics can be estimated to further verify the forecasting models and help to choose the best one.
  
 
== Application ==
 
== Application ==
  
 
The proposed system would have three major functions:
 
The proposed system would have three major functions:

Revision as of 22:12, 14 October 2018

Proposal

Poster

Application

Report


Background

The Crown Jewel of the Formula One Race Circuit, backdrop of the successful Hollywood Film “Crazy Rich Asian” and the honorable host of the Memorable North Korea-United States Summit, Singapore’s ability to position herself as a neutral yet vibrant destination has led to hordes of visitors setting foot onto her sunny shores. It is no surprise that the tourism sector has been developing into a growth engine for Singapore’s economy. For 2017, Singapore’s tourism sectors attained records highs in both tourists’ arrivals and spending. According to the data released by Singapore Tourism Board, the number of arrivals increased by 6.2 per cent to $17.4 million, while tourism receipts increased by 3.9 per cent to $26.8 million. The increasing affordability of travel, with the prevalence of low-cost carriers globally, as contribute to the opportunistic trend.

Motivation

During our exploratory analysis on the data comprising of the tourism arrival into Singapore, we noticed that the arrival patterns of tourists from respective countries at heterogenous. A keen understanding to the unique of arrival patterns can reveal the travel preference which is essential for businesses to attract more tourism receipts to boost their business revenue. The ability of the analysts to grapple the data and transform the insights into actionable business decision will see their businesses flourishes.

With the recent completion of Marina Cruise Centre and ongoing construction of Jewel Changi Airport, the tourism receipts are expected to continue to grow steady for the next decade, barring any black swans.

Objectives

We aim to build an interactive platform to illustrate the trends and seasonality within given time-series data on Singapore tourism sector. Users can have a better understanding of the Singapore tourism situation over the last ten years.

Through this project, we hope that the tourism industry business, especially the small and medium business (check with the SME contribution to Singapore economy or tourism industry) can make optimal marketing solutions and business decision. We attempt to create the platform that assist the business owners and analysts to detect some useful insights from the relationship between travelling revenue and expenditure to promote the economic growth.

  • The platform can give us the overview on the visitors’ arrivals pattern by country, age and different transportation methods.
  • It also provides the geographic map to illustrate the visitor density among different countries.
  • Tourism demand forecasting


Data Source

The Singapore Tourism Sector data is extracted from CEIC database which is available at:
https://insights-ceicdata-com.libproxy.smu.edu.sg

From our perspective, we have selected the five datasets -- which are on the topics of tourism arrivals by country, age, transport, length of stay and the tourism revenue and expenditure. The datasets are in a monthly format. For our system analysis, we plan to use filtered data from 2007 onwards.

Methodology

Exploratory Analysis

We will explore the different trends of time-series data provided by the various tourism data sets (Period cyclicity and seasonality). Different interactions of identified attributes might provide certain data insights that we can use for our analysis. Visualize the time series in the following ways:

  • Geographic heat map: Visualize the time series by displaying the geographic heat map on the density of visitor arrivals based on the selection of the specific calendar month.
  • Slopegraphs; This visualization technique can provide maximum information with “minimum ink”. It could help us to detect how the number of the visitor changed over the years.
  • Waterfall: Rather than the values itself, a waterfall plot tries to bring out the changes in the values. It could provide the overview of the time series line chart along with on how large the difference is between two data points.

Explanatory Analysis

  • Decompose time-series information into its constituent parts: Observation, Seasonal, Trend, Random (Noise). From the separate parts, users can understand the different time-series patterns and derive insights.
  • We have many variables(columns) in our dataset, so it is obvious that dimensionality is too high to make effective analysis, and the curse of dimensionality can happen. For this reason, it is important to reduce dimensionality in some way. One of the best approaches is to use time series representations in order to reduce dimensionality, reduce noise and emphasize the main characteristics of time series. In this stage, we would like to do the clustering time series analysis to group the countries with the similar pattern.

Predictive Analysis

Time series forecasting is the use of a model to predict future values based on previously observed values.in this case, we would like to use forecasting techniques such as seasonal exponential smoothing and ARIMA to perform prediction. After forecasting analysis, we must compare predicted tourism to real tourism to help us understand the accuracy of our forecasts. Meanwhile, the standard error and other mathematical statistics can be estimated to further verify the forecasting models and help to choose the best one.

Application

The proposed system would have three major functions:

File history

Click on a date/time to view the file as it appeared at that time.

Date/TimeThumbnailDimensionsUserComment
current22:01, 14 October 2018Thumbnail for version as of 22:01, 14 October 20181,050 × 351 (841 KB)Anna.zuo.2017 (talk | contribs)
  • You cannot overwrite this file.

There are no pages that use this file.

Metadata