Difference between revisions of "ISSS608 2016 17T1 Group1 Proposal"

From Visual Analytics and Applications
Jump to navigation Jump to search
 
(4 intermediate revisions by the same user not shown)
Line 4: Line 4:
 
| style="border-bottom:4px solid #000066; border-top:4px solid #000066; background:none;" width="1%" |    
 
| style="border-bottom:4px solid #000066; border-top:4px solid #000066; background:none;" width="1%" |    
 
| style="padding:0.4em; font-size:100%; background-color:#000066;  border-bottom:4px solid #000066; border-top:4px solid #000066; text-align:center; color:#000066" width="10%" |[[ISSS608_2016_17T1_Group1_Poster|<font color="#ffffff" size=2><b>POSTER</b></font>]]
 
| style="padding:0.4em; font-size:100%; background-color:#000066;  border-bottom:4px solid #000066; border-top:4px solid #000066; text-align:center; color:#000066" width="10%" |[[ISSS608_2016_17T1_Group1_Poster|<font color="#ffffff" size=2><b>POSTER</b></font>]]
 
| style="border-bottom:4px solid #000066; border-top:4px solid #000066; background:none;" width="1%" | &nbsp;
 
| style="padding:0.4em; font-size:100%; background-color:#000066;  border-bottom:4px solid #000066; border-top:4px solid #000066; text-align:center; color:#000066" width="10%" |[[ISSS608_2016_17T1_Group1_Application|<font color="#ffffff" size=2><b>APPLICATION</b></font>]]
 
  
 
| style="border-bottom:4px solid #000066; border-top:4px solid #000066; background:none;" width="1%" | &nbsp;  
 
| style="border-bottom:4px solid #000066; border-top:4px solid #000066; background:none;" width="1%" | &nbsp;  
 
| style="padding:0.4em; font-size:90%; background-color:#000066;  border-bottom:4px solid #000066; border-top:4px solid #000066; text-align:center; color:#000066" width="10%" |[[ISSS608_2016_17T1_Group1_Report|<font color="#ffffff" size=2><b>REPORT</b></font>]]
 
| style="padding:0.4em; font-size:90%; background-color:#000066;  border-bottom:4px solid #000066; border-top:4px solid #000066; text-align:center; color:#000066" width="10%" |[[ISSS608_2016_17T1_Group1_Report|<font color="#ffffff" size=2><b>REPORT</b></font>]]
 
|}
 
|}
 +
 
 +
 
 +
 
 +
=Background=
  
 +
Singapore is a highly urbanised city state with a good network of public transport.  It is important to understand how the population flows within the city state through the public transport system.  The public transport planners will need to understand the characteristics of the population commuting on the public transport system, especially the public bus system.  The public bus system has a greater reach to majority of the population and is more dynamic in nature as compared to the train system.  By understanding the characteristics of the commuters, it will allow the planners to twist the public bus system in order to make it more efficiency and support the initiative of a car-lite society.
  
 +
 
 +
=Challenges=
  
=Motivation=
+
An Ezlink passenger travel record over 7 days has 50 millions + rows of data and 50+ variables.  The raw data was too large for visualisation tools such as Tableau and D3.JS to read and process.  Even loading the data into JMP proved to be a challenge and on average it took at least 30 mins to read in the csv data and convert into JMP format.  Hence the ability to visualise the raw data was very limited.
  
There is no shortage of visualisation software on the market, both commercial and open-source. However, very few of them emphasize and utilise the science of visualisation to guide the users towards deriving deep insights from the data.
+
=Project Objectives=
<br>
 
We believe that statistical learning can also benefit from better visualisation coupled with interactivity. Business users should be able to perform data modelling on their own, and understand the implications of the decision parameters and assess their models through visualisation as easily as data exploration. With the aim of realising such a capability, we built a web-based tool that is accessible to everyone.
 
  
=Review and Critic on other Visualisation Tools=
+
The objective of the project was to prepare and manipulate the raw data into manageable data size for analysis on visualisation tools such as Tableau and D3.JS.
  
Mention what tableau and SAS can do...
+
     
 
+
=Motivation=
=Selection of Tools=
 
 
 
R scripting language was chosen due to the wide range of statistical learning libraries available. Of equal importance is the availability of Shiny, an R package and web application framework, which can build interactive web applications quickly in the same environment.  Shiny also has a comprehensive list of widgets to implement interactive features such as selection button and input slider. It also allows any User-Interface interactions like click, hover, brush for users to perform deeper exploration of the data.
 
 
 
=Design Framework=
 
 
 
A detail description of the design principles used and data visualisation elements built
 
 
 
=Demonstration=
 
 
 
Sample test cases
 
 
 
=Discussion=
 
 
 
What has the audience learned from your work? What new insights or practices has your system enabled? A full blown user study is not expected, but informal observations of use that help evaluate your system are encouraged.
 
 
 
=Future Work=
 
A description of how your system could be extended or refined.
 
 
 
=Installation guide=
 
No installation is required, you can access the application in the following link: https://oohshinyviztool.shinyapps.io/server<br>
 
 
 
However, the next section shows the steps required to host shiny application on Shinyapps.io.
 
 
 
==Deploy the application online==
 
 
 
You can choose to host the shiny application on your own server. Or you can choose to host it on shinyapps.io by RStudio. The following steps describe the steps to host the application on
 
shinyapps.io.<br>
 
 
 
'''Step 1''': Go to http://www.shinyapps.io/ to sign up for a free account which allows you to host up to 5 applications and utilise 25 active hours.<br><br>
 
'''Step 2''': You will get your personal token and secret number, as well as determine the name of your application link.<br><br>
 
In your Rstudio, <br>
 
'''Step 3''': Key in the following in the console: <br>
 
install.packages('devtools')<br>
 
devtools::install_github('rstudio/shinyapps')<br>
 
devtools::install_github('rstudio/rsconnect')<br>
 
library(shinyapps)<br>
 
library(rsconnect)<br><br>
 
'''Step 4''': Go to Tools -> Global Options -> Publishing -> Connect <br>
 
paste the following in the empty box:
 
shinyapps::setAccountInfo(name="''Your account name''", token="''Personal Token issued''", secret="''Secret number issued''")
 
<br>
 
'''Step 5''': Key in the following in the console: <br>
 
file_directory = 'C:/Users/app'<br>
 
setwd(file_directory)<br>
 
[Note: use the file directory which you store the R codes for the application.]<br><br>
 
'''Step 6''': Key in the following in the console: <br>
 
deployApp(appName="server")<br><br>
 
'''Step 7'''': The application should open on your browser.<br>
 
  
 +
With this project we hope to help the public transport planners to study the commuter's movement on the public bus system.
  
=User Guide=
+
     
Step-by-step guide on how to use the data visualisation functions designed.
+
=Visualisation Modelling=  
  
=References=
+
We are exploring using either D3.JS or R Shiny to create visualisation model such as the Chord Diagram or sparklines to analysis the flow pattern.
 +
 +
 
 +
=Data Sources=

Latest revision as of 22:26, 28 November 2016

PROPOSAL   POSTER   REPORT


Background

Singapore is a highly urbanised city state with a good network of public transport. It is important to understand how the population flows within the city state through the public transport system. The public transport planners will need to understand the characteristics of the population commuting on the public transport system, especially the public bus system. The public bus system has a greater reach to majority of the population and is more dynamic in nature as compared to the train system. By understanding the characteristics of the commuters, it will allow the planners to twist the public bus system in order to make it more efficiency and support the initiative of a car-lite society.


Challenges

An Ezlink passenger travel record over 7 days has 50 millions + rows of data and 50+ variables. The raw data was too large for visualisation tools such as Tableau and D3.JS to read and process. Even loading the data into JMP proved to be a challenge and on average it took at least 30 mins to read in the csv data and convert into JMP format. Hence the ability to visualise the raw data was very limited.

Project Objectives

The objective of the project was to prepare and manipulate the raw data into manageable data size for analysis on visualisation tools such as Tableau and D3.JS.


Motivation

With this project we hope to help the public transport planners to study the commuter's movement on the public bus system.


Visualisation Modelling

We are exploring using either D3.JS or R Shiny to create visualisation model such as the Chord Diagram or sparklines to analysis the flow pattern.


Data Sources