Difference between revisions of "Project Groups"

From Visual Analytics and Applications
Jump to navigation Jump to search
m
 
(80 intermediate revisions by 20 users not shown)
Line 41: Line 41:
 
|-
 
|-
 
||
 
||
[[Group01_Overview|Group 1]]
+
<div style="text-align:center;">
 +
[[Group01_Overview|Group 1: The Three Musketeers]]<br><br>[[File:grp01_headerImage.png|209px|center]]
 +
</div>
 
||
 
||
'''Gender Studies: A web-based analytics application for visualizing World Development Indicators'''
+
'''World Development Indicators: A New Visual Perspective'''<br>
 +
''A web-based analytics application to visualize countries development across the globe''
  
As countries start to develop, more attention on Social Economic factors begin to be discusses in greater detail. In fact, the discussion of Social Economics may be deemed as a privilege of Developed countries, as developing countries continue to battle more fundamental economic factors like Gross Domestic Product, Population Growth, Education and so on. Social Economics, as defined, is the study of social phenomenons that make up the society at large. These include topics of discussions, a few of which are listed below:
+
World Development Indicators (WDI) is an extensive and holistic database compiled by World Bank focusing on countries development across the globe. It covers 20 topics with more than 1,300 time series development indicators featuring 214 nations and 38 country group which adds up to more than 7 million data points collected over the span of 56 years.
  
Lesbian Gay Bisexual Transsexual Rights
+
The massive amount of world development data has by far exceeds the ability for students, policymakers, analysts and officials to transform the data into proper visualization for analysing and gaining insight of the global developmental landscape. Thus, creating an adverse impact on the financial and technical assistance World Bank is providing to the developing countries around the world.
Racial Discrimination
+
 
Feminism and Women Equality
+
To address this pressing issue, the team is motivated to design and develop a single-view, dynamic and interactive visual dashboard to provide students, policymakers, analysts and officials a holistic view of the World Development Indicators data collected.
Freedom Religious/Personal practice
 
For our project, we would be focusing on Feminism and Women Equality. in line with the topic, there are many controversial topics of discussions. Such include the Gender Pay Gap, the Patriarchy, Abortion (Women's health care rights), etc, with the liberal impression that Women in developed countries are still suppressed into sub-leading roles, and that society is inherently "male" dominant. Our group hopes to dispel this theory with factual statistics, and as such, change the mindsets of women to "break" out of the "self-victimization" mentality, which has been recognized by groups of Conservative Physiologist as the stumbling block to Gender equality and progression.
 
 
||
 
||
 
*[[Group01_Proposal|Proposal]]
 
*[[Group01_Proposal|Proposal]]
Line 63: Line 64:
 
|-
 
|-
 
||
 
||
[[Group_2_Overview|Group 2]]
+
<div style="text-align:center;">
 +
[[Group_2_Overview|Group 2:Exploring Associations of Geospatial-temporal Factors - A Visual Interactive Toolkit]]<br><br>[[File:Kernel Density.png|209px|center]]
 +
</div>
 
||
 
||
 
'''Environmental Criminology: The Missing "W" in Whodunnit'''
 
'''Environmental Criminology: The Missing "W" in Whodunnit'''
  
Environmental Criminology, an approach first developed in the 1980s, involves the study of geospatial and contextual elements in relation to crime. These elements could involve aspects such as victim movement, and spatial-temporal patterns in the occurrence of crimes. Through the exploration of a crime record dataset provided by the Los Angeles Poice Department (LAPD), this project aims to provide a layman's view of criminology through the lens of geospatial-themed and statistical visual analytics.  
+
With increased availability of crime data rich with geospatial-temporal variables, exploratory, statistical and predictive analytics can be leveraged on to understand crime occurences with the lens of environmental criminology. The application produced from this research leverages on previous works on analysing interaction and associations amongst crime data variables that is supplemented with the population data. With Los Angeles city crimes used as our case study, we demonstrate how results from various analytical methods can be displayed visually and intuitively for exploration by the casual user with interactivity catered to potential varying needs. In particular, the application displayed exploratory and predictive statistcal analytics results using radar charts, calendar plot, choropleths, small multiples of choropleths, multimodal network graphs, heat maps and geographical maps.  
 
 
<motivation>
 
<features>
 
  
 
||
 
||
 
*[[Grp2_Proposal|Proposal]]
 
*[[Grp2_Proposal|Proposal]]
 
*[[Group02_Report|Report]]
 
*[[Group02_Report|Report]]
*[<Shiny App link>]
+
*[https://rchlt.shinyapps.io/va-g2-lightapp/ Light Version of App^]
 +
*[https://tinyurl.com/yaapkvx7/ Code for full-scale App]
 +
*[https://tinyurl.com/ybmxe3d8/ Data for full-scale App]
 
*[[Group02_Poster|Poster]]
 
*[[Group02_Poster|Poster]]
 +
 +
^ Light Version contains 10 months of data (Jan, Feb, Aug, Sep, Dec for 2016 and 2017)
 +
 
||
 
||
 
* Matilda Tan Ying Xuan
 
* Matilda Tan Ying Xuan
Line 83: Line 88:
 
|-
 
|-
 
||
 
||
[[Group_4_Overview|Group 4: A tale of Bitcoin]]
+
[[Group_3_Overview|Group 3: Shiny-GWR Geovisual Analytics Application]]<br><br>[[File:Icont3.JPG|209px|center]]
 +
||
 +
 
 +
'''Building a geo-visualization application to analyse district economy in east region of China with geographically weighted regression (GWR) technique'''
 +
 
 +
Geospatial analysis was developed for problems in the environmental and life sciences, which has currently extended to almost all industries including economy, defence, utilities, social sciences, and public safety.  The application of geo-visualization using geographically weighted regression (GWR) is an exploratory technique mainly intended to indicate where non-stationarity is taking place on the map. It is a good exploratory analytical tool which creates a set of location based parameter estimates, able to be mapped and analysed to give spatial information for the relationship of explanatory variables and response variable.
 +
 
 +
 
 +
Our study uses economical data to explore district GDP condition in northern region of China. The project scope covers the analysis, model and visual representation of multivariate factors like GDP,Industry Output, Usual Residence,Average Wage,Area,City Construction Rate,No. of higher institution, and ratio of Teacher/Student which contributes to economical development in each city area of the province or municipality with the assistance of interactive charts and graphs.
 +
 
 +
||
 +
*[[Group_3_Proposal|Proposal]]
 +
*[[Group_3_Application|Application]]
 +
*[[Group_3_Poster|Poster]]
 +
*[[Group_3_Report|Report]]
 +
||
 +
* Xiao Zhenyu
 +
* Chen Zhengjian
 +
* Zheng Mianyi
 +
|-
 +
||
 +
[[Group_4_Overview|Group 4: A tale of Bitcoin]]<br><br>[[File:Bitcoin.png|209px|center]]
 
||
 
||
  
'''Ever wonder how far bitcoin value could go?'''
+
'''Ever wondered how far bitcoin's value could go?'''
 +
 
 +
Bitcoin has recently garnered mixed reviews from two extreme ends, from China banning bitcoin to Chicago Mercantile Exchange supporting the futures trading of bitcoin. There are even more varying opinions from big investment banks to regulators. All this recent excitement is due to bitcoin’s value rising by more than 700% (as of October 2017) from the start of 2017.
  
Bitcoin has recently garnered mixed reviews from two extreme ends. From China banning bitcoin to Chicago Mercantile Exchange supporting the futures trading of bitcoin. And there are plenty more views from big investment banks as well as regulators. All these recent excitement is due to bitcoin’s value rising more than 700% since the start of January 2017.
+
It is very tempting to speculate that the price will continue to go up. If it does, by how much? If it doesn’t, how hard will it fall? How is its relative performance compared to other instruments? There are many more questions from both investors as well as curious academics alike. This paper’s focus will be on the following:
  
 +
# price movement patterns and trends; and
 +
# the risk and return profile of bitcoin
  
It is very tempting to speculate that the price will continue to go up. If it does, by how much? If it doesn’t, how hard will it fall? How is the relative performance compared to other instruments? There are many more questions from both investors and curious academics alike. This paper seeks to answer some of these questions through visualisation techniques by employing the R technology.
 
  
 +
The approach taken to answering these question is through various visualisation techniques built in R.
  
In this project, we will perform limited technical analysis, time series analysis and return analysis through the usage of R. There are various packages which proof useful when it comes to quick data crunching and visually pleasing presentation. The data what we will be using is per minute close bitcoin price since 2009. Apart from price, we will also be comparing price information of indexes and other commonly traded commodities as a return comparison against bitcoin.
 
  
  
 
||
 
||
 
*[[Group_4_Overview|Proposal]]
 
*[[Group_4_Overview|Proposal]]
 
+
*[[Group_4_Application|Application]]
 +
*[[Group_4_Poster|Poster]]
 +
*[[Group_4_Report|Report]]
 
||
 
||
 
* DENG Yuetong
 
* DENG Yuetong
Line 107: Line 138:
 
|-
 
|-
 
||
 
||
[[Group_5_Overview|Group 5]]
+
<div style="text-align:center;">
 +
[[ISSS608_2017-18_T1_Group5_Report|Group 5: Aviation Expansion]]<br><br>[[File:G5_pic.jpg|250px|center]]
 +
</div>
 
||
 
||
'''How civil aviation contributes to the “Belt and Road” initiative?'''
+
'''Linking the globe: An Interactive Dashboard for Exploring Aviation Expansion Along the "Belt and Road"'''<br>
 +
''How civil aviation contributes to the “Belt and Road” initiative?''<br>
  
The Belt and Road (B&R) refers to the land-based "Silk Road Economic Belt" and the seagoing "21st Century Maritime Silk Road". The routes cover more than 60 countries and regions from Asia to Europe via Southeast Asia, South Asia, Central Asia, West Asia and the Middle East. The B&R countries currently accounting for around 31% of global GDP and more than 34% of the world's merchandise trade.
+
The Belt and Road (B&R) refers to the land-based "Silk Road Economic Belt" and the seagoing "21st Century Maritime Silk Road". Unveiled in 2013, the strategy underlines China's push to take a larger role in global affairs with a China-centered trading network by reinvigorating the seamless flow of capital, goods and services between Asia and the rest of the world. with aim of promoting further market integration and forging new ties among communities, the routes cover more than 60 countries and regions from Asia to Europe via Southeast Asia, South Asia, Central Asia, West Asia and the Middle East.<br><br>
Unveiled in 2013, the strategy underlines China's push to take a larger role in global affairs with a China-centered trading network. The Belt and Road initiative is set to reinvigorate the seamless flow of capital, goods and services between Asia and the rest of the world, by promoting further market integration and forging new ties among communities.
 
  
 +
By investigating all air-routes and flights between China and the “Belt and Road” countries between 2013 and 2017, the main purpose of this project is to explore the growing trend, regional connectivity and development potential in this aviation network. <br><br>
  
The main purpose of this project is to investigate the flight network between China and the “Belt and Road” countries to explore the economic connectivity and development potential.
+
A web-based visual analytics tool is implemented using R shiny with Leaflet package which can be used to easily explore and understand the flight network.
A visual exploration tool will be implemented using R shiny which can be used to easily explore and understand the flight network.
 
  
 
||
 
||
 
*[[ISSS608_2017-18_T1_Group5_Proposal|Proposal]]
 
*[[ISSS608_2017-18_T1_Group5_Proposal|Proposal]]
 +
*[[ISSS608_2017-18_T1_Group5_Report|Report]]
 
*[[ISSS608_2017-18_T1_Group5_Poster|Poster]]
 
*[[ISSS608_2017-18_T1_Group5_Poster|Poster]]
 
*[[ISSS608_2017-18_T1_Group5_Application|Application]]
 
*[[ISSS608_2017-18_T1_Group5_Application|Application]]
*[[ISSS608_2017-18_T1_Group5_Report|Report]]
 
 
||
 
||
 
* Wang Rui
 
* Wang Rui
Line 131: Line 164:
 
|-
 
|-
 
||
 
||
[[Group_6_Overview|Group 6: How Is Beijing Today?]]
+
<div style="text-align:center;">
 +
[[Group_6_Overview|Group 6: Beijing Air Quality]]<br><br>[[File: Air3.jpg|209px|center]]
 +
</div>
 
||
 
||
  
'''Use the air quality indicators to save us'''
+
'''How is Beijing Air Quality in 2017?'''
  
 
On Nov 4th, Beijing Environmental Protection Agency released the news, owing to the adverse weather conditions and early winter heating as well as other factors, it is expected that there will be a continuous 4-day regional heavily polluted air quality in Beijing-Tianjin-Hebei and surrounding areas on November 4th, in addition, the air quality in some cities may reach serious pollution level….
 
On Nov 4th, Beijing Environmental Protection Agency released the news, owing to the adverse weather conditions and early winter heating as well as other factors, it is expected that there will be a continuous 4-day regional heavily polluted air quality in Beijing-Tianjin-Hebei and surrounding areas on November 4th, in addition, the air quality in some cities may reach serious pollution level….
  
  
“Why is China’s smog so bad now?”, a lot of people from overseas want to explore. With the rapid development of economy in China, news from China is more frequently commented in the globe. China’s air pollution has been a serious issue for more than 10 years, with the problem appealing more attention worldwide, the Chinese government has make big efforts to solve it.
+
Beijing, one of the most serious polluted city, which is also the capital of China. Along with the escalation of air pollution, most people who are working and living in Beijing are faced with the tracheitis, pneumoconiosis, asthma, to name just a few. Gradually, a lot of people are terrified with living and working in Beijing.
  
  
Beijing, one of the most serious polluted city, which is also the capital of China. Along with the escalation of air pollution, most people who are working and living in Beijing are faced with the tracheitis, pneumoconiosis, asthma, to name just a few. Gradually, a lot of people are terrified with living and working in Beijing.
+
In our project, we make efforts to visualize and analyze Beijing air quality according to its main existing indicators, such as AQI, NO, SO2, CO, PM2.5, etc.. To better display the visualization results, we utilize R Shiny Dashboard to make the part of the page design. Then, through exerting the r package of ggplot2, we visualize the fluctuation of AQI and frequency of AQI level. Besides, we generate the spider chart which shows the severity of each pollutant by using fmsb this package. We also display raster map and geofacet line graphs for 8 main view points through using the packages of ggplot2, maptools, gstat, raster, geofacet.  
  
  
In our project, we mean to apply the tools and better visualize the changes of air quality according its existing indicators. We will show the historical AQI, the pollutant concentrations and trend charts by pollutants, forecast the further air quality in the different view point in Beijing. We hope that we can try our best to show the weather condition in hand and assist people in keep them in good health.
+
All in all, we hope that we can try our best to show the air quality, and make people clearly know more about the surroundings they are living in as well as raise public environmental awareness.
  
 
||
 
||
Line 157: Line 192:
 
* Zhang Lidan
 
* Zhang Lidan
 
|-
 
|-
 +
 
|-
 
|-
 
||
 
||
[[Group_8_Overview|Group 8]]
+
<div style="text-align:center;">
 +
[[Group 7 Overview|Group 7: Bike Sharing]]<br><br>[[Image:Shareing_bicycle.png|250px|center]]
 +
</div>
 
||
 
||
'''Singapore CPI Index: A Detalied Data Analysis'''
 
  
The Consumer Price Index (CPI) is an economic measure that reflects the prices of consumer goods and services a country's citizens consume for day to day living. Understanding it gives a clear overview of how the country’s standard of living is and its short-term forecasts can have deep ramifications. It is often used for tuning Governmental policies to steer inflation rates in Singapore, for potential migrants to assess the country’s living patterns, and for foreign investors to consider allocating potential investment funds into the country.
+
'''Bike Sharing'''
 +
 
 +
Pronto Cycle Share, branded as Pronto!, was a public bicycle sharing system in Seattle, Washington, that operated from 2014 to 2017. The system, owned initially by a non-profit and later by the Seattle Department of Transportation, included 58 stations in the city's central neighbourhoods and above 500 bicycles.
 +
 
 +
Bike-sharing is a short distance transportation for people to make their life more convenient. When people use shared-bike, they can borrow and return bikes at any stations in the service station. Some stations have too many incoming bike and get jammed without enough docks for upcoming bikes, while some other stations get empty quickly and lack enough bikes for people to check out.  
  
However, to merely glance at the present CPI values provide limited information and its real value lies in its forecast. Given the myriad possibilities of how the market forces work, the CPI can be difficult to predict, especially in the long term. Analysts who are interested in the country’s growth and outlook would need to apply many variations of parameters to find the forecast that is nearest to its future realization.
+
'''Which station has the most passenger flow?'''
  
To tackle this difficulty, we propose to create an easy-to-use system that is assessible even to the uninitiated analyst. The system should allow exploratory functions to decompose the CPI time series data to its constituent parts, namely Seasonality, Trend, and Random (Noise). It should also generate several potential forecasting models (such as Exponential Smoothing and ARIMA) to predict the CPI using a combination of predetermined parameters and grid search optimization. The system will be programmed using the ''R Shiny'' package, and several useful supportive packages such as: ''Lubridate'' and ''Timetk'' (for date-time data manipulation), ''Forecast'' and ''Sweep'' (for easy forecasting), and ''ggplot2'' (for amazing time-series visualizations).
+
In our project, we calculate the in degree an out degree for each station, to help user to understand how passenger usually use bike sharing service through each station point. We also divided time range into different periods, the data users can see much more details in yearly, monthly, even hourly. So that they can understand better the bike usage pattern.
  
To extend our system further, we also allow other forms of time series data to be displayed and their forecasts compared using the given forecasting methods. Other example time-series data sources that are supported could include Singapore import/export prices, currency exchange rate information with other countries, and even Singapore Certificate-of-Entitlement (COE) price changes.
+
'''How to re-distribute bike at a lower cost?'''
 +
 
 +
Because passenger will take the bike from one station to another station everyday, Company should ask employees to re-distribute bike among existent stations. In our project, we use the real map, cooperated with the degree data calculated before, to visualize a shortest path to help employees re-distribute bike in a more efficient way.
 +
 
 +
||
 +
*[[G7 Project Introduction|Proposal]]
 +
*[[G7 Poster|Poster]]
 +
*[[G7 Application|Application]]
 +
*[[G7 Report|Report]]
 +
||
 +
* Zhang Peng
 +
* Wang Shang
 +
|-
 +
 
 +
|-
 +
||
 +
[[Group_8_Overview|Group 8: Time Series Explorer]]<br><br>[[File: Group8ProjectBanner.png|350px|center]]
 +
||
 +
'''Time-series Explorer: Building interactive data visualisation for time series analysis'''
 +
 
 +
Time-series analysis is a time and effort consuming endeavour. As budding data analysts, we spent considerable resources in experimenting with many variations of parameter configurations to analyse time-series data. This difficulty stems from the lack of automatic tools that can help calculate the optimized time-series parameters during model training. To tackle this challenge, we created an easy-to-use time-series exploration system that is accessible even to the uninitiated analyst. The system is able to decompose the time series data to its constituent parts, namely Seasonality, Trend and Random (Noise). It can generate several forecasting models, using Exponential Smoothing and ARIMA analysis techniques, to predict future time periods using optimization techniques. The system also allows other forms of time series data to be displayed and their forecasts compared using the given forecasting methods, within certain formats. To test the system capabilities, we adopted the Singapore Consumer Price Index (CPI) as our use case. The CPI, with its short-term forecasts, is often used for tuning Governmental policies to steer inflation rates in countries like Singapore and for foreign investors to consider allocating potential investment funds into the country.
  
 
||
 
||
Line 182: Line 243:
 
|-
 
|-
 
||
 
||
[[Group10_Overview|Group 10]]
+
[[Group10_Overview|Group 10:China Property Trend]]<br><br>[[File:Geocluster.jpeg|209px|center]]
 
||
 
||
 
'''China property analysis'''
 
'''China property analysis'''
 +
 
The real-estate market is ever growing and has more stakeholders. We are here to build an app that makes an analysis of the housing prices market an easy and effective one by just a few clicks and hovering around. This way allowing the major stakeholders perform their analysis and plan their decisions more efficiently.
 
The real-estate market is ever growing and has more stakeholders. We are here to build an app that makes an analysis of the housing prices market an easy and effective one by just a few clicks and hovering around. This way allowing the major stakeholders perform their analysis and plan their decisions more efficiently.
  
We have used various packages such as: that allowed users to model and visualize the housing prices indexes in different ways for different purposes.
+
We have used various packages such as'Recharts','Timekt','Sweep','ggplot2' that allowed users to model and visualize the housing prices indexes in different ways for different purposes.
  
 
Time Series Analysis-The application will allow the user to choose the City they are interested in and the time period they want to look at. The trend of the prices during that period will be provided.This is built for analysts and agents and government officials who would like to know on the performance at a certain period of time and also a comparative study between different cities. This way they can find any outliers or a particular pattern in the indices. The time series is related to the economic policy and the effect is stressed based on the chosen policy time period.
 
Time Series Analysis-The application will allow the user to choose the City they are interested in and the time period they want to look at. The trend of the prices during that period will be provided.This is built for analysts and agents and government officials who would like to know on the performance at a certain period of time and also a comparative study between different cities. This way they can find any outliers or a particular pattern in the indices. The time series is related to the economic policy and the effect is stressed based on the chosen policy time period.
Line 193: Line 255:
 
Cluster Analysis – We further develop some clusters of the cities based on their housing index reaction. This way we can group the cities whose housing market behave/ respond to the market in a similar way. The government officials and the local agents understand the markets better and plan their policies better. A waiver or cluster development centric policy can be made by the government.
 
Cluster Analysis – We further develop some clusters of the cities based on their housing index reaction. This way we can group the cities whose housing market behave/ respond to the market in a similar way. The government officials and the local agents understand the markets better and plan their policies better. A waiver or cluster development centric policy can be made by the government.
  
Forcast Analysis: Forecast analysis is done using Geofacet that we can compare the forecasted prices between different region of the country
+
Forcast Analysis: Forecast analysis is done using Geofacet that we can compare the forecasted prices between the different region of the country. Geofaceting arranges a sequence of plots of data for different geographical entities into a grid that strives to preserve some of the original geographical orientation of the entities.
  
 
This app can be applied to any other economic variable in China. This will be greatly helpful for economists, agents and government officials to look into the specific data and make some judgments and decisions based on it.
 
This app can be applied to any other economic variable in China. This will be greatly helpful for economists, agents and government officials to look into the specific data and make some judgments and decisions based on it.
  
 
||
 
||
 +
*[[Group10_Overview|Proposal]]
 +
*[[Group_10_Poster|Poster]]
 +
*[[Group_10_Application|Application]]
 +
*[[Group_10_Report|Report]]
 +
||
 +
* Aishwarya Mohan
 +
* Deng Chunling
 +
* Ma Xiaoliu
 +
|-
 +
 
|-
 
|-
 
||
 
||
[[Group12_Overview|Group 12]]
+
[[Group_11_Overview|Group 11: CrimeModeler: A Visually-Driven Geospatial Modelling Tool for Crime Applications]]<br><br>[[File:police.jpeg|220px|center]]
 +
||
 +
'''CrimeModeler: A Visually-Driven Geospatial Modelling Tool for Crime Applications'''
 +
 
 +
Based on UN’s Survey of Crime Trends published in 2006, England and Wales have one of the highest crime rates among OECD countries. We have developed CrimeModeler, a geospatially modelling tool to investigate the spatial variation of crime across different districts in England and Wales, and the relationship between crime and socio-economic characteristics for each district. As it is common for neighbouring regions to have correlation in their crime rate, we compare the use of geographically weighted regression (GWR) and conventional (or global) multiple regression model to see whether a better result can be obtained from GWR. We will also investigate whether there are certain variables that have an impact on crime rate in one area but not in another. Local governments may use this information to come up with better policies to tackle crime.
 +
 
 +
 
 +
||
 +
*[[Group11 Proposal|Proposal]]
 +
*[[Group11 Poster|Poster]]
 +
*[http://crime.raymondfoo.host/ Application]
 +
*[[Group11 Report|Report]]
 +
||
 +
* Raymond FOO Celong
 +
* Anthony GOH Jun Jie
 +
* Karan Jyoti KHANNA
 +
|-
 +
 
 +
||
 +
<div style="text-align:center;">
 +
[[Group12_Proposal|Group 12:Cross Shareholding]]<br><br>[[File:Group12Title.JPG|209px|center]]
 
||
 
||
 
'''Cross Shareholding'''
 
'''Cross Shareholding'''
 +
 
Cross shareholding is a situation in which a corporation owns stock in another company. So, technically, corporations own securities issued by other corporations. Cross shareholding can lead to double counting, whereby the equity of each company is counted twice when determining value. When double counting occurs, the security's value is counted twice, which can result in estimating the wrong value of the two companies.
 
Cross shareholding is a situation in which a corporation owns stock in another company. So, technically, corporations own securities issued by other corporations. Cross shareholding can lead to double counting, whereby the equity of each company is counted twice when determining value. When double counting occurs, the security's value is counted twice, which can result in estimating the wrong value of the two companies.
  
Line 210: Line 303:
  
 
||
 
||
*[[Group12_Proposal|Proposal]]
+
*[[Group12 Proposal|Proposal]]
*[[Group12_Poster|Poster]]
+
*[[Group12 Poster|Poster]]
*[LINK]
+
*[https://koreastockcrossholding.shinyapps.io/ksch1203/ Application]
*[[Group12_Report|Report]]
+
*[[Group12 Report|Report]]
 
||
 
||
 
* KYONG HWAN KIM
 
* KYONG HWAN KIM
Line 219: Line 312:
 
* HE ZIWEN
 
* HE ZIWEN
 
|-
 
|-
|-
 
||
 
{| class="wikitable centered" width="90%"
 
!Team
 
!colspan="18"|Members
 
  
|-
+
 
|width="10%"|<!-- Team --> [[Group_1_Overview| Group 1]]
 
|colspan="3"|<!-- First team member  --> GE BIN
 
|colspan="3"|<!-- Second team member --> LIM LIANG DANNY
 
|colspan="3"|<!-- Third team member -->  RYAN CHIA
 
|-
 
|width="10%"|<!-- Team --> [[Group_2_Overview| Group 2]]
 
|colspan="3"|<!-- First team member  --> Rachel Tong
 
|colspan="3"|<!-- Second team member --> Nurul Asyikeen Binte Azhar
 
|colspan="3"|<!-- Third team member -->  Matilda Tan Ying Xuan
 
|-
 
|width="10%"|<!-- Team --> [[Group_3_Overview|Group 3]]
 
|colspan="3"|<!-- First team member  --> CHEN ZHENGJIAN
 
|colspan="3"|<!-- Second team member --> XIAO ZHENYU
 
|colspan="3"|<!-- Third team member -->  ZHENG MIANYI
 
|-
 
|width="10%"|<!-- Team --> [[Group_4_Overview|Group 4]]
 
|colspan="3"|<!-- First team member  --> Yau Hon Tak
 
|colspan="3"|<!-- Second team member --> Deng Yuetong
 
|colspan="3"|<!-- Third team member --> 
 
|-
 
|width="10%"|<!-- Team --> [[ISSS608_2017-18_T1_Group5_Proposal|Group 5]]
 
|colspan="3"|<!-- First team member  --> Wang Rui
 
|colspan="3"|<!-- Second team member --> Wu Yuqing
 
|colspan="3"|<!-- Third team member -->  Xing Siyuan
 
|-
 
|width="10%"|<!-- Team --> [[Group_6_Overview|Group 6]]
 
|colspan="3"|<!-- First team member  --> Wang Yizhou
 
|colspan="3"|<!-- Second team member --> Zhou Chen
 
|colspan="3"|<!-- Third team member -->  Zhang Lidan
 
|-
 
|width="10%"|<!-- Team --> [[Group_7_Overview|Group 7]]
 
|colspan="3"|<!-- First team member  --> Zhang Peng
 
|colspan="3"|<!-- Second team member --> Wang Shang
 
|colspan="3"|<!-- Second team member -->
 
|-
 
|width="10%"|<!-- Team --> [[Group_8_Overview| Group 8]]
 
|colspan="3"|<!-- First team member  --> Fam Guo Teng
 
|colspan="3"|<!-- Second team member --> Wang Yuchen
 
|colspan="3"|<!-- Third team member -->  Xu Yanru
 
|-
 
|width="10%"|<!-- Team --> [[Group_10_Overview|Group 10]]
 
|colspan="3"|<!-- First team member  --> Ma Xiaoliu
 
|colspan="3"|<!-- Second team member --> Deng Chunling
 
|colspan="3"|<!-- Third team member -->  AISHWARYA MOHAN
 
|-
 
|width="10%"|<!-- Team --> [[Group_11_Overview|Group 11]]
 
|colspan="3"|<!-- First team member  --> FOO CELONG RAYMOND
 
|colspan="3"|<!-- Second team member --> GOH JUN JIE ANTHONY
 
|colspan="3"|<!-- Third team member -->  KARAN JYOTI KHANNA
 
|-
 
|width="10%"|<!-- Team --> [[Group_12_Overview|Group 12]]
 
|colspan="3"|<!-- First team member  --> HE ZIWEN
 
|colspan="3"|<!-- Second team member --> GONGQIANG
 
|colspan="3"|<!-- Third team member -->  KYONG HWAN KIM
 
 
|}
 
|}

Latest revision as of 11:51, 4 December 2017

Vaa1.jpg ISSS608 Visual Analytics and Applications

About

Weekly Session

Assignments

Visual Analytics Project

Course Resources

 


Project Groups

Please change Your Team name to your project topic and change student name to your own name

Project Team Project Title/Description Project Artifacts Project Member

World Development Indicators: A New Visual Perspective
A web-based analytics application to visualize countries development across the globe

World Development Indicators (WDI) is an extensive and holistic database compiled by World Bank focusing on countries development across the globe. It covers 20 topics with more than 1,300 time series development indicators featuring 214 nations and 38 country group which adds up to more than 7 million data points collected over the span of 56 years.

The massive amount of world development data has by far exceeds the ability for students, policymakers, analysts and officials to transform the data into proper visualization for analysing and gaining insight of the global developmental landscape. Thus, creating an adverse impact on the financial and technical assistance World Bank is providing to the developing countries around the world.

To address this pressing issue, the team is motivated to design and develop a single-view, dynamic and interactive visual dashboard to provide students, policymakers, analysts and officials a holistic view of the World Development Indicators data collected.

  • GE Bin
  • Ryan CHIA
  • Danny LIM

Environmental Criminology: The Missing "W" in Whodunnit

With increased availability of crime data rich with geospatial-temporal variables, exploratory, statistical and predictive analytics can be leveraged on to understand crime occurences with the lens of environmental criminology. The application produced from this research leverages on previous works on analysing interaction and associations amongst crime data variables that is supplemented with the population data. With Los Angeles city crimes used as our case study, we demonstrate how results from various analytical methods can be displayed visually and intuitively for exploration by the casual user with interactivity catered to potential varying needs. In particular, the application displayed exploratory and predictive statistcal analytics results using radar charts, calendar plot, choropleths, small multiples of choropleths, multimodal network graphs, heat maps and geographical maps.

^ Light Version contains 10 months of data (Jan, Feb, Aug, Sep, Dec for 2016 and 2017)

  • Matilda Tan Ying Xuan
  • Nurul Asyikeen Binte Azhar
  • Rachel Tong
Group 3: Shiny-GWR Geovisual Analytics Application

Icont3.JPG

Building a geo-visualization application to analyse district economy in east region of China with geographically weighted regression (GWR) technique

Geospatial analysis was developed for problems in the environmental and life sciences, which has currently extended to almost all industries including economy, defence, utilities, social sciences, and public safety. The application of geo-visualization using geographically weighted regression (GWR) is an exploratory technique mainly intended to indicate where non-stationarity is taking place on the map. It is a good exploratory analytical tool which creates a set of location based parameter estimates, able to be mapped and analysed to give spatial information for the relationship of explanatory variables and response variable.


Our study uses economical data to explore district GDP condition in northern region of China. The project scope covers the analysis, model and visual representation of multivariate factors like GDP,Industry Output, Usual Residence,Average Wage,Area,City Construction Rate,No. of higher institution, and ratio of Teacher/Student which contributes to economical development in each city area of the province or municipality with the assistance of interactive charts and graphs.

  • Xiao Zhenyu
  • Chen Zhengjian
  • Zheng Mianyi
Group 4: A tale of Bitcoin

Bitcoin.png

Ever wondered how far bitcoin's value could go?

Bitcoin has recently garnered mixed reviews from two extreme ends, from China banning bitcoin to Chicago Mercantile Exchange supporting the futures trading of bitcoin. There are even more varying opinions from big investment banks to regulators. All this recent excitement is due to bitcoin’s value rising by more than 700% (as of October 2017) from the start of 2017.

It is very tempting to speculate that the price will continue to go up. If it does, by how much? If it doesn’t, how hard will it fall? How is its relative performance compared to other instruments? There are many more questions from both investors as well as curious academics alike. This paper’s focus will be on the following:

  1. price movement patterns and trends; and
  2. the risk and return profile of bitcoin


The approach taken to answering these question is through various visualisation techniques built in R.


  • DENG Yuetong
  • YAU Hon Tak


Linking the globe: An Interactive Dashboard for Exploring Aviation Expansion Along the "Belt and Road"
How civil aviation contributes to the “Belt and Road” initiative?

The Belt and Road (B&R) refers to the land-based "Silk Road Economic Belt" and the seagoing "21st Century Maritime Silk Road". Unveiled in 2013, the strategy underlines China's push to take a larger role in global affairs with a China-centered trading network by reinvigorating the seamless flow of capital, goods and services between Asia and the rest of the world. with aim of promoting further market integration and forging new ties among communities, the routes cover more than 60 countries and regions from Asia to Europe via Southeast Asia, South Asia, Central Asia, West Asia and the Middle East.

By investigating all air-routes and flights between China and the “Belt and Road” countries between 2013 and 2017, the main purpose of this project is to explore the growing trend, regional connectivity and development potential in this aviation network.

A web-based visual analytics tool is implemented using R shiny with Leaflet package which can be used to easily explore and understand the flight network.

  • Wang Rui
  • Wu Yuqing
  • Xing Siyuan

How is Beijing Air Quality in 2017?

On Nov 4th, Beijing Environmental Protection Agency released the news, owing to the adverse weather conditions and early winter heating as well as other factors, it is expected that there will be a continuous 4-day regional heavily polluted air quality in Beijing-Tianjin-Hebei and surrounding areas on November 4th, in addition, the air quality in some cities may reach serious pollution level….


Beijing, one of the most serious polluted city, which is also the capital of China. Along with the escalation of air pollution, most people who are working and living in Beijing are faced with the tracheitis, pneumoconiosis, asthma, to name just a few. Gradually, a lot of people are terrified with living and working in Beijing.


In our project, we make efforts to visualize and analyze Beijing air quality according to its main existing indicators, such as AQI, NO, SO2, CO, PM2.5, etc.. To better display the visualization results, we utilize R Shiny Dashboard to make the part of the page design. Then, through exerting the r package of ggplot2, we visualize the fluctuation of AQI and frequency of AQI level. Besides, we generate the spider chart which shows the severity of each pollutant by using fmsb this package. We also display raster map and geofacet line graphs for 8 main view points through using the packages of ggplot2, maptools, gstat, raster, geofacet.


All in all, we hope that we can try our best to show the air quality, and make people clearly know more about the surroundings they are living in as well as raise public environmental awareness.

  • Wang Yizhou
  • Zhou Chen
  • Zhang Lidan

Bike Sharing

Pronto Cycle Share, branded as Pronto!, was a public bicycle sharing system in Seattle, Washington, that operated from 2014 to 2017. The system, owned initially by a non-profit and later by the Seattle Department of Transportation, included 58 stations in the city's central neighbourhoods and above 500 bicycles.

Bike-sharing is a short distance transportation for people to make their life more convenient. When people use shared-bike, they can borrow and return bikes at any stations in the service station. Some stations have too many incoming bike and get jammed without enough docks for upcoming bikes, while some other stations get empty quickly and lack enough bikes for people to check out.

Which station has the most passenger flow?

In our project, we calculate the in degree an out degree for each station, to help user to understand how passenger usually use bike sharing service through each station point. We also divided time range into different periods, the data users can see much more details in yearly, monthly, even hourly. So that they can understand better the bike usage pattern.

How to re-distribute bike at a lower cost?

Because passenger will take the bike from one station to another station everyday, Company should ask employees to re-distribute bike among existent stations. In our project, we use the real map, cooperated with the degree data calculated before, to visualize a shortest path to help employees re-distribute bike in a more efficient way.

  • Zhang Peng
  • Wang Shang
Group 8: Time Series Explorer

Group8ProjectBanner.png

Time-series Explorer: Building interactive data visualisation for time series analysis

Time-series analysis is a time and effort consuming endeavour. As budding data analysts, we spent considerable resources in experimenting with many variations of parameter configurations to analyse time-series data. This difficulty stems from the lack of automatic tools that can help calculate the optimized time-series parameters during model training. To tackle this challenge, we created an easy-to-use time-series exploration system that is accessible even to the uninitiated analyst. The system is able to decompose the time series data to its constituent parts, namely Seasonality, Trend and Random (Noise). It can generate several forecasting models, using Exponential Smoothing and ARIMA analysis techniques, to predict future time periods using optimization techniques. The system also allows other forms of time series data to be displayed and their forecasts compared using the given forecasting methods, within certain formats. To test the system capabilities, we adopted the Singapore Consumer Price Index (CPI) as our use case. The CPI, with its short-term forecasts, is often used for tuning Governmental policies to steer inflation rates in countries like Singapore and for foreign investors to consider allocating potential investment funds into the country.

  • Fam Guo Teng
  • Wang Yuchen
  • Xu Yanru
Group 10:China Property Trend

Geocluster.jpeg

China property analysis

The real-estate market is ever growing and has more stakeholders. We are here to build an app that makes an analysis of the housing prices market an easy and effective one by just a few clicks and hovering around. This way allowing the major stakeholders perform their analysis and plan their decisions more efficiently.

We have used various packages such as'Recharts','Timekt','Sweep','ggplot2' that allowed users to model and visualize the housing prices indexes in different ways for different purposes.

Time Series Analysis-The application will allow the user to choose the City they are interested in and the time period they want to look at. The trend of the prices during that period will be provided.This is built for analysts and agents and government officials who would like to know on the performance at a certain period of time and also a comparative study between different cities. This way they can find any outliers or a particular pattern in the indices. The time series is related to the economic policy and the effect is stressed based on the chosen policy time period.

Cluster Analysis – We further develop some clusters of the cities based on their housing index reaction. This way we can group the cities whose housing market behave/ respond to the market in a similar way. The government officials and the local agents understand the markets better and plan their policies better. A waiver or cluster development centric policy can be made by the government.

Forcast Analysis: Forecast analysis is done using Geofacet that we can compare the forecasted prices between the different region of the country. Geofaceting arranges a sequence of plots of data for different geographical entities into a grid that strives to preserve some of the original geographical orientation of the entities.

This app can be applied to any other economic variable in China. This will be greatly helpful for economists, agents and government officials to look into the specific data and make some judgments and decisions based on it.

  • Aishwarya Mohan
  • Deng Chunling
  • Ma Xiaoliu
Group 11: CrimeModeler: A Visually-Driven Geospatial Modelling Tool for Crime Applications

Police.jpeg

CrimeModeler: A Visually-Driven Geospatial Modelling Tool for Crime Applications

Based on UN’s Survey of Crime Trends published in 2006, England and Wales have one of the highest crime rates among OECD countries. We have developed CrimeModeler, a geospatially modelling tool to investigate the spatial variation of crime across different districts in England and Wales, and the relationship between crime and socio-economic characteristics for each district. As it is common for neighbouring regions to have correlation in their crime rate, we compare the use of geographically weighted regression (GWR) and conventional (or global) multiple regression model to see whether a better result can be obtained from GWR. We will also investigate whether there are certain variables that have an impact on crime rate in one area but not in another. Local governments may use this information to come up with better policies to tackle crime.


  • Raymond FOO Celong
  • Anthony GOH Jun Jie
  • Karan Jyoti KHANNA

Cross Shareholding

Cross shareholding is a situation in which a corporation owns stock in another company. So, technically, corporations own securities issued by other corporations. Cross shareholding can lead to double counting, whereby the equity of each company is counted twice when determining value. When double counting occurs, the security's value is counted twice, which can result in estimating the wrong value of the two companies.

Cross shareholding is very common in corporate world. Sometimes, there can be more than 10 companies involved and it is very difficult for investors and regulators to track who owns how much.

In this project, our group choose 1 or 2 big groups of companies from Korea and China with heavy cross shareholding between each other and conduct visualization and relationship analysis on their networks using R-Shiny so that people can have better picture of these companies’ network and easier to understand relationship between companies.

  • KYONG HWAN KIM
  • GONGQIANG
  • HE ZIWEN