Project Groups
Please provide the project title and an abstract of your project. The abstract should not be more than 350 words. You are also required to include the project blog link and the names of the team members.
Project Team | Project Title/Description | Project Web Blog | Project Member |
---|---|---|---|
|
Understanding Airbnb listings in Australia The abundance of Airbnb data provides a great opportunity to conduct a variety of data analyses to understand the residential short-lease rental market. The dataset, which has been scraped from the Airbnb website and made publicly available by Inside Airbnb, provides geospatial, textual, and quantitative data on each of the listings on the site. This project provides an analytics platform for interested parties (especially non-data specialists) to conduct exploratory spatial data, text, cluster, and regression analysis on the Australia Airbnb dataset using simple and user-friendly interactive dashboards that do not require programming knowledge. |
| |
|
Combatting Greenhouse Gas Emissions through Exploratory and Panel Data Analysis Global warming is expected to result in a rise in the average global temperature of between 1.1 and 6.4 degrees Celsius over the century if no interventions are taken to reduce emissions of greenhouse gases. With this impending global situation, the European Union (EU) leaders committed to an ambitious goal of reducing greenhouse gas emissions by 55% by 2030 to tackle climate change. The availability of a broad range of climate change related statistics on Eurostat allowed our group to investigate the impacts of the drivers and mitigation measures on greenhouse gas emissions for the EU countries. Exploratory analysis will be used to understand the current situation, and panel data analysis will be performed to glean insights on the determinants of greenhouse gas emissions, allowing us to monitor the EU's progress towards achieving its 2030 goal. |
| |
|
Project Title and Abstract Understanding Key Stories Covered In the Media and How Readers Engaged With News As we become increasingly inundated with news from various digital sources, understanding what the key stories are across the digital spectrum is becoming more challenging. As such, we are interested in understanding how best to present a visual snapshot of the key stories covered in local media and in identifying how readers engaged with the news. |
Project Blog Link |
|
|
Visualisation and Analysis of Patient Psychosocial Acuity (VAPPA) The Community Care Team (CCT) of the Singapore General Hospital aims to facilitate person-centered care in the community and to enable patients to remain in the community as long as possible. This is achieved through collaboration with community partners to meet the psychosocial needs of patients with health and social issues. The CCT collects data on patients' socio-demographics, location and psychosocial acuity to understand the psychosocial needs of patients in the community, and to devise targeted intervention strategies to address them. Our project aims to develop an interactive application to enhance the visualisation and analysis of the data collected by the CCT. |
||
|
Predicting whether an individual would go for the H1N1 vaccine Vaccination is a crucial public health measure to flatten the curve in a pandemic. By looking at a dataset that contains the personal demographics and attitudes of respondents in the USA towards H1N1 vaccination, we hope to predict whether an individual would go for the vaccine. |
| |
|
Our Shiny PET: A Predictive, Exploratory and Text Application for Airbnb Data The increasing availability of data has resulted in increased demand for data-driven decisions. Although there is an extensive range of commercial statistical tools, they are often subscription-based and demand good technical knowledge to mine data and draw insights from it. Therefore, they may not appeal to the average user. As such, our project aims to develop a user-friendly application that will enable users to make data-driven decisions without the need to understand programming languages or have extensive statistical knowledge. We will use Airbnb data as our baseline for this project, as the data is rich in information and consists of structured, unstructured (textual), and location data. With this application, users will be able to perform text analysis on review and listing data to generate more quantitative insights. The exploratory module allows users to identify interesting patterns based on selected variables. Findings from the exploratory module will be further augmented in the confirmatory module, where the selection of statistical methods will be guided by the user’s chosen variables. Finally, the predictive module enables users to prepare and build a variety of prediction models without needing an in-depth understanding of the predictive models and their algorithms. |
||
|
Happiness Amidst Covid-19 Covid-19’s impact on workers and workplaces across the globe has been dramatic. Due to the uncertainty and isolation of lockdowns, people have begun to recalibrate what is important to them and what it means to be happy as their understanding of happiness has changed. In this project, we analyse and identify patterns in the Happiness Score, comparing the periods before and after the Covid-19 pandemic in different countries based on the World Happiness Report 2020, with a specific focus on better understanding the impact in Singapore. Our approach includes developing an R-Shiny application for interactive (1) Exploratory Data Analysis, which includes a Choropleth Map, Bubble Plot, Visualising Uncertainty, and Time-series Analysis, and (2) Statistical Analysis, which includes Correlation Analysis, a Multiple Linear Regression Model, and Hierarchical Clustering. |
||
|
Project Title and Abstract |
Project Blog Link |
|
|
Enabling optimization of bike-sharing operations – Bluebikes The advent of shared bikes has provided people with a new way of commuting, and it has been adopted rapidly due to its convenience and low cost. However, there are still some problems at the current stage, such as an over-accumulation of bikes in certain areas, which inconveniences the public. On the flip side, there could be an insufficient supply of bikes at selected stations during peak periods, leading potential users to choose an alternate form of transport. There is also the issue of overused bikes not being maintained or serviced at the right time intervals. There is currently no platform that provides an integrated analytics capability to perform exploratory analysis of the trip data and gather insights to improve operations. This is the gap that our team aims to close. We would like to design an interactive application that will help the executives of Bluebikes to analyze and visualize users’ trip data. This application would serve as the go-to analytics platform for gathering insights on the bike-sharing operations and facilitate decision making on improvement ideas. The objective of this project is to create an app using R-Shiny that will enable Bluebikes to focus on the operational optimization of their bike fleet supply at each of the stations by: • Providing an exploratory and confirmatory interface to analyze bike trip duration and the intensity of bike station activity. • Analyzing the deficit or excess of bikes moving in and out of the numerous bike stations. • Optimizing the utilization rate of their entire bike fleet. • Tracking and determining the right time to perform servicing and maintenance on the bike fleet.
|
| |
|
Understanding Prime Mover (PM) Waiting Time in Yard The objective of this study is to gain insight into Prime Mover (PM) operations at the port and to identify common characteristics exhibited by PMs with high and low waiting times, through an understanding of PM events and operational data. This, in turn, enables us to identify correlated attributes and embark on further study to improve the overall productivity of PM operations and resource utilisation by actively targeting the activities that contribute to PM waiting time. |
| |
|
The Prime Crime Area Spatio-Temporal Analysis Given limited police resources and the possible adverse impact when crime occurs, crime analytics has been practised as far back as the 1800s (Hunt, 2019). Crime occurrence was found to have spatial patterns, and thus predictive analytics should be possible. However, research into whether predictive policing results in lower crime rates has produced mixed results (Meijer & Wessels, 2019). Thus, it is more beneficial to use analytics to determine areas with a higher risk of crime and to discover the underlying factors behind the increased risk. Traditionally, crime analysis is done manually or through a spreadsheet program (RAND Corporation, 2013). Using demographic, socio-economic and crime rate data of the Greater London Region, retrieved from the London Datastore, this project would give users an easier way to do crime analysis using a web application. In this project, three key analyses will be performed:
|
| |
|
Investing 101: A visual and predictive guide for the rookie investor Existing financial data websites such as Yahoo Finance do a good job of providing historical price data and technical indicators, but the beginner investor lacks the knowledge to properly utilise and benefit from these. In addition, we have identified several gaps in such websites. Firstly, these websites do not provide tools that allow the user to compare stocks meaningfully or zoom in on the statistical properties of financial returns. For example, a user is unable to conduct correlation analysis or visualize the distribution of returns. Secondly, these websites do not provide any form of forecasting to aid investors’ decisions. This project aims to improve on the current offering of financial data websites by including the following key modules:
|
| |
|
The Impact of Lifestyle and Family Background on Grades of High School Students For many years, there has been an emphasis on education around the world because of the impact it has on a person, be it in terms of employment opportunities or quality of life. It is hence important to know which factors affect one’s academic performance. While there are many factors that can impact a person’s academic performance, family background and lifestyle are two of the larger ones. Since there are many sub-factors in family background and lifestyle choices, the motivation of this study is to look deeper at these sub-factors to see which have a greater correlation with a student’s grades. More specifically, this study aims to examine the correlation between each factor and a student’s grades, and to build a model that can accurately determine the academic performance of a student. Based on the findings, targeted help may be given to students in the specific areas contributing to poor grades in school, thereby giving them a higher chance of a better future. |
Project Blog Link |
|
|
Project Title and Abstract |
Project Blog Link |
|
|
Project Title and Abstract |
Project Blog Link |
|
|
Project Title and Abstract |
Project Blog Link |
|