Difference between revisions of "ANLY482 Team wiki: 2015T2 TeamROLL Project Overview"

From Analytics Practicum
Jump to navigation Jump to search
 
(12 intermediate revisions by the same user not shown)
Line 1: Line 1:
 +
<!--Logo-->
 +
<div style="padding-bottom:25px;"> [[File:T(eam)ROLL.png|350px|center]] </div>
 +
 
<!--Header Start-->
 
<!--Header Start-->
 
{|style="background-color:#F5F5F5; color:#ffffff; padding: 10 0 10 0;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0"  |
 
{|style="background-color:#F5F5F5; color:#ffffff; padding: 10 0 10 0;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0"  |
 
| style="padding:0.3em; font-size:100%; background-color:#F5F5F5;  border-bottom:0px solid #F5F5F5; text-align:center; color:#000" width="10%" |
 
| style="padding:0.3em; font-size:100%; background-color:#F5F5F5;  border-bottom:0px solid #F5F5F5; text-align:center; color:#000" width="10%" |
 
   
 
   
<!-- [[Image:teamroll_home.png|30px|link=ANLY482 Team wiki: 2015T2 TeamROLL]] &nbsp; -->
+
[[Image:teamroll_home.png|40px|link=ANLY482 Team wiki: 2015T2 TeamROLL]] &nbsp;  
 
[[ANLY482 Team wiki: 2015T2 TeamROLL |<font color="#000000" size=2><b>HOME</b></font>]]
 
[[ANLY482 Team wiki: 2015T2 TeamROLL |<font color="#000000" size=2><b>HOME</b></font>]]
  
 
| style="border-bottom:0px solid #3D9DD7; background:none;" width="1%" | &nbsp;  
 
| style="border-bottom:0px solid #3D9DD7; background:none;" width="1%" | &nbsp;  
 
| style="padding:0.3em; font-size:100%; background-color:#F5F5F5;  border-bottom:0px solid #3D9DD7; text-align:center; color:#000" width="10%" |  
 
| style="padding:0.3em; font-size:100%; background-color:#F5F5F5;  border-bottom:0px solid #3D9DD7; text-align:center; color:#000" width="10%" |  
<!-- [[Image:teamroll_dance.gif|30px|link=ANLY482 Team wiki: 2015T2 TeamROLL About Us]] &nbsp; -->
+
[[Image:teamroll.png|30px|link=ANLY482 Team wiki: 2015T2 TeamROLL About Us]] &nbsp;
 
[[ANLY482 Team wiki: 2015T2 TeamROLL About Us|<font color="#000000" size=2><b>ABOUT US</b></font>]]
 
[[ANLY482 Team wiki: 2015T2 TeamROLL About Us|<font color="#000000" size=2><b>ABOUT US</b></font>]]
  
 
| style="border-bottom:0px solid #3D9DD7; background:none;" width="1%" | &nbsp;
 
| style="border-bottom:0px solid #3D9DD7; background:none;" width="1%" | &nbsp;
 
| style="padding:0.3em; font-size:100%; background-color:#E0E0E0;  border-bottom:0px solid #3D9DD7; text-align:center; color:#000" width="12%" |  
 
| style="padding:0.3em; font-size:100%; background-color:#E0E0E0;  border-bottom:0px solid #3D9DD7; text-align:center; color:#000" width="12%" |  
<!-- [[Image:teamroll_overview.png|30px|link=ANLY482 Team wiki: 2015T2 TeamROLL Project Overview]] &nbsp; -->
+
[[Image:teamroll_this.png|40px|link=ANLY482 Team wiki: 2015T2 TeamROLL Project Overview]] &nbsp;  
 
[[ANLY482 Team wiki: 2015T2 TeamROLL Project Overview |<font color="#0091EA" size=3><b>PROJECT OVERVIEW</b></font>]]
 
[[ANLY482 Team wiki: 2015T2 TeamROLL Project Overview |<font color="#0091EA" size=3><b>PROJECT OVERVIEW</b></font>]]
  
 
| style="border-bottom:0px solid #3D9DD7; background:none;" width="1%" | &nbsp;
 
| style="border-bottom:0px solid #3D9DD7; background:none;" width="1%" | &nbsp;
 
| style="padding:0.3em; font-size:100%; background-color:#F5F5F5;  border-bottom:0px solid #3D9DD7; text-align:center; color:#000" width="12%" |  
 
| style="padding:0.3em; font-size:100%; background-color:#F5F5F5;  border-bottom:0px solid #3D9DD7; text-align:center; color:#000" width="12%" |  
<!-- [[Image:teamroll_mgmt.png|30px|link=ANLY482 Team wiki: 2015T2 TeamROLL Project Management]] &nbsp; -->
+
[[Image:teamroll_analysis.png|30px|link=ANLY482 Team wiki: 2015T2 TeamROLL Data Analysis]] &nbsp;
 +
[[ANLY482 Team wiki: 2015T2 TeamROLL Data Analysis |<font color="#000000" size=2><b>DATA ANALYSIS</b></font>]]
 +
 
 +
| style="border-bottom:0px solid #3D9DD7; background:none;" width="1%" | &nbsp;
 +
| style="padding:0.3em; font-size:100%; background-color:#F5F5F5;  border-bottom:0px solid #3D9DD7; text-align:center; color:#000" width="12%" |
 +
[[Image:teamroll_mgmt.png|30px|link=ANLY482 Team wiki: 2015T2 TeamROLL Project Management]] &nbsp;
 
[[ANLY482 Team wiki: 2015T2 TeamROLL Project Management |<font color="#000000" size=2><b>PROJECT MANAGEMENT</b></font>]]
 
[[ANLY482 Team wiki: 2015T2 TeamROLL Project Management |<font color="#000000" size=2><b>PROJECT MANAGEMENT</b></font>]]
  
 
| style="border-bottom:0px solid #3D9DD7; background:none;" width="1%" | &nbsp;
 
| style="border-bottom:0px solid #3D9DD7; background:none;" width="1%" | &nbsp;
 
| style="padding:0.3em; font-size:100%; background-color:#F5F5F5;  border-bottom:0px solid #3D9DD7; text-align:center; color:#F5F5F5" width="10%" |  
 
| style="padding:0.3em; font-size:100%; background-color:#F5F5F5;  border-bottom:0px solid #3D9DD7; text-align:center; color:#F5F5F5" width="10%" |  
<!-- [[Image:teamroll_doc.png|30px|link=ANLY482 Team wiki: 2015T2 TeamROLL Documentation]] &nbsp; -->
+
[[Image:teamroll_doc.png|30px|link=ANLY482 Team wiki: 2015T2 TeamROLL Documentation]] &nbsp;
 
[[ANLY482 Team wiki: 2015T2 TeamROLL Documentation | <font color="#000000" size=2><b>DOCUMENTATION</b></font>]]
 
[[ANLY482 Team wiki: 2015T2 TeamROLL Documentation | <font color="#000000" size=2><b>DOCUMENTATION</b></font>]]
 
|}  
 
|}  
 +
 
<!--Header End-->
 
<!--Header End-->
 
<!--Sub-Navigation-->
 
<!--Sub-Navigation-->
 
{| style="background-color:white; color:white ; border:0px solid #4690cd; margin-left: auto; margin-right: auto;" width="1080px" height=50px cellspacing="0" cellpadding="0" valign="top"  |
 
{| style="background-color:white; color:white ; border:0px solid #4690cd; margin-left: auto; margin-right: auto;" width="1080px" height=50px cellspacing="0" cellpadding="0" valign="top"  |
  
| style="padding:0 .3em;  solid #000000;  padding: 10px; text-align:center; background-color:#daeeff; border-right:0px solid #4690cd; " width="33%" | [[ANLY482 Team wiki: 2015T2 TeamROLL Project Overview| <font face = "Arial" color="#3C2415"><b>Proposal </b></font>]]
+
| style="padding:0 .3em;  solid #000000;  padding: 10px; text-align:center; background-color:#daeeff; border-right:0px solid #4690cd; " width="33%" | [[ANLY482 Team wiki: 2015T2 TeamROLL Project Overview| <font face = "Arial" color="#3C2415"><b>Description</b></font>]]
 
 
| style="padding:0 .3em;  solid #000000; text-align:center; background-color:white; border-right:0px solid #4690cd; " width="34%" | [[ANLY482 Team wiki: 2015T2 TeamROLL Project Overview/Midterm| <font face = "Arial" color="#101010"><b>Midterm</b></font>]]
 
  
| style="padding:0 .3em;  solid #000000; padding: 10px; text-align:center; background-color:white;" width="33%" | [[ANLY482 Team wiki: 2015T2 TeamROLL Project Overview/Final| <font face = "Arial" color="#101010"><b>Final</b></font>]]
+
| style="padding:0 .3em;  solid #000000; padding: 10px; text-align:center; background-color:white; border-right:0px solid #4690cd; " width="33%" | [[ANLY482 Team wiki: 2015T2 TeamROLL Project Overview/Description| <font face = "Arial" color="#3C2415"><b>Methodology</b></font>]]
  
|
+
| style="padding:0 .3em;  solid #000000;  padding: 10px; text-align:center; background-color:white; border-right:0px solid #4690cd; " width="33%" | [[ANLY482 Team wiki: 2015T2 TeamROLL Project Overview/Technology| <font face = "Arial" color="#3C2415"><b>Technology </b></font>]]
  
 +
<!--| style="padding:0 .3em;  solid #000000;  padding: 10px; text-align:center; background-color:white; border-right:0px solid #4690cd; " width="33%" | [[ANLY482 Team wiki: 2015T2 TeamROLL Project Overview/Limitations| <font face = "Arial" color="#3C2415"><b>Limitations</b></font>]] -->
 
|}
 
|}
  
 
<!--END OF Sub-Navigation-->
 
<!--END OF Sub-Navigation-->
 
 
<br>
 
<br>
 
<div align="left">
 
<div align="left">
<div style="background: #2196F3; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px; font-size:24px"><font color=#000000></font><font color= #FFFFFF> Overview </font></div>
 
<div style="border-left: #E1F5FE solid 12px; padding: 0px 30px 0px 18px; ">
 
<div style="text-align: justify;"> SGAG, one of Singapore's leading local humour content creators, maintains popular social media sites, an online website and a mobile application. Creating creative content on a daily basis, SGAG has garnered a significant number of followers on the various platforms. With an aim to achieve growth, SGAG hopes to leverage on their rich pool of data and derive valuable insights towards content creation.
 
 
However with limited resources, SGAG could not conduct a comprehensive analysis and harness on the big data available to them. This project aims to uncover valuable insights on SGAG’s content attributes in order to achieve audience growth. Using data gathered from SGAG’s facebook page for the year 2015, the team hopes to firstly, conduct exploratory data analysis so as to identify overall performance trends. Next, the team will be performing cluster analysis followed by sentiment analysis, topic analysis and content analysis. Lastly, the team will be building a regression
 
model, which includes findings derived from the analysis conducted, in order to predict better performing future posts. With the insights gained, the team will be providing recommendations to enable data driven content creation, thus allowing SGAG to achieve their aim of greater growth.
 
</div>
 
</div>
 
 
 
==<div style="background: #2196F3; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px; font-size:24px; border-left: #0D47A1 solid 32px;"><font color="white">About SGAG</font></div>==
 
==<div style="background: #2196F3; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px; font-size:24px; border-left: #0D47A1 solid 32px;"><font color="white">About SGAG</font></div>==
 
<div style="border-left: #E1F5FE solid 12px; padding: 0px 30px 0px 18px; ">
 
<div style="border-left: #E1F5FE solid 12px; padding: 0px 30px 0px 18px; ">
Line 60: Line 58:
 
==<div style="background: #2196F3; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px; font-size:24px; border-left: #0D47A1 solid 32px;"><font color="white">Project Motivation</font></div>==
 
==<div style="background: #2196F3; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px; font-size:24px; border-left: #0D47A1 solid 32px;"><font color="white">Project Motivation</font></div>==
 
<div style="border-left: #E1F5FE solid 12px; padding: 0px 30px 0px 18px; ">
 
<div style="border-left: #E1F5FE solid 12px; padding: 0px 30px 0px 18px; ">
<div style="text-align: justify;"> SGAG’s motto is “to make readers laugh at least 5 times a day, 365 days a year”. As such, SGAG places emphasis on the quality of their 5 daily posts, to ensure that readers will find their posts humorous with a local twist, a good-natured piece of fun without any intention to hurt. Much of their content focuses on a funny stereotype of everyday Singaporean life which locals are able to identify with. Although SGAG has been very successful thus far, it also recognises that the online content space is very competitive, with newer players such as "SMRT Feedback" joining in the fray to generate local humour content. As such, SGAG needs to evaluate and improve their content strategy to ensure they stay entertaining and engaging to their audience. However, SGAG faces a few challenges in understanding and thus, leveraging on, their past success factors, which may be summarised as follows:
+
<div style="text-align: justify;"> SGAG’s motto is “to make readers laugh at least 5 times a day, 365 days a year”. As such, SGAG places emphasis on the quality of their 5 daily posts, to ensure that readers will find their posts humorous with a local twist, a good-natured piece of fun without any intention to hurt. Much of their content focuses on a funny stereotype of everyday Singaporean life which locals are able to identify with. <br>
# What are the characteristics of a “great” post? SGAG has so far thrived on an intuitive understanding of their customer's content preferences. However, SGAG does not have a concrete or clear picture of the kinds of attributes which they can work on to make a specific post a "great" one.
+
Although SGAG has been very successful thus far, it also recognises that the online content space is very competitive, with newer players such as "SMRT Feedback" joining in the fray to generate local humour content. As such, SGAG needs to evaluate and improve their content strategy to ensure they stay entertaining and engaging to their audience.  
# What is audience sentiment on "viral" posts? Are they reacting in a positive or negative manner? SGAG is concerned that "viral" posts become popular because they receive a lot of "hate", which goes against their content philosophy which is to make people "laugh", a positive emotion. Currently, they do not have easy visibility on this aspect.
+
 
SGAG hopes this project will be able to utilise a rich pool of historical data to derive insights into the concerns posed above, so that SGAG would be better able to formulate a more relevant content creation strategy.
 
  
 
</div>
 
</div>
 
</div>
 
</div>
==<div style="background: #2196F3; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px; font-size:24px; border-left: #0D47A1 solid 32px;"><font color="white">Project Objective</font></div>==
+
 
 +
==<div style="background: #2196F3; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px; font-size:24px; border-left: #0D47A1 solid 32px;"><font color="white">Business Questions</font></div>==
 
<div style="border-left: #E1F5FE solid 12px; padding: 0px 30px 0px 18px; ">
 
<div style="border-left: #E1F5FE solid 12px; padding: 0px 30px 0px 18px; ">
 
<div style="text-align: justify;">
 
<div style="text-align: justify;">
The final goal of this project is to offer useful insights for SGAG to formulate a better content creation strategy moving forward. To measure the effectiveness of their content strategy, and at a more granular level, the effectiveness of each individual post, SGAG operationalises effectiveness as "growth" which is defined by an increase in 1) Number of fans, 2) Audience reach, and 3) Engagement with audience members. This last indicator is further measured by the number of times audience members perform actions such as “likes”, “comments”, “shares”, “retweets” or clicking on links to find out more about the content SGAG has to offer.
+
In a nutshell, SGAG's business questions are: <br>
To do so, we attempt to answer the two main challenges posed by SGAG in a concrete, data-driven manner by performing an in-depth analysis on SGAG's historical data. More specifically, we attempt to address the following analysis requirements:
+
# How can we drive audience growth through our creative content? <br>
# To be able to understand whether a post is popular in a “positive” or “negative” manner
+
# What are the characteristics of a “great” post which appeals to our audience? <br>
# To assess the role of content layout and design in improving popularity of posts.  
+
SGAG has so far thrived on an intuitive understanding of their customer's content preferences. However, they do not have a concrete or clear picture of the kinds of attributes which they can work on to make a specific post a "great" one.  
# To develop a list of common topics and be able to understand the role of topic-selection in affecting the popularity of posts
+
They hope that this project will utilise their rich pool of historical data to derive actionable insights into the concerns above, and offer recommendations how SGAG can improve their content creation strategy.
  
</div>
 
</div>
 
==<div style="background: #2196F3; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px; font-size:24px; border-left: #0D47A1 solid 32px;"><font color="white">Data collection and description</font></div>==
 
<div style="border-left: #E1F5FE solid 12px; padding: 0px 30px 0px 18px; ">
 
<div style="text-align: justify;">
 
Our two main datasets are: Facebook Insights Data Export - SGAG - Page Level, and Facebook Insights Data Export - SGAG - Post Level. The datasets are sponsored by SGAG and extracted from the Facebook Insights tool. A year's worth of data from 2015 was extracted. Although SGAG also obtained similar data for the same time period from Twitter through Twitter Analytics, this would not be the focus of our project for the present time.
 
===<div style="background: white; font-weight: bold; font-size:20px"><font color="#0D47A1">Facebook Insights Data Export - SGAG - Page Level</font></div>===
 
This dataset captures key performance indicators of SGAG at the page level. These include variables such as lifetime total likes, new likes, unlikes, number of engaged users, reach, organic reach, number of clicks on content, and number of negative feedback, on the daily level, or aggregated to form weekly and 28 days measures. This dataset also captures information regarding the demographics of SGAG's customers, their ages and gender, as well as their location in terms of countries and cities.
 
===<div style="background: white; font-weight: bold; font-size:20px"><font color="#0D47A1">Facebook Insights Data Export - SGAG - Post Level</font></div>===
 
This dataset similarly captures key metrics of SGAG, but at the post level. Many variables found in the earlier dataset are also reflected in this dataset, but at the post level. We propose that this dataset be our main point of analysis for this project, with the earlier dataset utilised as a supporting analysis.
 
 
</div>
 
</div>
 
</div>
 
</div>
  
==<div style="background: #2196F3; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px; font-size:24px; border-left: #0D47A1 solid 32px;"><font color="white">Work Scope</font></div>==
+
==<div style="background: #2196F3; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px; font-size:24px; border-left: #0D47A1 solid 32px;"><font color="white">Analytical Objectives</font></div>==
 
<div style="border-left: #E1F5FE solid 12px; padding: 0px 30px 0px 18px; ">
 
<div style="border-left: #E1F5FE solid 12px; padding: 0px 30px 0px 18px; ">
 
<div style="text-align: justify;">
 
<div style="text-align: justify;">
Our proposed work scope will focus on the main content distribution channel SGAG currently uses, which is Facebook. This would be where SGAG garners the most reach and engagement from their target audience. We will also be conducting our analysis based on historical Facebook data for the year 2015, which is suitable due to it being relatively recent.
+
Taking into account the analytical problems faced, we aim to: <br>
A step-by-step breakdown of our proposed scope of analysis is as follows:
+
<b> Page-level Data </b><br>
# Data Collection – Collect Facebook data for the year 2015 to be analysed, from SGAG
+
(i) Identify the trend in lifetime total likes <br>
# Data Preparation – Clean and transform  data into a readable CSV for upload
+
* Based on the net change in count <br>
# Exploratory Data Analysis - Identify overall performance trends
+
(ii) Identify the better performing posts <br>
# Cluster Analysis – Perform segmentation of Facebook posts based on their performance in terms of total reach and engagement level (likes, shares, comments)
+
* Based on daily total reach and number of daily engaged users<br>
# Sentiment Analysis – Identify differing sentiments based on posts and clusters
+
(iii) Identify the composition in lifetime total likes <br>
# Topic Analysis - Generate and identify topics based on posts and clusters
+
* Based on gender and age group <br>
# Content Analysis - Identify key design attributes based on posts and clusters
+
<b> Post-level Data </b><br>
# Regression Modelling – Build a regression model that includes success factors derived from analysis, to aid in predicting better performing future posts
+
(i) Identify the better performing posts and weak performing posts<br>
 
+
* Based on clustering engagement level (number of negative feedback against number of likes) <br>
</div>
+
(ii) Identify the time-series patterns (time of day) for post reach<br>
 +
* Based on hour of the day and number of post reach <br>
 +
(iii) Determine relationship between engagement level and the characteristics of a picture post
 +
* Based on the relationship between the number of likes and picture design
 +
** Picture design is based on the number of description lines, characters used (e.g. Animals, Foreign celebrities) and the number of frames
 +
(iv) Determine relationship between engagement level and the topics discussed <br>
 +
* Based on post topics and number of likes
 
</div>
 
</div>

Latest revision as of 23:09, 17 April 2016

T(eam)ROLL.png

Teamroll home.png   HOME

 

Teamroll.png   ABOUT US

 

Teamroll this.png   PROJECT OVERVIEW

 

Teamroll analysis.png   DATA ANALYSIS

 

Teamroll mgmt.png   PROJECT MANAGEMENT

 

Teamroll doc.png   DOCUMENTATION

Description Methodology Technology


About SGAG

SGAG is one of Singapore's leading local humour content creators. Distributing their creative content through multiple platforms, including popular social media sites, an online website, and a mobile app, the team at SGAG creates quality content daily to engage and entertain Singaporeans. With the goal to make "every Singaporean's day a better one", the company was founded in 2012 by two Singapore Management University undergraduates. As of today, SGAG has since gained a loyal following and have reached out to more than 300 000 Facebook users and 120 000 Twitter users, in addition to at least 200 000 users through other social media platforms, mobile apps and websites. Looking forward, SGAG aims to achieve greater growth and reach among their customers, especially for their target customers of Singaporean youths, working adults and young families between the ages of 18 to 34 years old.

Project Motivation

SGAG’s motto is “to make readers laugh at least 5 times a day, 365 days a year”. As such, SGAG places emphasis on the quality of their 5 daily posts, to ensure that readers will find their posts humorous with a local twist, a good-natured piece of fun without any intention to hurt. Much of their content focuses on a funny stereotype of everyday Singaporean life which locals are able to identify with.

Although SGAG has been very successful thus far, it also recognises that the online content space is very competitive, with newer players such as "SMRT Feedback" joining in the fray to generate local humour content. As such, SGAG needs to evaluate and improve their content strategy to ensure they stay entertaining and engaging to their audience.


Business Questions

In a nutshell, SGAG's business questions are:

  1. How can we drive audience growth through our creative content?
  2. What are the characteristics of a “great” post which appeals to our audience?

SGAG has so far thrived on an intuitive understanding of their customer's content preferences. However, they do not have a concrete or clear picture of the kinds of attributes which they can work on to make a specific post a "great" one. They hope that this project will utilise their rich pool of historical data to derive actionable insights into the concerns above, and offer recommendations how SGAG can improve their content creation strategy.

Analytical Objectives

Taking into account the analytical problems faced, we aim to:
Page-level Data
(i) Identify the trend in lifetime total likes

  • Based on the net change in count

(ii) Identify the better performing posts

  • Based on daily total reach and number of daily engaged users

(iii) Identify the composition in lifetime total likes

  • Based on gender and age group

Post-level Data
(i) Identify the better performing posts and weak performing posts

  • Based on clustering engagement level (number of negative feedback against number of likes)

(ii) Identify the time-series patterns (time of day) for post reach

  • Based on hour of the day and number of post reach

(iii) Determine relationship between engagement level and the characteristics of a picture post

  • Based on the relationship between the number of likes and picture design
    • Picture design is based on the number of description lines, characters used (e.g. Animals, Foreign celebrities) and the number of frames

(iv) Determine relationship between engagement level and the topics discussed

  • Based on post topics and number of likes