Difference between revisions of "ANLY482 AY2017-18T2 Group30 Project Overview"

From Analytics Practicum
Jump to navigation Jump to search
 
(9 intermediate revisions by 2 users not shown)
Line 9: Line 9:
 
{|style="background-color:#5A6B96; color:#5A6B96; width="100%" cellspacing="0" cellpadding="10" border="0" |
 
{|style="background-color:#5A6B96; color:#5A6B96; width="100%" cellspacing="0" cellpadding="10" border="0" |
  
|style="font-size:88%; border-left:1px solid #ffffff; border-right:1px solid #ffffff; text-align:center; background-color:#347cc4; " width="10%" | [[ANLY482_AY2017-18_T2_Group_30|<font color="#FFFFFF" size=3><b>HOME</b></font>]]
+
|style="font-size:88%; border-left:1px solid #ffffff; border-right:1px solid #ffffff; text-align:center; background-color:#347cc4; " width="12.5%" | [[ANLY482_AY2017-18_T2_Group_30|<font color="#FFFFFF" size=3><b>HOME</b></font>]]
  
|style="font-size:88%; border-left:1px solid #ffffff; border-right:1px solid #ffffff; text-align:center; background-color:#347cc4; " width="13%" | [[ANLY482_AY2017-18T2_Group30 About Us |<font color="#FFFFFF" size=3><b>ABOUT US</b></font>]]
+
|style="font-size:88%; border-left:1px solid #ffffff; border-right:1px solid #ffffff; text-align:center; background-color:#347cc4; " width="12.5%" | [[ANLY482_AY2017-18T2_Group30 About Us |<font color="#FFFFFF" size=3><b>ABOUT US</b></font>]]
  
|style="font-size:88%; border-left:1px solid #347cc4; border-right:1px solid #347cc4; text-align:center; border-bottom:1px solid #347cc4; border-top:1px solid #347cc4;" width="20%" |[[ANLY482_AY2017-18T2_Group30 Project Overview |<font color="#347cc4" size=3><b>PROJECT OVERVIEW</b></font>]]
+
|style="font-size:88%; border-left:1px solid #347cc4; border-right:1px solid #347cc4; text-align:center; border-bottom:1px solid #347cc4; border-top:1px solid #347cc4;" width="12.5%" |[[ANLY482_AY2017-18T2_Group30 Project Overview |<font color="#347cc4" size=3><b>PROJECT OVERVIEW</b></font>]]
  
|style="font-size:88%; border-left:1px solid #ffffff; border-right:1px solid #ffffff; text-align:center; background-color:#347cc4; " width="18%" | [[ANLY482_AY2017-18T2_Group30 Data Analysis |<font color="#FFFFFF" size=3><b>PROJECT FINDINGS </b></font>]]
+
|style="font-size:88%; border-left:1px solid #ffffff; border-right:1px solid #ffffff; text-align:center; background-color:#347cc4; " width="12.5%" | [[ANLY482_AY2017-18T2_Group30 Data Analysis |<font color="#FFFFFF" size=3><b>EDA </b></font>]]
  
|style="font-size:88%; border-left:1px solid #ffffff; border-right:1px solid #ffffff; text-align:center; background-color:#347cc4; " width="22%" | [[ANLY482_AY2017-18T2_Group30 Project Management |<font color="#FFFFFF" size=3><b>PROJECT MANAGEMENT</b></font>]]
+
|style="font-size:88%; border-left:1px solid #ffffff; border-right:1px solid #ffffff; text-align:center; background-color:#347cc4; " width="12.5%" | [[ANLY482_AY2017-18T2_Group30 Business Objectives |<font color="#FFFFFF" size=3><b>BUSINESS OBJECTIVES </b></font>]]
  
|style="font-size:88%; border-left:1px solid #ffffff; border-right:1px solid #ffffff; text-align:center; background-color:#347cc4; " width="17%" |[[ANLY482_AY2017-18T2_Group30 Documentation | <font color="#FFFFFF" size=3><b>DOCUMENTATION</b></font>]]
+
|style="font-size:88%; border-left:1px solid #ffffff; border-right:1px solid #ffffff; text-align:center; background-color:#347cc4; " width="12.5%" | [[ANLY482_AY2017-18T2_Group30 Project Management |<font color="#FFFFFF" size=3><b>PROJECT MANAGEMENT</b></font>]]
 +
 
 +
|style="font-size:88%; border-left:1px solid #ffffff; border-right:1px solid #ffffff; text-align:center; background-color:#347cc4; " width="12.5%" |[[ANLY482_AY2017-18T2_Group30 Documentation | <font color="#FFFFFF" size=3><b>DOCUMENTATION</b></font>]]
 +
 
 +
|style="font-size:88%; border-left:1px solid #ffffff; border-right:1px solid #ffffff; text-align:center; background-color:#347cc4; " width="12.5%" |[[ANLY482_AY2017-18_Term_2 | <font color="#FFFFFF" size=3><b>MAIN PAGE</b></font>]]
 
|}  
 
|}  
 
</center>
 
</center>
Line 26: Line 30:
 
<br>
 
<br>
  
<div align="center">
+
 
<div style=" width: 85%; background: #E6EDFA; padding: 12px; font-family: Arimo; font-size: 18px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #8c8d94 solid 32px;"><font color="#5A6B96">Project Overview</font></div>
+
==<div style=" width: 96.5%; background: #E6EDFA; padding: 12px;font-family: Arimo; font-size: 18px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #8c8d94 solid 32px;"><font color="#5A6B96">Project Overview</font></div>==
<div style="width:85%;"><font> </font></div>
+
<div style="width:96.5%;"><font> Our group aims to synthesize data from multiple platforms such as Facebook, Instagram, YouTube and blogs to deliver coherent and strategic insights for XYZ Web hosting company to improve their social media outreach as well as plan for content creation. </font></div>
 
<br/>
 
<br/>
  
<div align="center">
+
 
<div style=" width: 85%; background: #E6EDFA; padding: 12px; font-family: Arimo; font-size: 18px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #8c8d94 solid 32px;"><font color="#5A6B96">Motivation</font></div>
+
==<div style=" width: 96.5%; background: #E6EDFA; padding: 12px; font-family: Arimo; font-size: 18px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #8c8d94 solid 32px;"><font color="#5A6B96">Motivation</font></div>==
<div style="width:85%;">
+
<div style="width:96.5%;">
<font>Having published various content types over the years, TheSmartLocal knows what posts perform well and what doesn’t. However, monitoring content engagement over different platforms such as Facebook, Youtube, Instagram and their blogs can be highly complex due to the nature of the posts such as timing, content type, authors. As such, they would like to perform a holistic, cross-platform analysis to quantify and investigate the causes of virality.  
+
<font>Having published various content types over the years, XYZ company knows what posts perform well and what doesn’t. However, monitoring content engagement over different platforms such as Facebook, YouTube, Instagram and their blogs can be highly complex due to the nature of the posts such as timing, content type, authors.  
In addition, they learned that Facebook has recently changed ranking algorithms for posts in December 2017. Specifically, posts that encourage sharing and commenting to participate in competitions will be downgraded in rankings. TSL would like to investigate the impact of this change on their Facebook outreach, and if possible, provide recommendations.
+
 
 +
As such, they would like to perform a holistic, cross-platform analysis to quantify and investigate the causes of virality. In addition, they learned that Facebook has recently changed ranking algorithms for posts in December 2017. Specifically, posts that encourage sharing and commenting to participate in competitions will be downgraded in rankings. XYZ company would like to investigate the impact of this change on their Facebook outreach, and if possible, provide recommendations.
 
</font></div>
 
</font></div>
 
<br/>
 
<br/>
  
<div align="center">
+
 
<div style=" width: 85%; background: #E6EDFA; padding: 12px; font-family: Arimo; font-size: 18px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #8c8d94 solid 32px;"><font color="#5A6B96">Objectives</font></div>
+
==<div style=" width: 96.5%; background: #E6EDFA; padding: 12px; font-family: Arimo; font-size: 18px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #8c8d94 solid 32px;"><font color="#5A6B96">Objectives</font></div>==
<div style="width:85%;">
+
<div style="width:96.5%;">
<font>
+
<font style="text-align: left">
* Gather all available data from TSL across platforms to evaluate possibility of cross-platform social media engagement analysis
+
<b>Milestone 1 Objectives</b>
 +
* Gather all available data from XYZ Web hosting company across platforms to evaluate possibility of cross-platform social media engagement analysis
 
* Conduct preliminary analysis on all datasets to discover possible insights to deliver
 
* Conduct preliminary analysis on all datasets to discover possible insights to deliver
* Synthesize data from all platforms (Facebook, Blog, Instagram, Youtube)
+
* Synthesize data from all platforms (Facebook, Blog, Instagram, YouTube)
* Investigate impact of Facebook’s algorithm change on TSL’s post engagement rates
+
* Investigate impact of Facebook’s algorithm change on XYZ company’s post engagement rates
 
</font></div>
 
</font></div>
 
<br/>
 
<br/>
 +
<div style="width:96.5%;">
 +
<font style="text-align: left">
 +
<b>Milestone 2 Objectives</b>
 +
* Determine how the change in Facebook algorithm affect the videos / post engagement
 +
* Determine which are the types of facebook videos that has high drop out rates
 +
* Determine which are the youtube videos and series that are performing better
 +
* Determine the golden ratio that would best serve as a guideline that reflects organic engagement
 +
</font></div>
  
<div align="center">
+
==<div style=" width: 96.5%; background: #E6EDFA; padding: 12px; font-family: Arimo; font-size: 18px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #8c8d94 solid 32px;"><font color="#5A6B96">Data</font></div>==
<div style=" width: 85%; background: #E6EDFA; padding: 12px; font-family: Arimo; font-size: 18px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #8c8d94 solid 32px;"><font color="#5A6B96">Data</font></div>
+
<div style="width:96.5%;">
<div style="width:85%;">
 
 
<font>
 
<font>
TheSmartLocal will provide us with social media insights data from various platforms (Facebook, Instagram, Youtube, Blog) for the period of January 2017 - December 2017. As of the time of proposal submission, we have only received data for Facebook and Youtube.
+
XYZ will provide us with social media insights data from various platforms (Facebook, Instagram, YouTube, Blog) for the period of January 2017 to December 2017. As of the time of proposal submission, we have only received data for Facebook and YouTube.
 
 
 
</font></div>
 
</font></div>
 
<br/>
 
<br/>
  
<div align="center">
+
 
<div style=" width: 85%; background: #E6EDFA; padding: 12px; font-family: Arimo; font-size: 18px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #8c8d94 solid 32px;"><font color="#5A6B96">Methodology</font></div>
+
==<div style=" width: 96.5%; background: #E6EDFA; padding: 12px; font-family: Arimo; font-size: 18px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #8c8d94 solid 32px;"><font color="#5A6B96">Methodology</font></div>==
<div style="width:85%;">
+
<div style="width:96.5%;">
 
<font>
 
<font>
'''Data Collection'''<br>
+
{| class="wikitable" style="margin: 0 auto; margin-top: 10px;"
We will use the data provided to us by our project sponsor exported from FaceBook Insights, YouTube Analytics. The data comes in the format of Microsoft Excel.
+
|-style="background:#9E9E9E; color:white; text-align: center;"
<br>
+
! style="background:#5A6B96; color:white"|Step(s)
<br> 
+
! style="background:#5A6B96; color:white; width:15%"|Stage
'''Data Preparation'''<br>
+
! style="background:#5A6B96; color:white"|About
As the exported data comprises of multiple tabs and columns in multiple excel files, such as post level or video post, we will attempt to organize the data into consistent formats that are easier for our analysis especially when evaluating cross-platform social media engagement. We will also need to mask confidential information such as the company’s staff names as required by the project sponsor.  
+
|-
<br>
+
| 1
<br>
+
| Data Collection
'''Exploratory Data Analysis'''<br>
+
| We will use the data provided to us by our project sponsor exported from FaceBook Insights, YouTube Analytics. The data comes in the format of Microsoft Excel.
We will examine the entries of the various social media platform for the same campaign. From here, we will be able to discover possible insights to deliver to our sponsors, for example to find out which social media platform is most suitable for video or blog posts.  
+
|-
<br>
+
| 2
<br>
+
| Data Preparation
'''Data Cleaning'''<br>
+
| As the exported data comprises of multiple tabs and columns in multiple excel files, such as post level or video post, we will attempt to organize the data into consistent formats that are easier for our analysis especially when evaluating cross-platform social media engagement. We will also need to mask confidential information such as the company’s staff names as required by the project sponsor.  
To ensure accuracy of our model, we will identify missing values and outliers that are observed during the previous stage. We will go through these missing values separately and decide on how we should handle it (whether by replacing with the average value or simply remove the entire row). As for handling outliers, we will try attempt to analyze and come up with a reason for the outlier and see if it will affect our analysis.  
+
|-
<br>
+
| 3
<br>
+
| Exploratory Data Analysis
'''Data Normalisation and Transformation'''<br>
+
| We will examine the entries of the various social media platform for the same campaign. From here, we will be able to discover possible insights to deliver to our sponsors, for example to find out which social media platform is most suitable for video or blog posts.  
To better cater the data to our needs, we will perform data transformation to transform some of the columns into rows and transforming between categorical and numerical variables so that we can better analyze the data. If the values in certain attributes varies too much, we will normalize these attributes to ensure that the analysis will be accurate.
+
|-
<br>
+
| 4
<br>
+
| Data Cleaning
'''Data Modeling (Steps 6-8)'''<br>
+
| To ensure accuracy of our model, we will identify missing values and outliers that are observed during the previous stage. We will go through these missing values separately and decide on how we should handle it (whether by replacing with the average value or simply remove the entire row). As for handling outliers, we will try attempt to analyze and come up with a reason for the outlier and see if it will affect our analysis.  
We will develop an analytical software application if necessary, using software like SAS or JMP Pro.  
+
|-
 
+
| 5
 +
| Data Normalisation and Transformation
 +
| To better cater the data to our needs, we will perform data transformation to transform some of the columns into rows and transforming between categorical and numerical variables so that we can better analyze the data. If the values in certain attributes varies too much, we will normalize these attributes to ensure that the analysis will be accurate.
 +
|-
 +
| 6-8
 +
| Data Modelling
 +
| We will develop an analytical software application if necessary, using software like SAS or JMP Pro.  
 +
|}
 
</font></div>
 
</font></div>
 
<br/>
 
<br/>

Latest revision as of 12:56, 10 April 2018

APex Logo.PNG


HOME ABOUT US PROJECT OVERVIEW EDA BUSINESS OBJECTIVES PROJECT MANAGEMENT DOCUMENTATION MAIN PAGE



Project Overview

Our group aims to synthesize data from multiple platforms such as Facebook, Instagram, YouTube and blogs to deliver coherent and strategic insights for XYZ Web hosting company to improve their social media outreach as well as plan for content creation.



Motivation

Having published various content types over the years, XYZ company knows what posts perform well and what doesn’t. However, monitoring content engagement over different platforms such as Facebook, YouTube, Instagram and their blogs can be highly complex due to the nature of the posts such as timing, content type, authors.

As such, they would like to perform a holistic, cross-platform analysis to quantify and investigate the causes of virality. In addition, they learned that Facebook has recently changed ranking algorithms for posts in December 2017. Specifically, posts that encourage sharing and commenting to participate in competitions will be downgraded in rankings. XYZ company would like to investigate the impact of this change on their Facebook outreach, and if possible, provide recommendations.



Objectives

Milestone 1 Objectives

  • Gather all available data from XYZ Web hosting company across platforms to evaluate possibility of cross-platform social media engagement analysis
  • Conduct preliminary analysis on all datasets to discover possible insights to deliver
  • Synthesize data from all platforms (Facebook, Blog, Instagram, YouTube)
  • Investigate impact of Facebook’s algorithm change on XYZ company’s post engagement rates


Milestone 2 Objectives

  • Determine how the change in Facebook algorithm affect the videos / post engagement
  • Determine which are the types of facebook videos that has high drop out rates
  • Determine which are the youtube videos and series that are performing better
  • Determine the golden ratio that would best serve as a guideline that reflects organic engagement

Data

XYZ will provide us with social media insights data from various platforms (Facebook, Instagram, YouTube, Blog) for the period of January 2017 to December 2017. As of the time of proposal submission, we have only received data for Facebook and YouTube.



Methodology

Step(s) Stage About
1 Data Collection We will use the data provided to us by our project sponsor exported from FaceBook Insights, YouTube Analytics. The data comes in the format of Microsoft Excel.
2 Data Preparation As the exported data comprises of multiple tabs and columns in multiple excel files, such as post level or video post, we will attempt to organize the data into consistent formats that are easier for our analysis especially when evaluating cross-platform social media engagement. We will also need to mask confidential information such as the company’s staff names as required by the project sponsor.
3 Exploratory Data Analysis We will examine the entries of the various social media platform for the same campaign. From here, we will be able to discover possible insights to deliver to our sponsors, for example to find out which social media platform is most suitable for video or blog posts.
4 Data Cleaning To ensure accuracy of our model, we will identify missing values and outliers that are observed during the previous stage. We will go through these missing values separately and decide on how we should handle it (whether by replacing with the average value or simply remove the entire row). As for handling outliers, we will try attempt to analyze and come up with a reason for the outlier and see if it will affect our analysis.
5 Data Normalisation and Transformation To better cater the data to our needs, we will perform data transformation to transform some of the columns into rows and transforming between categorical and numerical variables so that we can better analyze the data. If the values in certain attributes varies too much, we will normalize these attributes to ensure that the analysis will be accurate.
6-8 Data Modelling We will develop an analytical software application if necessary, using software like SAS or JMP Pro.