Difference between revisions of "ANLY482 AY2017-18T2 Group30 Project Overview"

From Analytics Practicum
Jump to navigation Jump to search
Line 28: Line 28:
 
<div align="center">
 
<div align="center">
 
<div style=" width: 85%; background: #E6EDFA; padding: 12px; font-family: Arimo; font-size: 18px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #8c8d94 solid 32px;"><font color="#5A6B96">Project Overview</font></div>
 
<div style=" width: 85%; background: #E6EDFA; padding: 12px; font-family: Arimo; font-size: 18px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #8c8d94 solid 32px;"><font color="#5A6B96">Project Overview</font></div>
<div style="width:85%;"><font> </font></div>
+
<div style="width:85%;"><font> Our group aims to synthesize data from multiple platforms such as Facebook, Instagram, Youtube and blogs to deliver coherent and strategic insights for TSL to improve their social media outreach as well as plan for content creation. </font></div>
 
<br/>
 
<br/>
  

Revision as of 18:38, 13 January 2018

APex Logo.PNG


HOME ABOUT US PROJECT OVERVIEW PROJECT FINDINGS PROJECT MANAGEMENT DOCUMENTATION


Project Overview
Our group aims to synthesize data from multiple platforms such as Facebook, Instagram, Youtube and blogs to deliver coherent and strategic insights for TSL to improve their social media outreach as well as plan for content creation.


Motivation

Having published various content types over the years, TheSmartLocal knows what posts perform well and what doesn’t. However, monitoring content engagement over different platforms such as Facebook, Youtube, Instagram and their blogs can be highly complex due to the nature of the posts such as timing, content type, authors. As such, they would like to perform a holistic, cross-platform analysis to quantify and investigate the causes of virality. In addition, they learned that Facebook has recently changed ranking algorithms for posts in December 2017. Specifically, posts that encourage sharing and commenting to participate in competitions will be downgraded in rankings. TSL would like to investigate the impact of this change on their Facebook outreach, and if possible, provide recommendations.


Objectives

  • Gather all available data from TSL across platforms to evaluate possibility of cross-platform social media engagement analysis
  • Conduct preliminary analysis on all datasets to discover possible insights to deliver
  • Synthesize data from all platforms (Facebook, Blog, Instagram, Youtube)
  • Investigate impact of Facebook’s algorithm change on TSL’s post engagement rates


Data

TheSmartLocal will provide us with social media insights data from various platforms (Facebook, Instagram, Youtube, Blog) for the period of January 2017 - December 2017. As of the time of proposal submission, we have only received data for Facebook and Youtube.


Methodology

Data Collection
We will use the data provided to us by our project sponsor exported from FaceBook Insights, YouTube Analytics. The data comes in the format of Microsoft Excel.

Data Preparation
As the exported data comprises of multiple tabs and columns in multiple excel files, such as post level or video post, we will attempt to organize the data into consistent formats that are easier for our analysis especially when evaluating cross-platform social media engagement. We will also need to mask confidential information such as the company’s staff names as required by the project sponsor.

Exploratory Data Analysis
We will examine the entries of the various social media platform for the same campaign. From here, we will be able to discover possible insights to deliver to our sponsors, for example to find out which social media platform is most suitable for video or blog posts.

Data Cleaning
To ensure accuracy of our model, we will identify missing values and outliers that are observed during the previous stage. We will go through these missing values separately and decide on how we should handle it (whether by replacing with the average value or simply remove the entire row). As for handling outliers, we will try attempt to analyze and come up with a reason for the outlier and see if it will affect our analysis.

Data Normalisation and Transformation
To better cater the data to our needs, we will perform data transformation to transform some of the columns into rows and transforming between categorical and numerical variables so that we can better analyze the data. If the values in certain attributes varies too much, we will normalize these attributes to ensure that the analysis will be accurate.

Data Modeling (Steps 6-8)
We will develop an analytical software application if necessary, using software like SAS or JMP Pro.