AY1516 T2 Team AP Analysis Findings

From Analytics Practicum
Jump to navigation Jump to search

Team ap home white.png HOME

Team ap overview white.png OVERVIEW

Team ap analysis white.png ANALYSIS

Team ap project management white.png PROJECT MANAGEMENT

Team ap documentation white.png DOCUMENTATION


Data Retrieval & Manipulation Findings Post interim plan

Data Exploration at Follower Level

Understanding Retweeters' preferences
Retweet frequency by category chart.PNG
We looked the combined data that we have gotten - each follower of SGAG, with a sampled dataset of the number of retweets that have for SGAG posts, grouped according to category. It is interesting to see that certain categories stand out and are much likely preferred than others. In particular, the categories of tweet of content most likely to be retweeted in order of highest to lowest preference seem to be School, National Events, and Politics, with 16.9%, 15.5% and 14.3% of the total number of retweets respectively. Our initial intuition is that since these are the more popular kinds of content that Twitters read through, and thus in effect want to share it with others through retweets, SGAG can steer themselves towards creating more of these content.

We further evaluate these 3 top categories identified to be retweeted the most, and that it is probably due to 2 reasons: firstly, general uptake and viral tweets that are generally retweeted by the general population of followers of SGAG, or secondly, certain groups of retweeters that are more active in retweeting. In other words, these are the influencers of the network, and if accurately identified, will be critical to SGAG's growth and should be SGAG's primary group of twitter users to target for maximal effect. Even within this group of influencers, each Twitter user probably has a preference for each category of retweet. We hope to find a possible profile of users that tend to select and retweet specific categories of content.

Data Exploration at Post Level

Bar Chart Category.png