YSR Project Overview

From Analytics Practicum
Jump to navigation Jump to search


HOME

 

TEAM

 

PROJECT OVERVIEW

 

PROJECT MANAGEMENT

 

DOCUMENTATION

 



Scope

Access to TrustSphere’s datasets will allow the team to build a system from scratch using previously unused raw data to better understand turnover and attrition rules.

The minimum research points we would like to address:

  • Understand the number of relationships an employee will have at different periods of time in his or her working life
  • Measure the speed of growth at which employee relationships grow in a company
  • Correlations between the sizes of internal and external relationships employees have
  • Through social network analysis, calculate the likelihood of an employee in an informal group leaving a company upon the exit of another closely tied employee
  • Identification of metrics that can help predict the likelihood of an employee leaving

It is important to note that the scope of this project is fluid and can be furthered to address additional questions TrustSphere might have regarding the dataset.

Motivation and Objectives

In addition to our initial scope of research, we increased our scope to encompass the following: a) New Employee immersion: Onboarding, also known as organizational socialization, refers to the mechanism through which new employees acquire the necessary knowledge, skills, and behaviors to become effective organizational members and insiders. Effective onboarding allows an employee to better integrate into a company thereby transforming them into an asset faster. A key metric to understand organizational immersion is the number of internal relationships the employee has made at different points of time. By benchmarking the average speed of internal relationship growth, a company can assess the effectiveness of their onboarding programs on particular employees. b) Influencer identification for adoption of new enterprise level initiatives: with the launch of new enterprise level initiatives, one of the key concerns that arises is employee level adoption of these strategic changes3. Identifying key influencers in an organization and enrolling them as champions for enterprise level change is one way to increase adoption. Through SNA, these nodes of activity can easily be discovered. c) Contextual employee performance levels: The number-one predictive element of an individual’s success in an organization is the number, the quality, and the depth of social capital—the personal relationships among those that they do business with. By creating metrics (through insights gained from SNA), a geographic and department-level system average can be created to understand employees that are underperforming or overperforming. d) Levels of collaboration: Organisations that encourage employee productivity through collaboration across networks rather than simple individual task completion will require to actively monitor collaboration silos in an organization.


Data

The dataset given to us was pulled from the outlook (mail server used at TrustSphere) database. The data basically is an exchange of emails. The dataset consists of 890,000 email exchanges and 13 variables out of which we found the following 8 to be relevant:

  • Date: includes the date and time of a particular email being sent
  • Originator: identifies the originator of an email thread
  • Direction: indicates the direction of the email sent
  • Domain Group: identifies the company to which an email address belongs to
  • Inbound Count: number of emails being received from a particular address
  • Outbound Count: number of emails being sent to a particular address
  • Size: the size of email in bytes
  • MsgID: unique identifier of a particular email thread

Review of Previous Work

Our preliminary course of action had us importing the cleaned communication log data file into Tableau. The following describes the areas covered in our initial analysis:

a) Understanding the number of active employees in the organization on a monthly basis
We defined an active employee as one attending work for that particular month. It is important to note that an employee on leave for that month would be counted as a non-active employee. We assumed that an active employee would send at least 1 email per month. The insights gained from these graphs will allow managers to understand how to increase efficiency (For eg. Optimal planning of work based on trends behind workforce participation across different months over a year).
This was done on a temporal axis of months from May, 2014 to Dec, 2015. The term Active is used to describe email activity in specific as that is what is relevant in finding social ties based on our hypothesis that higher number of emails signifies a stronger network in an organization.
As we see below, there is a steady increase from somewhere around 40 all the way up to 45 in a particular month.
<image>


b) Mapping the absolute number of emails sent per person per month through the use of a Heatmap
The absolute volume of emails sent and received by an employee per month is one of many indicators of the workload of an employee. For illustration purposes, we used a heatmap to describe workload per month for each employee. This information can help:

  • Managers allocate work across employees in a single department better
  • Understand responsiveness of employees to collaboration - an employee that receives in excess to what he or she sends could be assumed to not collaborate well. However, these insights will only flag potential issues for a manager - to get down to the root cause, more information will be needed.


<image>

c) Measuring growth or decline of the size of an employee’s network

The number of relationships an employee has at different points of time can provide an array of insights for a manager. We have assumed that a single communication flow (either forward or backward) will constitute a relationship in an organization. For the purpose of this project we focus on two insights:

  • For a new employee, the number of relationships at different points of time is a suitable estimate for his or her immersion into the organization. A slow growth or decline of a new employees network could indicate a failure to integrate well into a company.
  • Number of relationships can be used as a base for comparison of employees on the same hierarchical level. For example, Sahil (a marketing executive with 50+ relationships) can be said to be doing better compared to Ananya (a marketing executive with 30+ relationships).


<image>

d) Understanding all-organization network performers Through an initial look at our client organization’s network centrality measures for the overall organization, we understood the following insights:

  • Betweenness, a measure of influence in the organization, indicated that the client company had three highly influential individuals. These three individuals were C-Suite level employees. According to research, “perceived power distance” between executives at the top of the organization and employees at the front lines are reasons behind poor adoption of corporate level initiatives.
  • Degree, an overall measure of number of relationships employees have, proved a majority of the employees are well immersed into the organization.


<image>

Methodology

References

http://www.mckinsey.com/insights/organization/power_to_the_new_people_analyticsa

https://en.wikipedia.org/wiki/People_analytics

https://www.crunchbase.com/organization/trustsphere#/entity