Difference between revisions of "APA Final Progress"
Line 31: | Line 31: | ||
<p> | <p> | ||
+ | <big>'''Email Data'''<br></big> | ||
<big>'''Before Cleaning'''<br></big> | <big>'''Before Cleaning'''<br></big> | ||
Our data consists of 14 columns as described below: | Our data consists of 14 columns as described below: | ||
Line 129: | Line 130: | ||
* Subject: We are only using email data, and hence this will always have a value of 'em'. Hence keeping it will be redundant. | * Subject: We are only using email data, and hence this will always have a value of 'em'. Hence keeping it will be redundant. | ||
<br> | <br> | ||
+ | |||
+ | <big>'''Email Data'''<br></big> | ||
+ | {| class="wikitable" | ||
+ | |+ | ||
+ | |- | ||
+ | |Name | ||
+ | |Name of employee | ||
+ | |- | ||
+ | |Hierarchy | ||
+ | |Designation of employee | ||
+ | |- | ||
+ | |Department | ||
+ | |Department of employee | ||
+ | |- | ||
+ | |Location | ||
+ | |Location where the employee is based | ||
+ | |} | ||
+ | |||
+ | [[File:Department.png|300px]] | ||
+ | [[File:Hierarchy.jpg|300px]] | ||
+ | [[File:Location.png|300px]] <br> | ||
+ | |||
+ | * Department: Marketing, Development and Sales have the most employees | ||
+ | * Hierarchy: Associates are the most numerous | ||
+ | * Location: Singapore, being the headquarters, has the most employees | ||
+ | |||
+ | |||
+ | |||
<ul> | <ul> | ||
<li> Exploration of network : filtered for internal employees only</li> | <li> Exploration of network : filtered for internal employees only</li> |
Revision as of 15:44, 23 February 2017
Preliminary analysis
Email Data
We also found several system email addresses that could skew the data because they send or receive mass emails. We therefore removed all emails to and from these addresses. Below, we have listed some of these addresses and the number of times each occurred in the data set.
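The filtering step described above can be sketched as follows. This is only an illustration under stated assumptions, not the code used in the project: the field names `from` and `to`, the example addresses, and the helper names `count_system_emails` and `drop_system_emails` are all hypothetical.

```python
from collections import Counter

# Hypothetical system addresses; the real list would come from inspecting the data.
SYSTEM_ADDRESSES = {"noreply@example.com", "alerts@example.com"}


def count_system_emails(emails, system_addresses):
    """Count how many emails involve each system address as sender or recipient."""
    counts = Counter()
    for email in emails:
        # A set avoids counting one email twice if sender and recipient match.
        for address in {email["from"], email["to"]}:
            if address in system_addresses:
                counts[address] += 1
    return counts


def drop_system_emails(emails, system_addresses):
    """Keep only emails where neither sender nor recipient is a system address."""
    return [
        email
        for email in emails
        if email["from"] not in system_addresses
        and email["to"] not in system_addresses
    ]


# Made-up sample records, for illustration only.
emails = [
    {"from": "alice@example.com", "to": "bob@example.com"},
    {"from": "noreply@example.com", "to": "alice@example.com"},
    {"from": "bob@example.com", "to": "alerts@example.com"},
    {"from": "noreply@example.com", "to": "bob@example.com"},
]

system_counts = count_system_emails(emails, SYSTEM_ADDRESSES)
cleaned = drop_system_emails(emails, SYSTEM_ADDRESSES)
```

Counting before dropping keeps a record of how much traffic each system address accounted for, which is the kind of tally the listing above refers to.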
Further, we removed columns that we do not need in our analysis. These include:
Email Data