Difference between revisions of "APA Final Progress"
Line 156: | Line 156: | ||
* Location: Singapore being the head quarters has the most number of employees | * Location: Singapore being the head quarters has the most number of employees | ||
+ | Node: Each employee | ||
+ | Node Color: Hierarchy | ||
+ | Node Size: Eigenvector Centrality | ||
+ | No weights for edges – purely based on quantity | ||
+ | Many Senior Management and Upper Management Employees seem to have a low centrality score | ||
+ | Possibly a biased solution | ||
+ | Need for feature engineering to add weight that removes the bias | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
<br> | <br> | ||
</p> | </p> |
Revision as of 15:57, 23 February 2017
Preliminary analysis
Email Data
Additionally, we also found several system email addresses that can potentially skew the data (due to mass emails). Hence, we decided to eliminate emails to and from these email addresses as well. Below, we have listed some of the email addresses and the number of times they occurred in the data set.
Further, we removed columns that we do not need in our analysis. These include:
Email Data
Node: Each employee Node Color: Hierarchy Node Size: Eigenvector Centrality No weights for edges – purely based on quantity Many Senior Management and Upper Management Employees seem to have a low centrality score Possibly a biased solution Need for feature engineering to add weight that removes the bias
|