Difference between revisions of "APA Final Progress"
Line 111: | Line 111: | ||
|} | |} | ||
{| class="wikitable"
|+Number of rows after removing:
|-
|'''im + inbound/outbound'''
|45,855
|-
|'''system email addresses'''
|29,797
|}
Revision as of 12:57, 23 February 2017
Preliminary analysis
Before Cleaning
We also found several system email addresses that could skew the data because they send mass emails. We therefore removed all emails sent to or from these addresses. Below, we list some of these addresses and the number of times each occurs in the data set.
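The filtering step described above could be sketched roughly as follows. This is a minimal illustration and not the cleaning script actually used: the column names <code>from</code> and <code>to</code>, the helper names, and the addresses in <code>SYSTEM_ADDRESSES</code> are all hypothetical placeholders.

```python
from collections import Counter

# Hypothetical system addresses; the real list comes from inspecting the data.
SYSTEM_ADDRESSES = {"noreply@example.com", "admin@example.com"}


def normalise(address):
    """Lower-case and trim an address so that comparisons are consistent."""
    return address.strip().lower()


def count_system_emails(rows, system_addresses):
    """Count how often each system address appears as sender or recipient."""
    counts = Counter()
    for row in rows:
        for field in ("from", "to"):  # assumed column names
            address = normalise(row[field])
            if address in system_addresses:
                counts[address] += 1
    return counts


def remove_system_emails(rows, system_addresses):
    """Keep only rows where neither sender nor recipient is a system address."""
    return [
        row for row in rows
        if normalise(row["from"]) not in system_addresses
        and normalise(row["to"]) not in system_addresses
    ]


# Tiny made-up sample, only to show the calls.
rows = [
    {"from": "alice@example.com", "to": "bob@example.com"},
    {"from": "noreply@example.com", "to": "alice@example.com"},
    {"from": "bob@example.com", "to": "admin@example.com"},
    {"from": "NoReply@example.com", "to": "bob@example.com"},
]

counts = count_system_emails(rows, SYSTEM_ADDRESSES)
cleaned = remove_system_emails(rows, SYSTEM_ADDRESSES)
```

Counting before filtering keeps a record of how many rows each system address accounts for, which is the kind of figure reported in the listing of addresses.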
Further, we removed columns that we do not need in our analysis. These include: