Difference between revisions of "ISSS608 2017-18 T3 Assign Jyoti Bukkapatil Data Preparation"

From Visual Analytics and Applications

Revision as of 18:30, 7 July 2018

Data for Visualisation

The Kasios International insider provided 10 CSV files. They consist mainly of call, email, meeting and purchase records dating from 11 May 2015, 14:00:00 onwards. Each record file contains a source, a destination, the connection type and a time in seconds. The table below lists the files provided by the Kasios insider.

Index | Data | File Name | Number of Records
1 | Call records for the whole company, starting from 11 May 2015 14:00:00 hrs | calls.csv | 10606835
2 | Email records for the whole company, starting from 11 May 2015 14:00:00 hrs | email.csv | 14550085
3 | Meeting records for the whole company, starting from 11 May 2015 14:00:00 hrs | meeting.csv | 127351
4 | Purchase records for the whole company, starting from 11 May 2015 14:00:00 hrs | purchaces.csv | 762200
5 | Company employee ID and name list | CompanyIndex.csv | 642631
6 | Suspicious call records of the suspicious group in the company, starting from 11 May 2015 14:00:00 hrs | Suspicious_calls.csv | 70
7 | Suspicious email records of the suspicious group in the company, starting from 11 May 2015 14:00:00 hrs | Suspicious_emails.csv | 61
8 | Suspicious meeting records of the suspicious group in the company, starting from 11 May 2015 14:00:00 hrs | Suspicious_meetings.csv | 5
9 | Suspicious purchase records of the suspicious group in the company, starting from 11 May 2015 14:00:00 hrs | Suspicious_purchases.csv | 1
10 | Seven other suspicious purchase records | Other_suspicious_purchases.csv | 7

All data files contain only four columns:

  1. Source ID: Company ID of the person who initiated the connection, i.e. who called someone, sent an email, invited someone to a meeting or purchased something
  2. Etype: Connection type, i.e. 0 – Calls, 1 – Emails, 2 – Purchases and 3 – Meetings
  3. Target ID: Company ID of the person at the receiving end of the connection
  4. Time Stamp: Time in seconds, counted from 11 May 2015 at 14:00
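
As an illustration of how the four columns above can be read, the following is a minimal Python sketch that decodes one raw record. It is not the tool used for this assignment. The function name `decode_record`, the sample IDs, the column order (taken from the list above) and the absence of a time zone are all assumptions made for the example.

```python
from datetime import datetime, timedelta

# Assumption: offsets count from 11 May 2015 14:00:00, as stated above.
# The source gives no time zone, so a naive datetime is used.
EPOCH = datetime(2015, 5, 11, 14, 0, 0)

# Etype codes as listed above.
ETYPE_LABELS = {0: "Calls", 1: "Emails", 2: "Purchases", 3: "Meetings"}


def decode_record(source_id, etype, target_id, time_stamp):
    """Turn one raw four-column record into a readable dictionary."""
    return {
        "source": int(source_id),
        "mode": ETYPE_LABELS[int(etype)],
        "target": int(target_id),
        "when": EPOCH + timedelta(seconds=int(time_stamp)),
    }


# Hypothetical record: employee 101 emails employee 202,
# 3600 seconds after the start of the data.
example = decode_record("101", "1", "202", "3600")
print(example["mode"], example["when"])
```

Keeping the Etype codes in one dictionary means an unknown code raises a `KeyError` instead of being silently mislabelled.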

The columns were renamed to make further analysis easier to follow:

  1. Source ID was changed to Source
  2. Target ID was changed to Target
  3. Etype was changed to Communication Mode
  4. Time Stamp was changed to Time in Sec.
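
The renaming above could be scripted as in the sketch below, which uses only the Python standard library. It is a hedged example, not the procedure actually followed here. It assumes the CSV file carries a header row with the original column names, writes the last name as `Time in Sec` without the trailing full stop, and uses a made-up one-record sample; `rename_columns` is a hypothetical helper.

```python
import csv
import io

# Mapping taken from the renaming list above.
RENAMES = {
    "Source ID": "Source",
    "Target ID": "Target",
    "Etype": "Communication Mode",
    "Time Stamp": "Time in Sec",
}


def rename_columns(rows):
    """Yield each row dict with the original headers replaced by the new names."""
    for row in rows:
        yield {RENAMES.get(key, key): value for key, value in row.items()}


# Hypothetical sample standing in for one of the CSV files.
# Assumption: the file has a header row with the original column names.
sample = io.StringIO("Source ID,Etype,Target ID,Time Stamp\n101,1,202,3600\n")
renamed = list(rename_columns(csv.DictReader(sample)))
print(renamed[0])
```

Because `rename_columns` is a generator, it can stream over the multi-million-row files without loading them fully into memory.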