Difference between revisions of "ISSS608 2017-18 T3 Assign Aakanksha Kumari Data Preparation"
(6 intermediate revisions by the same user not shown) | |||
Line 2: | Line 2: | ||
<div style="background:#DC143C; border:#DC143C; padding-left:15px; text-align:center;"> | <div style="background:#DC143C; border:#DC143C; padding-left:15px; text-align:center;"> | ||
− | <font size = 5; color="#FFFFFF"><span style="font-family:Century Gothic;"> | + | <font size = 5; color="#FFFFFF"><span style="font-family:Century Gothic;">Unraveling the Secrets of Kasios : VAST Mini Challenge 3</span></font> |
</div> | </div> | ||
<!--MAIN HEADER --> | <!--MAIN HEADER --> | ||
Line 29: | Line 29: | ||
; | ; | ||
[[ISSS608 2017-18 T3 Assign Aakanksha Kumari_Q4| <font color="#FFFFFF">Question 4</font>]] | [[ISSS608 2017-18 T3 Assign Aakanksha Kumari_Q4| <font color="#FFFFFF">Question 4</font>]] | ||
− | |||
− | |||
− | |||
− | |||
| style="font-family:Century Gothic; font-size:100%; solid #DC143C; background:#DC143C; text-align:center;" width="14.3%" | | | style="font-family:Century Gothic; font-size:100%; solid #DC143C; background:#DC143C; text-align:center;" width="14.3%" | | ||
Line 83: | Line 79: | ||
<div style="font-family:Palatino Linotype; border-radius: 1px "> <big> | <div style="font-family:Palatino Linotype; border-radius: 1px "> <big> | ||
− | All provided data files have the same format. The data are provided in comma-separated format with four columns: </big></div> | + | All provided data files have the same format. The data are provided in comma-separated format with four columns: |
+ | |||
+ | {| class="wikitable" style="margin: auto;" | ||
+ | |- | ||
+ | ! Column Name!! Description | ||
+ | |- | ||
+ | | Source|| Contains the company ID# for the person who called, sent an email, purchased something, or invited people to a meeting | ||
+ | |- | ||
+ | | Etype || Contains a number designating what kind of connection is made | ||
+ | a. 0 is for calls | ||
+ | b. 1 is for emails | ||
+ | c. 2 is for purchases | ||
+ | d. 3 is for meetings | ||
+ | |||
+ | |- | ||
+ | | Destination || Contains the company ID# for the person who is receiving a call, receiving an email, selling something to a buyer, or being invited to a meeting | ||
+ | |- | ||
+ | | Time stamp|| In seconds starting on May 11, 2015 at 14:00. | ||
+ | |} | ||
+ | |||
+ | </big></div> | ||
− | |||
|} | |} | ||
Line 100: | Line 117: | ||
− | == '''Data | + | == '''Data Cleaning''' == |
{| class="wikitable" | {| class="wikitable" | ||
Line 106: | Line 123: | ||
| <div style="font-family:Palatino Linotype; border-radius: 1px "> <big> | | <div style="font-family:Palatino Linotype; border-radius: 1px "> <big> | ||
The Time stamp column in all the CSV files was converted from seconds to a standard date-time format, using May 11, 2015 at 14:00 as the baseline. | The Time stamp column in all the CSV files was converted from seconds to a standard date-time format, using May 11, 2015 at 14:00 as the baseline. | ||
− | Using Python | + | Using Python's datetime module and the pandas library, each relative time was converted to an absolute date-time. | ||
+ | |||
+ | |||
</big> </div> | </big> </div> | ||
|} | |} |
Latest revision as of 13:10, 8 July 2018
Unraveling the Secrets of Kasios : VAST Mini Challenge 3
Data Set
The Kasios Insider has provided data from across the company. There are call records, emails, purchases, and meetings. The data only includes the source of each transaction, the recipient (destination), and the time of the transaction. Contents of emails or phone calls are not available.
There are four data files that contain information about individuals that the Insider has indicated as suspicious:
All provided data files have the same format. The data are provided in comma-separated format with four columns:
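Source: Contains the company ID# for the person who called, sent an email, purchased something, or invited people to a meeting.
Etype: Contains a number designating what kind of connection is made: 0 for calls, 1 for emails, 2 for purchases, 3 for meetings.
Destination: Contains the company ID# for the person who is receiving a call, receiving an email, selling something to a buyer, or being invited to a meeting.
Time stamp: In seconds starting on May 11, 2015 at 14:00.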
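As a rough sketch, a file in this four-column format could be loaded with pandas as shown here. The Etype codes (0 for calls, 1 for emails, 2 for purchases, 3 for meetings) come from the data description. The column labels, the `load_records` helper, the sample rows, and the assumption that the files carry no header row are illustrative only and are not taken from the original assignment.

```python
import io

import pandas as pd

# Column labels follow the data description; the exact spelling is an assumption.
COLUMNS = ["Source", "Etype", "Destination", "Timestamp"]

# Etype codes as listed in the data description.
ETYPE_LABELS = {0: "call", 1: "email", 2: "purchase", 3: "meeting"}


def load_records(csv_source):
    """Read a four-column CSV (assumed to have no header row) and label each Etype."""
    df = pd.read_csv(csv_source, header=None, names=COLUMNS)
    df["EtypeLabel"] = df["Etype"].map(ETYPE_LABELS)
    return df


# Made-up rows standing in for one of the provided CSV files.
sample = io.StringIO("101,0,202,60\n303,1,404,3600\n505,3,606,86400\n")
records = load_records(sample)
print(records)
```

If the real files do include a header row, dropping `header=None` and `names=COLUMNS` lets pandas pick up the names from the file instead.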
Tools
Data Cleaning
The Time stamp column in all the CSV files was converted from seconds to a standard date-time format, using May 11, 2015 at 14:00 as the baseline. Using Python's datetime module and the pandas library, each relative time was converted to an absolute date-time.
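The conversion described above might look roughly like the following. This is a minimal sketch rather than the original script: the `to_absolute_time` helper and the sample offsets are hypothetical, and the baseline is treated as a timezone-naive timestamp, which is an assumption.

```python
from datetime import datetime

import pandas as pd

# Baseline stated in the data description: May 11, 2015 at 14:00.
# Treated as timezone-naive here (assumption).
BASELINE = datetime(2015, 5, 11, 14, 0, 0)


def to_absolute_time(seconds):
    """Convert a Series of second offsets into absolute timestamps."""
    return pd.Timestamp(BASELINE) + pd.to_timedelta(seconds, unit="s")


# Made-up offsets: zero seconds, one minute, and one day after the baseline.
offsets = pd.Series([0, 60, 86400])
absolute = to_absolute_time(offsets)
print(absolute)
```

Adding a timedelta to a fixed baseline keeps the original seconds column intact, so the relative and absolute times can sit side by side in the same table.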