ISSS608 2016-17 T1 Assign3 CHIA Yong Jian Data Review
|
|
|
|
|
Data Review and PreparationsSAS JMP Pro 12 was used to review and prepare data. 1. Communications DataThree days worth of data (Friday, Saturday, Sunday) of the fateful weekend was provided, with each having between 948,739 to 1,655,866 records. Each file has the following columns:
No missing data was observed in the dataset. For any loading into Gephi later, the following columns will be renamed:
Furthermore, a node file will also be created, consisting of unique IDs from the prepared edge file.
2. Movement DataMovement data was also provided for the 3 days, with each having between 6 to 10 million records, and the following columns:
When a missing data check was performed, there was one row of record for the Sunday movement data that does not have any information on the columns other than timestamp. It is unclear if this is due to dirty data or signs of involvement by the crime perpetrators (such as sabotage of data). The movement data, together with the park map, will be plotted using Tableau to visualise movement of the individuals in the park.
|