Difference between revisions of "ISSS608 2016-17 T1 Assign3 Shishir Nehete"

From Visual Analytics and Applications

Revision as of 00:41, 30 October 2016

To be a Visual Detective: Detecting spatio-temporal patterns

Overview


DinoFun World is a typical modest-sized amusement park, sitting on about 215 hectares and hosting thousands of visitors each day. It has a small-town feel, but it is well known for its exciting rides and events.
Our task is to analyse the data for one event, which was organized last year as a weekend tribute to Scott Jones, an internationally renowned football (“soccer,” in US terminology) star. Scott Jones is from a town near DinoFun World. He was a classic hometown hero, with thousands of fans who cheered his success as if he were a beloved family member. However, the event was marred by crime and mayhem perpetrated by a poor, misguided and disgruntled figure from Scott’s past.
In view of this mayhem, we investigate the in-app communication data over the three days, identify the patterns of communication, and hypothesize when the vandalism was discovered.


Task


We have access to the in-app communication data over the three days of the Scott Jones celebration. This includes communications between the paying park visitors, as well as communications between the visitors and park services. In addition, the data also contains records indicating if and when the user sent a text to an external party. Our task is to use visual analytics techniques to analyze the available data and develop responses to the questions below.

  1. Identify those IDs that stand out for their large volumes of communication. For each of these IDs:
    1. Characterize the communication patterns you see.
    2. Based on these patterns, what do you hypothesize about these IDs?
  2. Describe up to 10 communications patterns in the data. Characterize who is communicating, with whom, when and where. If you have more than 10 patterns to report, please prioritize those patterns that are most likely to relate to the crime.
  3. From this data, can you hypothesize when the vandalism was discovered? Describe your rationale.


Data

Data Preparation



Visualization Software



Results

Task 1


The figure below shows the communication data for Friday.

Outstanding communication IDs on Friday

The figure below shows the communication data for Saturday.

Outstanding communication IDs on Saturday

The figure below shows the communication data for Sunday.

Outstanding communication IDs on Sunday

Analysing the communication data for all three days shows that two IDs stand out in the park's communications: 1278894 and 839736. Another ID that is the target of a large volume of communication is 9999999, which refers to the external party. Further analysis of ID 1278894 shows that it communicates with the majority of the visitors in the park, so it can be hypothesized that this is a check-in monitoring ID in the park setup. This ID is also located at the Entry Corridor, which supports the hypothesis.
The other ID, 839736, also communicates with a large number of visitors and can be hypothesized to be a kind of service ID for the park. It too is located at the Entry Corridor. We will analyse the communication patterns of these IDs further in Task 2, which describes the communication patterns.
Another notable point from this analysis is that communication with ID 839736 increased drastically on Sunday, even though the number of visitors was close to the number on Saturday.
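
The kind of count behind "IDs that stand out" can be sketched in a few lines of Python. This is only an illustration, not the method used for the figures in this report: the (sender, receiver) record layout, the function name and the toy rows are all assumptions, and the rows are made-up values rather than records from the park data.

```python
from collections import Counter, defaultdict

# Toy records in an ASSUMED (sender, receiver) layout.
# These rows are illustrative only, not taken from the park data.
records = [
    ("101", "1278894"),
    ("102", "1278894"),
    ("103", "839736"),
    ("1278894", "101"),
    ("104", "external"),
]

def summarize(rows):
    """Return (message volume per ID, distinct partners per ID)."""
    volume = Counter()
    partners = defaultdict(set)
    for sender, receiver in rows:
        # Each message counts once for the sender and once for the receiver.
        volume[sender] += 1
        volume[receiver] += 1
        partners[sender].add(receiver)
        partners[receiver].add(sender)
    return volume, partners

volume, partners = summarize(records)

# IDs ranked by total volume; the top entries are the "outstanding" IDs.
ranked = [id_ for id_, _ in volume.most_common()]
print(ranked[0], volume[ranked[0]], len(partners[ranked[0]]))
```

Ranking by volume shows which IDs are busiest, while the distinct-partner count indicates whether an ID talks to many different visitors, which is the behaviour the check-in and service hypotheses rely on.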

Change over weekend


As the table above shows, communication with 839736 increased fourfold on Sunday, while the number of visitors and the volume of check-in monitoring communication did not change significantly.
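
The weekend comparison is a simple ratio of Sunday counts to Saturday counts. The sketch below shows that arithmetic under stated assumptions: the function name is hypothetical and the counts are placeholder numbers chosen for illustration, not the values from the table.

```python
def fold_change(earlier, later):
    """Ratio of a later count to an earlier count (e.g. Sunday vs Saturday)."""
    if earlier == 0:
        raise ValueError("earlier count must be non-zero")
    return later / earlier

# PLACEHOLDER counts for illustration only; not the figures from the table.
daily_counts = {
    "service_id_messages": {"Saturday": 100, "Sunday": 400},
    "visitors": {"Saturday": 1000, "Sunday": 1050},
}

changes = {
    name: fold_change(days["Saturday"], days["Sunday"])
    for name, days in daily_counts.items()
}
print(changes)
```

A ratio near 1 for visitors next to a much larger ratio for one ID is what marks that ID's Sunday traffic as unusual rather than a side effect of attendance.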


Task 2



Task 3


TimeSeries Communication
Check-in Location Analysis


References



Comments