ISSS608 2016-17 T1 Assign3 Lee Gwo Mey

From Visual Analytics and Applications
Revision as of 12:39, 28 October 2016 by Gwomey.lee.2016 (talk | contribs)

Abstract

<WIP>

Background of Case

DinoFun World is a typical modest-sized amusement park, sitting on about 215 hectares and hosting thousands of visitors each day. It has a small-town feel, but it is well known for its exciting rides and events.

One event last year was a weekend tribute to Scott Jones, an internationally renowned football ("soccer" in US terminology) star. Scott Jones is from a town near DinoFun World. He was a classic hometown hero, with thousands of fans who cheered his success as if he were a beloved family member. To celebrate his years of stardom in international play, DinoFun World declared a "Scott Jones Weekend", during which Scott was scheduled to appear in two stage shows on each of Friday, Saturday and Sunday to talk about his life and career. In addition, memorabilia related to his illustrious career would be displayed in the park's Pavilion. However, the event did not go as planned. Scott's weekend was marred by crime and mayhem perpetrated by a poor, misguided and disgruntled figure from Scott's past.

While the crimes were rapidly solved, park officials and law enforcement figures are interested in understanding just what happened during that weekend so that they can better prepare themselves for future events. They are interested in understanding how people move and communicate in the park, how these patterns change and evolve over time, and what can be understood about the motivations for changing patterns.

The Tasks

Task 1 (Not More than 4 images and 300 words)

Identify those IDs that stand out for their large volume of communication. For each of these IDs,

  • Characterize the communication patterns you see
  • Based on these patterns, what do you hypothesize about these IDs?

Task 2 (Not More than 10 images and 1000 words)

Describe up to 10 communication patterns in the data. Characterize who is communicating, with whom, when and where. If you have more than 10 patterns to report, please prioritize those patterns that are most likely to relate to the crime.

Task 3 (Not More than 3 images and 300 words)

From this data, can you hypothesize when the vandalism was discovered? Describe your rationale.

Data Sets

  • DinoFunWorld_CommData.zip (3 days' in-app communication data)
  • DinoFunWorld_MoveData.zip (3 days' park movement data)
  • DinoFunWorld_LayoutMap.zip
  • DinoFunWorld_Website.zip (webpages of DinoFun World Park)

The communication data includes communications between the paying park visitors, as well as communications between the visitors and park services. In addition, the data also contains records indicating if and when the user sent a text to an external party.

Brief descriptions of the communication data fields are:

  • Timestamp: date (yyyy-mm-dd) and time (hh:mm:ss AM/PM) of the communication. E.g. 2014-06-06 08:03:19AM
  • From: identifier of the party that sent the message. E.g. ID_439105
  • To: identifier of the party that received the message. E.g. ID_1053224
  • Location: name of the location where the message was sent/received. E.g. Kiddie Land
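
As a rough illustration of the record layout described above, the Python sketch below parses one row into typed fields. It is only a sketch under stated assumptions: the CSV header names (Timestamp, From, To, Location), the `CommRecord` type, the `parse_comm_rows` helper and the timestamp format string are all inferred from the field list and its examples, not taken from the actual files, and the sample row is assembled from the example values.

```python
import csv
import io
from datetime import datetime
from typing import NamedTuple

# Assumed format, inferred from the example "2014-06-06 08:03:19AM".
# %p parsing of AM/PM assumes an English/C locale.
TIMESTAMP_FORMAT = "%Y-%m-%d %I:%M:%S%p"


class CommRecord(NamedTuple):
    """One communication record (hypothetical type for illustration)."""
    timestamp: datetime
    sender: str
    receiver: str
    location: str


def parse_comm_rows(lines):
    """Parse CSV lines whose header is assumed to be Timestamp,From,To,Location."""
    reader = csv.DictReader(lines)
    return [
        CommRecord(
            timestamp=datetime.strptime(row["Timestamp"].strip(), TIMESTAMP_FORMAT),
            sender=row["From"].strip(),
            receiver=row["To"].strip(),
            location=row["Location"].strip(),
        )
        for row in reader
    ]


# Illustrative sample built from the example values in the field list.
SAMPLE = """Timestamp,From,To,Location
2014-06-06 08:03:19AM,ID_439105,ID_1053224,Kiddie Land
"""

records = parse_comm_rows(io.StringIO(SAMPLE))
print(records[0])
```

If the real files use a different header or a 24-hour clock, only the column names and `TIMESTAMP_FORMAT` would need to change.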

Visualization Software Used

  • JMP Pro 12
  • Tableau 10.0
  • Gephi 0.9.1

Responses to Tasks

Task 1: IDs with Large Volume of Communication

Overview of IDs Communication Volume
Figure 1.1: Overview of IDs by Total Sent and Received Messages

  • Figure 1.1 shows an overview of the total number of messages sent and/or received by each ID
  • The median number of messages per ID is 428
  • The 3 IDs with an exceptionally high number of messages compared to the rest are ID_1278894, ID_839736, and ID_External
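
The analysis here was done in JMP Pro, Tableau and Gephi, but the same kind of count can be reproduced in a few lines of Python. The sketch below totals messages sent plus received per ID, takes the median, and flags IDs far above it. The function names, the toy data and the cutoff of 10 times the median are assumptions chosen purely for illustration; they are not the method or the thresholds behind Figure 1.1.

```python
from collections import Counter
from statistics import median


def message_totals(pairs):
    """Count messages sent plus received for each ID.

    `pairs` is an iterable of (sender, receiver) tuples.
    A message an ID sends to itself would be counted twice.
    """
    totals = Counter()
    for sender, receiver in pairs:
        totals[sender] += 1
        totals[receiver] += 1
    return totals


def high_volume_ids(totals, factor=10):
    """Return IDs whose total exceeds `factor` times the median (arbitrary cutoff)."""
    cutoff = factor * median(totals.values())
    return sorted(id_ for id_, count in totals.items() if count > cutoff)


# Toy data (hypothetical IDs): one hub messaging 30 visitors, plus two ordinary messages.
pairs = [("ID_A", "ID_B"), ("ID_B", "ID_C")]
pairs += [("ID_HUB", f"ID_{i}") for i in range(30)]

totals = message_totals(pairs)
print("median messages per ID:", median(totals.values()))
print("high-volume IDs:", high_volume_ids(totals))
```

On the real data, the (From, To) columns of the communication file would take the place of `pairs`, and the cutoff would need to be chosen by inspecting the distribution rather than fixed in advance.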