Difference between revisions of "ISSS608 2016-17 T1 Assign3 Agrim Gairola"

From Visual Analytics and Applications
Jump to navigation Jump to search
m
Line 6: Line 6:
  
 
=The Task=
 
=The Task=
<p>You have access to the in-app communication data over the three days of the Scott Jones celebration. This includes communications between the paying park visitors, as well as communications between the visitors and park services. In addition, the data also contains records indicating if and when the user sent a text to an external party.<br/>  
+
<p>You have access to the in-app communication data over the three days of the Scott Jones celebration. This includes communications between the paying park visitors, as well as communications between the visitors and park services. In addition, the data also contains records indicating if and when the user sent a text to an external party.<br/> <br/>
Task1: Use visual analytics to analyze the available data and develop responses to the questions below.<br/>
+
<B>Task1:</B> Use visual analytics to analyze the available data and develop responses to the questions below.<br/>
 
a.Identify those IDs that stand out for their large volumes of communication.<br/>
 
a.Identify those IDs that stand out for their large volumes of communication.<br/>
 
b.For each of these IDs Characterize the communication patterns you see.<br/>  
 
b.For each of these IDs Characterize the communication patterns you see.<br/>  
 
c.Based on these patterns, what do you hypothesize about these IDs?<br/>  
 
c.Based on these patterns, what do you hypothesize about these IDs?<br/>  
 
+
<br/>
Task2: Describe up to 10 communications patterns in the data. Characterize who is communicating, with whom, when and where. If you have more than 10 patterns to report, please prioritize those patterns that are most likely to relate to the crime<br/>
+
<B>Task2:</B> Describe up to 10 communications patterns in the data. Characterize who is communicating, with whom, when and where. If you have more than 10 patterns to report, please prioritize those patterns that are most likely to relate to the crime<br/>
Task3: From this data, can you hypothesize when the vandalism was discovered? Describe your rationale. Note: Please limit your response to no more than 3 images and 300 words.</p>
+
<br/>
 +
</b>Task3</B>: From this data, can you hypothesize when the vandalism was discovered? Describe your rationale. Note: Please limit your response to no more than 3 images and 300 words.</p>
 
<br/>
 
<br/>
 
=Tools Used=
 
=Tools Used=

Revision as of 19:45, 28 October 2016

ISSS608 2016-17 T1 Assign1_Agrim Gairola

MAYHEM AT DINOFUN WORLD

Overview


DinoFun World is a typical modest-sized amusement park, sitting on about 215 hectares and hosting thousands of visitors each day. It has a small town feel, but it is well known for its exciting rides and events. One event last year was a weekend tribute to Scott Jones, internationally renowned football (“soccer,” in US terminology) star. Scott Jones is from a town nearby DinoFun World. He was a classic hometown hero, with thousands of fans who cheered his success as if he were a beloved family member. To celebrate his years of stardom in international play, DinoFun World declared “Scott Jones Weekend”, where Scott was scheduled to appear in two stage shows each on Friday, Saturday, and Sunday to talk about his life and career. In addition, a show of memorabilia related to his illustrious career would be displayed in the park’s Pavilion. However, the event did not go as planned. Scott’s weekend was marred by crime and mayhem perpetrated by a poor, misguided and disgruntled figure from Scott’s past.

The Task

You have access to the in-app communication data over the three days of the Scott Jones celebration. This includes communications between the paying park visitors, as well as communications between the visitors and park services. In addition, the data also contains records indicating if and when the user sent a text to an external party.

Task1: Use visual analytics to analyze the available data and develop responses to the questions below.
a.Identify those IDs that stand out for their large volumes of communication.
b.For each of these IDs Characterize the communication patterns you see.
c.Based on these patterns, what do you hypothesize about these IDs?

Task2: Describe up to 10 communications patterns in the data. Characterize who is communicating, with whom, when and where. If you have more than 10 patterns to report, please prioritize those patterns that are most likely to relate to the crime

Task3: From this data, can you hypothesize when the vandalism was discovered? Describe your rationale. Note: Please limit your response to no more than 3 images and 300 words.


Tools Used

  • Tableau version 10.0
  • JMP Pro
  • Gephi
  • Microsoft Office


Task 1

The following steps were carried out to prepare the data for effective analysis:
Data Manipulation: A unique ID was given to each record for the ease of analysis.
Data Type Conversion: On importing the data into JMP, age and work experience was kept in continuous data type. All the remaining data was converted to nominal data type.
Missing data analysis: Missing data analysis was performed on the data in order to identify the missing data and suitably recoding them.

1.jpg

Assumption: There were several unambiguous values that could be noted throughout the data set. These values were recoded based on the below assumptions:

2.jpg

<