Difference between revisions of "ISSS608 2016-17 T1 Assign3 Liu Jialin"

From Visual Analytics and Applications
Jump to navigation Jump to search
Line 1: Line 1:
 
<!--MAIN HEADER -->
 
<!--MAIN HEADER -->
 
{|style="background-color:#1B338F;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0"  |
 
{|style="background-color:#1B338F;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0"  |
| style="font-family:Century Gothic; font-size:100%; solid #000000; background:#2B3856; text-align:center;" width="25%" |  
+
| style="font-family:Century Gothic; font-size:100%; solid #000000; background:#2B3856; text-align:center;" width="20%" |  
 
;
 
;
[[ISSS608_2016-17_T1_Assign3_Liu_Jialin| <font color="#FFFFFF">Data Preparation</font>]]
+
[[ISSS608_2016-17_T1_Assign3_Liu_Jialin| <font color="#FFFFFF">Overview</font>]]
  
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="25%" |  
+
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%" |
 +
;
 +
[[Data Preparation| <font color="#FFFFFF">Data Preparation</font>]]
 +
 
 +
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%" |  
 
;
 
;
 
[[Question 1| <font color="#FFFFFF">Question 1</font>]]
 
[[Question 1| <font color="#FFFFFF">Question 1</font>]]
  
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="25%" |  
+
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%" |  
 
;
 
;
 
[[Question 2| <font color="#FFFFFF">Question 2</font>]]
 
[[Question 2| <font color="#FFFFFF">Question 2</font>]]
  
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="25%" |  
+
| style="font-family:Century Gothic; font-size:100%; solid #1B338F; background:#2B3856; text-align:center;" width="20%" |  
 
;
 
;
 
[[Question 3| <font color="#FFFFFF">Question 3</font>]]
 
[[Question 3| <font color="#FFFFFF">Question 3</font>]]
Line 20: Line 24:
 
|}
 
|}
 
<br/>
 
<br/>
Data preparation using JMP: <br>
 
• Use concatenate function to combine the communication records across 3 days into one file, name file <b>Communication in 3 days</b>.<br>
 
• Sort ascending on “timestamp”, then sort ascending on “from”. Now the messages send by the same ID appear together and appear in time order.<br>
 
• Hide and exclude all messages to and from 1278894 and 839736. Unexclude the rows accordingly when needed.<br>
 
• Create a column, name it "Unique Combination", apply formula: “Char(:from) || Char(:to)”.<br>
 
• Tabulate "Unique Combination" and N, make into data table, name file <b>Unique direction count of messages</b>.<br>
 
• In <b>Unique direction count of messages</b>, change column name “N” to “weight”.<br>
 
• Update <b>Communication in 3 days</b> from <b>Unique direction count of messages</b>, update with “weight” column.<br>
 
• In <b>Communication in 3 days</b>, create a column called “Timestamp difference in min”, apply formula “Dif(:Timestamp, 1) / 60”.<br>
 
• Save file as <b>Edges for communication</b>. Remove column “Timestamp difference in min”, sort ascending by “Unique Combination”.<br>
 
• Unlock the “Unique combination” row, change row information from characters to numerical, continuous.<br>
 
• Create a new column, name it “remove duplicates”, and apply formula “Dif(:Unique Combination, 1)”.<br>
 
• Select all rows with remove duplicates = 0, these are duplicate rows, delete these rows.<br>
 
• Delete columns “Location”, “Timestamp” and “remove duplicates”.<br>
 
• Change “from” to “Source” and “to” to “Target”<br>
 
• Save file, export as excel, name exported file <b>Nodes for communication</b>.<br>
 
• In excel, copy all the Target nodes at the end of Source column. Remove duplicates for this column. Delete the Target Column.<br>
 
• Change “Source” to “ID”.<br>
 
<br>
 
Gephi:<br>
 
• Import into Gephi using <b>Nodes for communication</b> and <b>Edges for communication</b>.<br>
 
• In Gephi, using Hu Yifan layout, change optimal distance to 200, run the layout to obtain a satisfactory layout.<br>
 
• Set nodes size depends on Degree and nodes colour to depend on Out-degree.<br>
 
• Set colour of edges to depend on weight.<br>
 
• In filter, select topology, drag Mutual Degree into filter. Change the filters to obtain the filtered layouts.<br>
 
• In context, check the number of nodes remained using this filter.<br>
 

Revision as of 22:01, 28 October 2016