ISSS608 2017-18 T3 Assign Joel Choo Peng Yeow Are You Guilty

From Visual Analytics and Applications
Joel MagnifyingGlass.png

"You See, But You Do Not Observe" - Sherlock
Looking Deeper Into The Network Of Connected Individuals


 


Do You Plead Guilty?

Data Preparation

Using the list of suspects provided by the insider, we would like to determine whether anyone else appears to be closely related to the group and which employees are making suspicious purchases. The graph below depicts the original network of the suspicious individuals named by the insider. The size of each node and its label indicates the node's in-degree within the dataset. A larger node means that more edges point into it, that is, more communications are directed at that person.

Lindsy Henion, Richard Fox and Jose Ringwalk seem to be the most prominent here, so we will investigate them further.

Joel SuspiciousList.png
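
As an illustration of the in-degree used to size the nodes above, here is a minimal sketch that counts incoming edges from a list of (source, target) pairs. The function name `in_degree` and the toy edges are hypothetical and are not taken from the assignment data; Gephi computes this measure internally.

```python
from collections import Counter

def in_degree(edges):
    """Count incoming edges per node for a list of (source, target) pairs."""
    counts = Counter()
    for source, target in edges:
        counts[source] += 0  # register senders that never receive anything
        counts[target] += 1  # one more communication directed at the target
    return dict(counts)

# Hypothetical toy communications (assumption, not the real dataset).
edges = [("A", "B"), ("C", "B"), ("B", "D"), ("A", "D"), ("C", "B")]
degrees = in_degree(edges)

# Nodes sorted from most to least incoming communications.
ranking = sorted(degrees, key=degrees.get, reverse=True)
```

In this toy example "B" would be drawn largest, because three communications are directed at it.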

After filtering all activities that happened in the company, we obtain 1722 employees (nodes) and 1904 activities (edges) as seen below.

Joel Transition.png


Understanding Centrality Measures

In a connected graph, the closeness centrality (or closeness) of a node is a measure of how central the node is in the network. It is calculated as the reciprocal of the sum of the lengths of the shortest paths between the node and all other nodes in the graph. Thus the more central a node is, the closer it is to all other nodes. We will use this metric to identify who is close to the group of suspects.
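
To make the closeness measure concrete, the sketch below uses one common normalised definition: (n − 1) divided by the sum of shortest-path distances from a node to every other node, with distances found by breadth-first search. It assumes an unweighted, undirected, connected graph; the helper names and the toy path graph are hypothetical, and Gephi's own implementation may normalise differently.

```python
from collections import deque

def bfs_distances(adj, start):
    """Shortest-path hop counts from start to every reachable node."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for neighbour in adj[node]:
            if neighbour not in dist:
                dist[neighbour] = dist[node] + 1
                queue.append(neighbour)
    return dist

def closeness(adj):
    """Closeness = (n - 1) / sum of distances; assumes a connected graph."""
    n = len(adj)
    scores = {}
    for node in adj:
        total = sum(bfs_distances(adj, node).values())
        scores[node] = (n - 1) / total if total > 0 else 0.0
    return scores

# Hypothetical toy network: a simple path A - B - C - D - E.
adj = {"A": ["B"], "B": ["A", "C"], "C": ["B", "D"], "D": ["C", "E"], "E": ["D"]}
scores = closeness(adj)
most_central = max(scores, key=scores.get)
```

On this path the middle node "C" has the smallest total distance to everyone else and therefore the highest closeness.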

Betweenness, on the other hand, measures how often a node lies on the shortest paths between other nodes. High betweenness means that more information passes through that node, and removing it can cut off a large part of the graph.

After running the algorithm in Gephi, we obtain the centrality measures. Betweenness is right-skewed: only a few employees have high betweenness. Closeness, on the other hand, is distributed more evenly, so we would expect many employees to be closely connected.
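
The idea of a node standing between others can be sketched with Brandes' algorithm, a standard way to compute betweenness on unweighted graphs. This is only an illustration on an assumed undirected toy graph with a hypothetical function name; it is not the code Gephi runs, and the scores are left unnormalised.

```python
from collections import deque

def betweenness(adj):
    """Unnormalised betweenness for an unweighted, undirected graph."""
    score = {v: 0.0 for v in adj}
    for s in adj:
        order = []                      # nodes in order of discovery
        preds = {v: [] for v in adj}    # predecessors on shortest paths
        sigma = {v: 0 for v in adj}     # number of shortest paths from s
        sigma[s] = 1
        dist = {s: 0}
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in adj}   # dependency of s on each node
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                score[w] += delta[w]
    # Each undirected pair was counted from both endpoints, so halve.
    return {v: c / 2 for v, c in score.items()}

# Hypothetical toy network: a simple path A - B - C - D - E.
adj = {"A": ["B"], "B": ["A", "C"], "C": ["B", "D"], "D": ["C", "E"], "E": ["D"]}
scores = betweenness(adj)
```

Here "C" sits on the shortest paths of four pairs (A-D, A-E, B-D, B-E), so it scores highest, while the end nodes score zero.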

Joel Centrality distribution.png
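
The difference between a right-skewed and an evenly spread distribution can be checked numerically with the moment-based skewness coefficient. The sketch below uses made-up values chosen only to mimic the two shapes described above; they are not the measures exported from Gephi, and `skewness` is a hypothetical helper.

```python
def skewness(values):
    """Moment-based skewness m3 / m2**1.5 (values must not all be equal)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    return m3 / m2 ** 1.5

# Made-up illustrative values (assumption): most nodes near zero, a few very high.
betweenness_like = [0, 0, 0, 1, 1, 2, 3, 40, 120]
# Made-up illustrative values (assumption): spread symmetrically around 0.35.
closeness_like = [0.30, 0.32, 0.34, 0.35, 0.36, 0.38, 0.40]

skew_betweenness = skewness(betweenness_like)
skew_closeness = skewness(closeness_like)
```

A clearly positive coefficient indicates a long right tail, while a value near zero indicates a roughly symmetric spread.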


Solving The Crime

Finding Those Who Are Close And Crucial To The Group

Sizing each node by its closeness gives a network in which many employees appear to be close to the suspicious group. The filtered graph retains 896 nodes, which we will use to examine their interactions over time.

Joel Closeness transition.png


Joel Closeness.png

Sizing each node by its betweenness reveals four big players in the company, who are likely to be very influential people.

Big4.png

An overview of the network graph, with betweenness as the centrality measure, shows how important the big four are to the company.

Joel Betweeness.png

Suspicious Purchases

The bulk of the purchase orders were also made by the big four: Tobi, Meryl, Lizbeth and Richard, with Richard accounting for the most at a total of 15 purchases. This is unlikely to be a coincidence, and further investigation should be conducted on all four of them.

Suspicious purchases.png

How Have The Organisational Structure & Communications Changed Over Time?

No. | Species Name | Oscillogram
1 | Bent Beat Riffraff | O1.png
2 | Blue Collared Zipper | O2.png
3 | Bombadil | O3.png
4 | Broad Winged Jojo | O4.png
5 | Canadian Cootamum | O5.png
6 | Carries Champagne Pipit | O6.png
7 | Darkwing Sparrow | O7.png
8 | Eastern Corn Skeet | O8.png
9 | Green Tipped Scarlet Pipit | O9.png
10 | Lesser Birchbeere | O10.png
11 | Orange Pine Plover | O11.png
12 | Ordinary Snape | O12.png
13 | Pinkfinch | O13.png
14 | Purple Tooting Tout | O14.png
15 | Qax | O15.png
16 | Queenscoat | O16.png
17 | Rose-Crested Blue Pipit | O17.png
18 | Scrawny Jay | O18.png
19 | Vermillion Trillian | O19.png


Testing Birds


The oscillograms of each of the 15 test birds are as follows.

The predicted species is indicated in the third column, after visualising and comparing the similarity of the amplitude plots. Our results show that the species predicted from the oscillogram matches the species predicted from the envelope plot. This is not a surprise, because the envelope is obtained from the oscillogram.

We plot both because the envelope gives a quick comparison, while the oscillogram provides a more in-depth visualisation.
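
To illustrate how an envelope can be derived from an oscillogram, the sketch below takes a sliding-window maximum of the absolute amplitude. The source does not say which envelope method was used, so this technique, the function name, the window size and the synthetic signal are all assumptions made for illustration.

```python
import math

def envelope(samples, window):
    """Sliding-window maximum of |amplitude|, one value per input sample."""
    half = window // 2
    magnitudes = [abs(x) for x in samples]
    return [
        max(magnitudes[max(0, i - half): i + half + 1])
        for i in range(len(magnitudes))
    ]

# Synthetic signal (assumption): a 50 Hz tone whose loudness rises and falls once.
rate = 1000  # samples per second
signal = [
    math.sin(math.pi * t / rate) * math.sin(2 * math.pi * 50 * t / rate)
    for t in range(rate)
]
env = envelope(signal, window=21)
```

The envelope keeps the overall loudness shape and discards the fast oscillation, which is why it allows a quicker comparison than the full oscillogram.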

ID | Oscillogram | Predicted Species | Same as earlier predicted by envelope?
1 | T1.png | Eastern Corn Skeet | Yes. Though, this is quite close to the Rose-Crested Pipit. However, the Pipit produces more ‘chirps’ per 100 sec, as compared to the Skeet.
2 | T2.png | Rose-Crested Pipit | Yes.
3 | T3.png | Queenscoat | Yes.
4 | T4.png | Bombadil | Yes.
5 | T5.png | Canadian Cootamum | Yes.
6 | T6.png | Qax | Yes.
7 | T7.png | Canadian Cootamum | Yes.
8 | T8.png | Green-Tipped Scarlet Pipit | Yes.
9 | T9.png | Rose-Crested Blue Pipit | Yes.
10 | T10.png | Qax | Yes.
11 | T11.png | Scrawny Jay | Yes.
12 | T12.png | Qax | Yes.
13 | T13.png | Qax | Yes.
14 | T14.png | Bombadil | Yes.
15 | T15.png | Pinkfinch | Yes.