1718t1is428T12 Proposal 1

Introduction & Motivation

In data analytics and visualization, there are unique challenges faced in the security realm as compared to conventional Business Intelligence analytics; rare incidents (i.e. attacks) within massive volumes of data need to be detected. Organizations commonly use Security Information and Event Management (SIEM) software solutions to gather security events from various systems and logs, by deploying real-time alert tools optimized to do event-driven signature matching for a subset of the overall gathered data. As attacks are more sophisticated these days, threats are more difficult to detect with signature-based systems.

Extensive analytics can complement SIEM solutions to identify major anomalies, combine multiple markers of attackers’ actions, and detect customized infections. Identifying malicious behavior is challenging, as we do not always know what we are looking for — anomalous behaviors must be an attack, and some attack signatures are unknown and not upfront, so it is crucial to present findings to security analysts for human-driven analysis.

Objectives

Security analysts may not be data science specialists, so the use of visual data representation and visual analytics can make security data analytics easier to understand. The aim of the project is to help organizations identify high-level security threats and attacks by analyzing large amounts of security data (i.e. firewall and IDS logs) we have collected, such that security analysts are able to deal with such threats and attacks with the findings without being required to have strong data science proficiency.

This project tells a story of an attack on a large organization and the three main objectives are:

Discovering key incidents that took place for the time period recorded in the firewall and IDS logs.
Recognizing a security trend in the firewall and IDS logs over the course of the time period.
Identifying the root causes of the incidents identified. And bearing in mind that corporate networks cannot be shut down or disconnected from the internet, what can done to mitigate the issues.

Background Survey of Related Works

Visualizations	Explanation
Data source: http://www.digitalattackmap.com/	Digital Attack Map is a live data visualization of DDoS attacks around the globe, built through a collaboration between Google Ideas and Arbor Networks. The tool surfaces anonymous attack traffic data to let users explore historic trends and find reports of outages happening on a given day.
Data source: http://www.trendmicro.com.sg/sg/security-intelligence/current-threat-activity/global-botnet-map/index.html	Global botnet threat activity map by Trend Micro shows the monitored malicious network activity to identify command-and-control servers. It enables security analyst to identify which computers are being attack how many often over the past 2 weeks.
Data source: https://www.akamai.com/us/en/solutions/intelligent-platform/visualizing-akamai/real-time-web-monitor.jsp	Global Internet conditions map by Akamai. With this real-time data they identify the global regions target of the greatest web attack traffic, cities with the slowest web connections (latency) and geographic areas with the most web traffic (traffic density).

Datasets

Tools / Libraries

Tableau
Github
Microsoft Excel
Adobe Creative Suites
Sketch
Javascript: sigma.js, vis.js, three.js, D3.js
Python

Technical Challenges

Key Technical Challenges	Detailed Description	Proposed Solution
Unfamiliar with D3.js libraries	D3.js is a JavaScript library for producing dynamic, interactive data visualizations in web browsers.	Go for the d3 workshop Self learning Peer Learning
Data Cleaning and Transformation	The data set are in text format and many other different format. Integration are challenging as there are a lot of manual work to be done.	Delegate workload for cleaning datasets
Data Unavailability	Many actual security attacks and defense data are paid version.	Research databases using school links through the library portal
Determining the Most Optimal Interactive Elements	In order to enable users to understand the data sets, interactive elements needs to be suitable for this project	Develop storyboard Research on security visualization

Project Timeline & Task Assignments

References

Databases: http://vizsec.org/data/
Botnet Dataset: https://www.uvic.ca/engineering/ece/isot/datasets/
D3.js: https://d3js.org/
Detect virus: https://www.virustotal.com

Comments

Please leave comments here.

1718t1is428T12 Proposal 1

Contents

Introduction & Motivation

Objectives

Background Survey of Related Works

Datasets

Tools / Libraries

Technical Challenges

Project Timeline & Task Assignments

References

Comments

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools