Difference between revisions of "Group01 proposal"

From ISSS608-Visual Analytics and Applications
Jump to navigation Jump to search
Line 26: Line 26:
  
 
'''Market Basket Analysis (MBA)''' is one of the key techniques used by large retailers to uncover associations between products brought by consumers. In our case, we apply a sequential version of MBA, called “sequential itemset mining” or “sequential pattern mining”, to analyze whether buying one item in the past indicates a higher likelihood of buying other things in the future. For instance, whether purchasing peanut butter implies sales of bread in the near future.<br>
 
'''Market Basket Analysis (MBA)''' is one of the key techniques used by large retailers to uncover associations between products brought by consumers. In our case, we apply a sequential version of MBA, called “sequential itemset mining” or “sequential pattern mining”, to analyze whether buying one item in the past indicates a higher likelihood of buying other things in the future. For instance, whether purchasing peanut butter implies sales of bread in the near future.<br>
 +
  
 
==Market Basket Analysis==
 
==Market Basket Analysis==
Line 31: Line 32:
 
===What is Market Basket Analysis?===
 
===What is Market Basket Analysis?===
 
'''Market basket analysis''' is a type of data mining technique to find association rules between different objects in a set, find frequent patterns in a transaction database, relational databases or any other information repository.<br>
 
'''Market basket analysis''' is a type of data mining technique to find association rules between different objects in a set, find frequent patterns in a transaction database, relational databases or any other information repository.<br>
 +
 +
This is often used in retail services to understand the products that are being purchased together. This would help the companies in designing their cross-sell or up-selling strategies. For example, assume product A has a relatively higher profit margin and it is often purchased together with product B. The company could bundle these two products to increase the sales of the products in order to increase its profit margin. <br>
 +
  
 
===What is Association Rules?===
 
===What is Association Rules?===
Line 46: Line 50:
 
This visualization technique would complement the association rules mined from market basket analysis. <br>
 
This visualization technique would complement the association rules mined from market basket analysis. <br>
  
== Use Case ==
+
== Case Study: Instacart Grocery Data ==
 
To illustrate how the network visualization technique can complement the association rules derived from market basket analysis, we will be using dataset from Instacart. <br>
 
To illustrate how the network visualization technique can complement the association rules derived from market basket analysis, we will be using dataset from Instacart. <br>
  
Line 52: Line 56:
  
 
=== About Data Set ===
 
=== About Data Set ===
There are overall 3421083 number of orders of 206209 customers over 49688 products in 21 major catagories.<br>
+
Below is the summary of the dataset used: <br>
 +
- There are more than 33,000,000 transactions from about 206,000 unique customers <br>
 +
- The transactions contains the products purchased by the customers under each order <br>
 +
- The sequences of the transactions are also available in the dataset <br>
  
  

Revision as of 00:39, 21 April 2020

Market basket.jpg
Grocery.jpg

Proposal

Poster

Application

Research Paper


Abstract

Market Basket Analysis (MBA) is one of the key techniques used by large retailers to uncover associations between products brought by consumers. In our case, we apply a sequential version of MBA, called “sequential itemset mining” or “sequential pattern mining”, to analyze whether buying one item in the past indicates a higher likelihood of buying other things in the future. For instance, whether purchasing peanut butter implies sales of bread in the near future.


Market Basket Analysis

What is Market Basket Analysis?

Market basket analysis is a type of data mining technique to find association rules between different objects in a set, find frequent patterns in a transaction database, relational databases or any other information repository.

This is often used in retail services to understand the products that are being purchased together. This would help the companies in designing their cross-sell or up-selling strategies. For example, assume product A has a relatively higher profit margin and it is often purchased together with product B. The company could bundle these two products to increase the sales of the products in order to increase its profit margin.


What is Association Rules?

Association Rule Mining can tell you what items do customers frequently buy together by generating a set of rules called Association Rules in form if this then that.

How do we apply Market Basket Analysis?

The applications of Association Rule Mining are found in Marketing, Basket Data Analysis (or Market Basket Analysis) in retailing, clustering and classification. In our project, we apply it particularly by Market Basket Analysis.

Principal indicators of Association Rules:

Key indicator.jpg


Network Visualization Technique

Network Visualization is a technique often used to show the relationships between the different items. As the name suggested, this technique shows the relationship in network type of format, which is easier for the users to understand how different items are related to one another.

This visualization technique would complement the association rules mined from market basket analysis.

Case Study: Instacart Grocery Data

To illustrate how the network visualization technique can complement the association rules derived from market basket analysis, we will be using dataset from Instacart.

This data is downloaded from the The Instacart Online Grocery Shopping Dataset 2017 on Feb 2020. The data dictionary can be found this link.

About Data Set

Below is the summary of the dataset used:
- There are more than 33,000,000 transactions from about 206,000 unique customers
- The transactions contains the products purchased by the customers under each order
- The sequences of the transactions are also available in the dataset


Application UI

Data loaded and preprocessing

Gentric market basket analysis by selecting threshold

Rule network on categories or items of interest

Network.jpg

Time series analysis of selected category or item (line chart or bar chart)

Show the most ranking rules over time & Display rule between two items or categories over time

Barchart.jpg


Objects

  1. Display the overview of relationship between category using Network diagram
  2. Show product bundles that expect to be consumed simultaneously and key indicators of each
  3. Interactively visualize how these bundles evolve over time

Packages Use

Package Description
arules Provides the infrastructure for representing, manipulating and analyzing transaction data and patterns (frequent itemsets and association rules).
arulesViz Extends package 'arules' with various visualization techniques for association rules and item-sets. The package also includes several interactive visualizations for rule exploration.
tidyverse The tidyverse is an opinionated collection of R packages designed for data science.
readr Read Excel Files in R.
dplyr Tools for Splitting, Applying and Combining Data.
ggplot2 Create graphics and charts.
Visnetwork Create graphics and charts.


Storyboard

In this project, we adopted the following principles:
- Design thinking through having periodic check-in with Prof Kam to ensure we are heading the right directions
- Fail early through iterating the prototypes with Prof Kam to land the interface design of the ShinyApp

Prototype

After several rounds of discussions with Prof Kam and several iterations, below is the final prototype for the design apps:

Prototype

Following are key highlights of designed ShinyApps as shown in the prototype:



Note that this final prototype has incorporated the feedbacks Prof Kam provided along the way.

Team Members

References