Difference between revisions of "Analysis of User and Merchant Dropoff for Sugar App Data Source"

From Analytics Practicum
Jump to navigation Jump to search
Line 40: Line 40:
 
JMP Pro will be used to perform the survival analysis.
 
JMP Pro will be used to perform the survival analysis.
  
==<div style="background: #95A5A6; line-height: 0.3em; font-family:helvetica;  border-left: #6C7A89 solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#F2F1EF"><strong>Details of Data Source/strong></font></div></div>==
+
==<div style="background: #95A5A6; line-height: 0.3em; font-family:helvetica;  border-left: #6C7A89 solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#F2F1EF"><strong>Details of Data Source</strong></font></div></div>==
 
We have 3 years of data from Sugar’s database and dashboards, from 2013 to 2015. From our dataset, we have identified certain variables that we will using in our analysis.  
 
We have 3 years of data from Sugar’s database and dashboards, from 2013 to 2015. From our dataset, we have identified certain variables that we will using in our analysis.  
  
Line 57: Line 57:
 
#*Creation Date
 
#*Creation Date
 
#*Updated At Data
 
#*Updated At Data
 
 
#Merchant Brand Data:  
 
#Merchant Brand Data:  
ID
+
#*ID
Manager ID
+
#*Manager ID
Name
+
#*Name
Description
+
#*Description
Creation Date
+
#*Creation Date
Updated At Date
+
#*Updated At Date
 
+
#Merchant Branch Data:
Merchant Branch Data:
+
#*ID
ID
+
#*Brand ID
Brand ID
+
#*Latitude
Latitude
+
#*Longitude
Longitude
+
#*Country
Country
+
#*Name
Name
+
#*Address
Address
+
#*Description
Description
+
#*Redemption Type
Redemption Type
+
#*Redemption Time
Redemption Time
+
#*Rating
Rating
+
#*Enabled
Enabled
+
#*Creation Date
Creation Date
+
#*Updated At Date
Updated At Date
+
#Item Data
 
+
#*ID
Item Data
+
#*Brand ID
ID
+
#*Name
Brand ID
+
#*Description
Name
+
#*Creation Date
Description
+
#*Updated At Date
Creation Date
+
#*Market Price
Updated At Date
+
#Campaign Data (Each campaign is an offer on the app)
Market Price
+
#*ID
 
+
#*Brand ID
Campaign Data (Each campaign is an offer on the app)
+
#*Item ID
ID
+
#*Branch ID
Brand ID
+
#*Is all branch
Item ID
+
#*Category Name
Branch ID
+
#*Start Time
Is all branch
+
#*End Time
Category Name
+
#*Redemption Start Time
Start Time
+
#*Redemption End Time
End Time
+
#*Redemption After Buy
Redemption Start Time
+
#*Start Price
Redemption End Time
+
#*Floor Price
Redemption After Buy
+
#*Unlock Price
Start Price
+
#*Current Price
Floor Price
+
#*Bargain Range
Unlock Price
+
#*Stock
Current Price
+
#*Left
Bargain Range
+
#*Click Count
Stock
+
#*Weight
Left
+
#*Needs Booking
Click Count
+
#*Allows Take Out
Weight
+
#*Tips(description)
Needs Booking
+
#*Enabled
Allows Take Out
+
#*Creation Date
Tips(description)
+
#*Updated At Date
Enabled
+
#Campaign Skim Data
Creation Date
+
#*ID
Updated At Date
+
#*Campaign ID
 
+
#*Branch ID
Order Data (when a user buys something from a campaign)
+
#*User ID
ID
+
#*Amount
Creator ID
+
#*Hour
Owner ID
+
#*Coordinates
Campaign ID
+
#Campaign Click Data
Branch ID
+
#*ID
Price
+
#*Campaign ID
Status
+
#*Branch ID
Trade Number
+
#*User ID
Redemption Date
+
#*Amount
Expiration Date
+
#*Hour
Is Commented
+
#*Coordinates
Is Deleted
+
#Campaign Buy Data
Is Rewarded
+
#*ID
Longitude
+
#*Campaign ID
Latitude
+
#*Branch ID
Creation Date
+
#*User ID
Updated At Date
+
#*Amount
Refunded At Date
+
#*Hour
 +
#*Coordinates
 +
#Campaign View Data
 +
#*ID
 +
#*Campaign ID
 +
#*Branch ID
 +
#*User ID
 +
#*Amount
 +
#*Hour
 +
#*Coordinates
 +
#Campaign Spread Data
 +
#*ID
 +
#*Campaign ID
 +
#*Branch ID
 +
#*User ID
 +
#*Amount
 +
#*Hour
 +
#*Coordinates
 +
#Order Data (when a user buys something from a campaign)
 +
#*ID
 +
#*Creator ID
 +
#*Owner ID
 +
#*Campaign ID
 +
#*Branch ID
 +
#*Price
 +
#*Status
 +
#*Trade Number
 +
#*Redemption Date
 +
#*Expiration Date
 +
#*Is Commented
 +
#*Is Deleted
 +
#*Is Rewarded
 +
#*Longitude
 +
#*Latitude
 +
#*Creation Date
 +
#*Updated At Date
 +
#*Refunded At Date
  
 
==<div style="background: #95A5A6; line-height: 0.3em; font-family:helvetica;  border-left: #6C7A89 solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#F2F1EF"><strong>Exploratory Data Analysis Data Preparation</strong></font></div></div>==
 
==<div style="background: #95A5A6; line-height: 0.3em; font-family:helvetica;  border-left: #6C7A89 solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#F2F1EF"><strong>Exploratory Data Analysis Data Preparation</strong></font></div></div>==

Revision as of 19:40, 10 January 2016

Home

 

Project Overview

 

Findings

 

Project Documentation

 

Project Management

Background Data Source Methodology

Data Source

We will be using Sequel Pro to connect to Sugar’s SQL database in order to extract any necessary data. We will also be logging on to their Flurry or Localytics dashboard to extract any further information that we may need.

JMP Pro will be used to perform the survival analysis.

Details of Data Source

We have 3 years of data from Sugar’s database and dashboards, from 2013 to 2015. From our dataset, we have identified certain variables that we will using in our analysis.

However, before embarking on our analysis, we have to do extensive data cleaning. For example, we have 45,000 users but a number of them are throwaway test accounts, or some are dead on arrival with no orders to their name.

The users and merchants also belong to different regions, namely Singapore, Jakarta, Hong Kong and Beijing. Thus, we will have to segment them as their behaviour may differ from region to region.

  1. User Data
    • ID
    • Latitude
    • Longitude
    • Timezone
    • Country
    • Username
    • Email
    • Creation Date
    • Updated At Data
  2. Merchant Brand Data:
    • ID
    • Manager ID
    • Name
    • Description
    • Creation Date
    • Updated At Date
  3. Merchant Branch Data:
    • ID
    • Brand ID
    • Latitude
    • Longitude
    • Country
    • Name
    • Address
    • Description
    • Redemption Type
    • Redemption Time
    • Rating
    • Enabled
    • Creation Date
    • Updated At Date
  4. Item Data
    • ID
    • Brand ID
    • Name
    • Description
    • Creation Date
    • Updated At Date
    • Market Price
  5. Campaign Data (Each campaign is an offer on the app)
    • ID
    • Brand ID
    • Item ID
    • Branch ID
    • Is all branch
    • Category Name
    • Start Time
    • End Time
    • Redemption Start Time
    • Redemption End Time
    • Redemption After Buy
    • Start Price
    • Floor Price
    • Unlock Price
    • Current Price
    • Bargain Range
    • Stock
    • Left
    • Click Count
    • Weight
    • Needs Booking
    • Allows Take Out
    • Tips(description)
    • Enabled
    • Creation Date
    • Updated At Date
  6. Campaign Skim Data
    • ID
    • Campaign ID
    • Branch ID
    • User ID
    • Amount
    • Hour
    • Coordinates
  7. Campaign Click Data
    • ID
    • Campaign ID
    • Branch ID
    • User ID
    • Amount
    • Hour
    • Coordinates
  8. Campaign Buy Data
    • ID
    • Campaign ID
    • Branch ID
    • User ID
    • Amount
    • Hour
    • Coordinates
  9. Campaign View Data
    • ID
    • Campaign ID
    • Branch ID
    • User ID
    • Amount
    • Hour
    • Coordinates
  10. Campaign Spread Data
    • ID
    • Campaign ID
    • Branch ID
    • User ID
    • Amount
    • Hour
    • Coordinates
  11. Order Data (when a user buys something from a campaign)
    • ID
    • Creator ID
    • Owner ID
    • Campaign ID
    • Branch ID
    • Price
    • Status
    • Trade Number
    • Redemption Date
    • Expiration Date
    • Is Commented
    • Is Deleted
    • Is Rewarded
    • Longitude
    • Latitude
    • Creation Date
    • Updated At Date
    • Refunded At Date

Exploratory Data Analysis Data Preparation