Difference between revisions of "ANLY482 AY2016-17 T2 Group4: Project Overview"

From Analytics Practicum
Jump to navigation Jump to search
Line 32: Line 32:
  
 
==Motivation & Business Problem==
 
==Motivation & Business Problem==
Lazada is still small compared to major competitors (e.g. Aliexpress / Amazon / Tao Bao) and it still needs to continue expanding and strengthening its user base before it can start to focus on realizing its profit goals. Despite its small size, it has significant market penetrations across many different countries in South-East Asia. That said, as a relative newcomer in the industry, it needs to prepare to for the serious market competition that Amazon might bring in the future. One way to provide a barrier to entry towards Amazon is to understand its buyers and sellers better and offer more effective strategies to enhance their experiences.
+
INTENT
 +
Improve the transparency of information useful in identifying a seller’s performance to customers and sellers.
 +
PROBLEM
 +
- Customers aren’t able to identify which are the best sellers to purchase products from.
 +
- The characteristics of sellers that matter to a customer aren’t clearly defined to sellers who want to manage and improve their performance.
 +
 
 
<br>
 
<br>
=== Current Issues ===
 
  
====  ''Difficulty navigating product searches catalog'' ====  
+
== Project Objectives ==  
Lazada currently has millions of products listed where many versions of the same product are being sold. Buyers may find their online experience hindered by many conflicting factors in making a decision, such as lower prices, shorter time to delivery, and higher transaction reputation. To help buyers gain quick access to the best products in the catalogue/search results, intelligent ranking orders of all available product items needs to be generated to help buyers make preferred decisions and sellers get better sales.
+
We will identify critical features that can allow sellers to measure and manage their performance on Lazada’s platform.
+
The aforementioned features will be exposed to the customers to help them identify the better/best sellers to purchase from.
====  ''Difficulty measuring features which lead to higher conversion rates and better customer experience''  ====
 
One of the key drivers of a customer’s experience throughout the purchase journey (from online browsing to receiving a product) is the product’s quality. However, product quality is hard to determine due to no fixed way of measuring it. A methodology is required to assess what attributes contribute most to customers’ impression of product quality. On top of that, there are other useful features that have yet to be explored, including seller rating.  
 
  
 
+
== Constraints ==
== Project Objectives ==  
+
Production ready: Run data pipeline within 3 hours with 16gb RAM
As such, the objective of our team is to model possible predictors for conversion rate and customer experience through measures of product quality and seller rating features.This will be used to derive recommendations in Lazada's on-site ranking system to enhance customer experience through quick access to the best products in customer’s catalogue.
 
<br>
 
  
 
== Project Details ==
 
== Project Details ==
Line 52: Line 52:
 
<br>
 
<br>
  
===Product & Seller Features (Predictive Variables)===
+
===Predictive Variables(Seller Attributes)===
Product Quality (online)
+
Shipping Time
- What factors have an effect on conversion and customer experience?
+
Pricing
- How should we adjust our website to improve online engagement, and NPS?
+
Return Rate
<br/>
+
Seller Initiated Cancellation Rate
Product Quality (delivery)
+
Seller Category ( e.g. home & living , fashion, multi-category sellers)  
- What factors have an effect on conversion and customer experience?
+
Size of Seller
- What can we do to improve customer NPS after they have received their product?
+
Seller’s Years of experience on Lazada
<br/>
 
Seller Rating
 
- What factors have an effect on conversion and customer experience?
 
- Which factors should we prioritize when building a seller dashboard?
 
 
<br/>
 
<br/>
  
===Conversion Rate & Customer Experience Metrics===
+
===Response Variables (Seller Performance Metrics)===
====Conversion Rate====
+
Total purchases made per sales item
Conversion Rate is the percentage of visits which results in e-commerce transactions (sales). It helps to calculate how many visitors (shoppers) are actually turning into buyers (customers). High or higher conversion rate is the ultimate target for the business.
+
Product Popularity Ratio (PPR) = Total Purchase / Distinct Count of products
<br/>
+
 
 
====Customer Experience====
 
Customer Experience is the entire interactions that a customer has with the brand, organization, or e-commerce platform in this case, and an organization should aim to deliver most enjoyable and usable experience to existing and potential customers.
 
<br/><br/>
 
Formulas:
 
<br/>
 
'''Entire Website:''' E-commerce Conversion Rate = (Total E-commerce transactions / total visits on a website) * 100
 
<br/>
 
'''By Product:'''  E-commerce Conversion Rate for a product = (Product Transaction / Product page visits) * 100
 
<br/><br/>
 
For this project, we can get Conversion Rate from data collected, or get calculated Conversion Rate from Google Analytics platform since Lazada Group holds GA account.
 
<br/><br/>
 
Another important aspect of Conversion Rate is the factors that have high correlation with high Conversion Rate (eg) Shorter payment process, coupon. This will be examined by correlation and/or regression analysis from softwares like SPSS.
 
<br/>
 
 
 
 
 
Formulas:
 
<br/>
 
'''Net Promoter Score (NPS): % of Promoters - % of Detractors'''
 
<br/>
 
Promoters (score 9-10): loyal customers who keep buying and referring to others
 
<br/>
 
Detractors (score 0-6): dissatisfied customers who might spread negative comments and word-of-mouth
 
 
 
 
===Data Source===
 
===Data Source===
  
 
<br>
 
<br>
===Nature of Dataset===
+
Sensitive Data (Not to be revealed)
 
 
To be obtained on approval (11th January).
 
 
 
  
  

Revision as of 23:38, 18 February 2017


TeamInsured Home.png   HOME

 

TeamInsured About Icon.png   PROJECT OVERVIEW

 

TeamInsured Findings.png   PROJECT FINDINGS

 

TeamInsured PM.png   PROJECT MANAGEMENT

 

TeamInsured Documentation.png   DOCUMENTATION


Motivation & Business Problem

INTENT Improve the transparency of information useful in identifying a seller’s performance to customers and sellers. PROBLEM - Customers aren’t able to identify which are the best sellers to purchase products from. - The characteristics of sellers that matter to a customer aren’t clearly defined to sellers who want to manage and improve their performance.


Project Objectives

We will identify critical features that can allow sellers to measure and manage their performance on Lazada’s platform. The aforementioned features will be exposed to the customers to help them identify the better/best sellers to purchase from.

Constraints

Production ready: Run data pipeline within 3 hours with 16gb RAM

Project Details

System Architecture

LazadaSA.png


Predictive Variables(Seller Attributes)

Shipping Time Pricing Return Rate Seller Initiated Cancellation Rate Seller Category ( e.g. home & living , fashion, multi-category sellers) Size of Seller Seller’s Years of experience on Lazada

Response Variables (Seller Performance Metrics)

Total purchases made per sales item Product Popularity Ratio (PPR) = Total Purchase / Distinct Count of products

Data Source


Sensitive Data (Not to be revealed)


Methodology

LazadaMethodology.png


Data Collection

This will be done to form the pipeline of data extraction from Lazada database and Google Analytics. The challenge is to properly pull out quality data from the relevant and updated sources.

Data Exploration and Cleaning

Manage exploratory analysis of these data. These analysis will be used to improve on business questions which also affect the exploratory analysis. This process will be done repeatedly with necessary data cleaning and munging until we find business questions which accurately express business needs given the data and exploratory analysis made.

Data Modelling

After a proper exploratory phase of the analysis, we will train and test machine learning models to to answer predictive and prescriptive business questions. This will include processes such as clustering to segment user behaviours, regression to include impacts of various seller attributes to CX Metrics, etc. Various statistical learning models such as Random Forest and Regularization might also be used to reduce risk of overfitting and increase testing accuracy of models.

Data Visualization

These data analysis will be documented visually Jupyter Notebook or interactive dashboard tools which are later demonstrated and presented to business users such as Lazada suppliers and internal teams. Insights presentation techniques such as Storyboarding and Pyramid technique (Barbara Pinto) might also be used to ensure proper presentation to match findings and business needs.