Difference between revisions of "ANLY482 AY2017-18 T2 Group 05 Project Overview"

From Analytics Practicum
Jump to navigation Jump to search
 
(12 intermediate revisions by the same user not shown)
Line 33: Line 33:
 
{| style="background-color:#ffffff; margin: 3px auto 0 auto" width="55%"
 
{| style="background-color:#ffffff; margin: 3px auto 0 auto" width="55%"
 
|-  
 
|-  
! style="font-size:15px; text-align: center; border-top:solid #ffffff; border-bottom:solid #2e2e2e" width="150px"| [[Red Dot Payment_-_Project Overview| <span style="color:#3d3d3d">Background</span>]]
+
! style="font-size:15px; text-align: center; border-top:solid #ffffff; border-bottom:solid #2e2e2e" width="150px"| [[R2Z_-_Project Overview| <span style="color:#3d3d3d">Background</span>]]
 
! style="font-size:15px; text-align: center; border-top:solid #ffffff; border-bottom:solid #ffffff" width="20px"|
 
! style="font-size:15px; text-align: center; border-top:solid #ffffff; border-bottom:solid #ffffff" width="20px"|
  
! style="font-size:15px; text-align: center; border-top:solid #ffffff; border-bottom:solid #ffffff" width="150px"| [[Red Dot Payment_Data Source| <span style="color:#3d3d3d">Data Source</span>]]
+
! style="font-size:15px; text-align: center; border-top:solid #ffffff; border-bottom:solid #ffffff" width="150px"| [[R2Z_-_Data Source| <span style="color:#3d3d3d">Data Source</span>]]
 
! style="font-size:15px; text-align: center; border-top:solid #ffffff; border-bottom:solid #ffffff" width="20px"|
 
! style="font-size:15px; text-align: center; border-top:solid #ffffff; border-bottom:solid #ffffff" width="20px"|
  
! style="font-size:15px; text-align: center; border-top:solid #ffffff; border-bottom:solid #ffffff" width="150px"| [[Red Dot Payment_Methodology| <span style="color:#3d3d3d">Methodology</span>]]
+
! style="font-size:15px; text-align: center; border-top:solid #ffffff; border-bottom:solid #ffffff" width="150px"| [[R2Z_-_Methodology| <span style="color:#3d3d3d">Methodology</span>]]
 
! style="font-size:15px; text-align: center; border-top:solid #ffffff; border-bottom:solid #ffffff" width="20px"|
 
! style="font-size:15px; text-align: center; border-top:solid #ffffff; border-bottom:solid #ffffff" width="20px"|
 
|}
 
|}
 
<!------- Details ---->
 
<!------- Details ---->
==<div style="background: #DD597D; line-height: 0.3em; font-family:calibri;  border-left: #CFCFCF solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#F2F1EF"><strong>Project Introduction and Background</strong></font></div></div>==
+
==<div style="background: #DD597D; line-height: 0.3em; font-family:calibri;  border-left: #CFCFCF solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#F2F1EF"><strong>Introduction</strong></font></div></div>==
Founded by a group of visionary financial technology payment experts in 2011, Red Dot Payment (RDP) has grown into a trusted online payment company providing premium payment solutions and expertise to the brightest merchants across Asia Pacific. Bringing best-in-class practices, RDP is the trusted FinTech partner of banks, merchants, payment schemes, payment gateways, non-banking financial institutions, security and fraud management system providers.  
+
For any entity to conduct business online, they must be able to accept payments from customers, and this involves a third-party to facilitate the online transfer of funds. Such entities are called electronic Payment Gateways , and this includes PayPal, Stripe, and our project sponsor.  
<br><br>
 
In 2016, RDP grew to handle millions of transactions across the globe, managed by a dedicated 40-person strong operation - with headquarters in Singapore, and offices in Thailand and Indonesia. Its products and services include - RDP connect, InstanCollect, InstanPay, InstanPromo and InstanToken.
 
  
==<div style="background: #DD597D; line-height: 0.3em; font-family:calibri;  border-left: #CFCFCF solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#F2F1EF"><strong>Business Problem and Motivation</strong></font></div></div>==
+
When a customer makes a purchase on a merchant’s website, our sponsor helps to process the credit card payment. This is done by transferring key information between the payment portal (e.g. merchant’s website) and the merchant’s registered bank account. For each successful transaction that our sponsor processes, they apply a commission, called a Merchant Discount Rate (MDR) , from the transaction.  
While RDP handles millions of transactions across the globe, the company has not yet been able to derive any conclusive analysis from its transaction data. There is untapped potential that is unexplored here, as they operate online and thus have the advantage of easily collecting large amounts of valuable data about their merchants as well as their customers. In addition, there are many data points captured previously with little insights derived, resulting in business decisions being driven mainly by intuition. Thus, by providing our team with their existing data, RDP hopes to gather a deeper understanding of their data.  Moving forward, RDP hopes to make more informed decisions utilising the patterns we identify from the data.
 
<br><br>
 
After meeting with our client to provide updates and generate insights for them, these are the following areas that our client is interested in:
 
<br><br>
 
'''Industry Strategy:''' RDP is interested to find if our data analysis can provide insights as to which markets or industries they should target in retaining or recruiting new merchants
 
<br>
 
'''Visual Dashboard:''' The creation of a visual dashboard to showcase our data analysis will be well anticipated by RDP
 
  
==<div style="background: #DD597D; line-height: 0.3em; font-family:calibri;  border-left: #CFCFCF solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#F2F1EF"><strong>Meetings with Project Coordinator, Project Client and Team</strong></font></div></div>==
+
==<div style="background: #DD597D; line-height: 0.3em; font-family:calibri;  border-left: #CFCFCF solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#F2F1EF"><strong>Business Motivations and Objectives</strong></font></div></div>==
Our team aims to meet our Project Coordinator, Professor Kam Tin Seong on average of at least once every week in order to ensure that we are progressing on the right track. We also met our client/project sponsor once a month to update them on our progress and validate our analysis, while our internal team meetings are held at least twice a week. All minutes can be found under "Project Documentation".
+
While our sponsor handles millions of transactions across the globe, the company has not fully been able to derive any conclusive analysis from its transaction data.
 +
 
 +
By providing our team with their data, our sponsor hopes to gather a deeper understanding of it. The project objectives include developing meaningful insights by performing exploratory data analysis. Using the results drawn from the findings, recommendations will be formulated to aid in future business decision making. 
 +
 
 +
==<div style="background: #DD597D; line-height: 0.3em; font-family:calibri;  border-left: #CFCFCF solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#F2F1EF"><strong>Meetings with Project Coordinator, Project Sponsor and Team</strong></font></div></div>==
 +
Our team aims to meet our Project Coordinator, Professor Kam Tin Seong on average of at least once every week in order to ensure that we are progressing on the right track. We also met our project sponsor once a month to update them on our progress and validate our analysis, while our internal team meetings are held at least twice a week. All minutes can be found under "Project Documentation".
  
 
==<div style="background: #DD597D; line-height: 0.3em; font-family:calibri;  border-left: #CFCFCF solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#F2F1EF"><strong>Project Objectives</strong></font></div></div>==
 
==<div style="background: #DD597D; line-height: 0.3em; font-family:calibri;  border-left: #CFCFCF solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#F2F1EF"><strong>Project Objectives</strong></font></div></div>==
RDP has collected large amounts of data on transactions with their merchants and respective customers, but they are unaware of how to utilise this data to supplement their business operations. Our project aims to help RDP make sense of their data, to make profound observations from the datasets and to develop recommendations that can help guide their future business decisions.
+
We aim to study the complexity of the online transactions carried out by our sponsor’s merchants. In particular, we hope to assess the performance of our sponsor’s merchants, by using number of approved transactions per merchant as a benchmark.
<br><br>
+
 
Previously, we have identified the following objectives in our proposal:
+
This project shows how we use funnel plots and line of best-fit graphs to compare the performance among entities in a group in an unbiased manner. Though both funnel plots and line of best-fit graphs are prevalent in medical research , they are under-utilized in e-commerce analytics. In our case study, the line of best-fit graphs provide a better model fit. By combining features of funnel plot with line of best-fit, we can identify star and laggard merchants.  
<br><br>
+
 
1. To observe and develop meaningful insights from RDP’s datasets by performing exploratory data analysis.
+
Moving on, we want to study the characteristics of transactions carried out by key merchants. We use logistic fit to establish any correlation between independent variables and the number of approved transactions. For independent variables that exhibit correlation, we conduct further analysis by utilizing decision trees to map expected approved-to-rejected transaction ratio for key merchants.
<br>
+
 
2.  To develop a visual dashboard. This will aid RDP in understanding some of their current business issues, as well as benchmark metrics and attributes that the company may not be currently analysing.
+
In summary, we show how exploratory and confirmatory techniques can be used as source of business intelligence - setting performance benchmarks for each merchant and improving their  approved-to-rejected transaction ratio.
<br>
 
3.  To formulate recommendations to improve their future business activities based on our findings.  
 
<br><br>
 
Moving forward, on top of the above project objectives, we have further added on new objectives we aim to achieve in our study in terms of exploratory data analysis:
 
<br><br>
 
<big><b>Main Objective:</b> '''Identification of Star and Laggard merchants'''</big> <br><br>
 
4. Compare the performance of merchants in terms of approved transactions, and identify top merchants contributing to the highest number and value of approved transactions. These merchants are valuable to RDP, since it earns commission based on the monetary value of the approved transactions carried out by its merchants and its commission rate is determined by the volume of approved transactions. Thus, we hope that our analysis will allows RDP to better identify these star merchants and invest its efforts to retain and attract them.
 
<br><br>
 
5. Identify merchants who are underperforming in terms of number and value approved transactions, and merchants who contribute to the highest number of rejected transaction. Thereafter, we hope to set performance benchmark for these laggard merchants should RDP intend to retain them.
 
<br><br>
 
<big><b>Sub-Objectives:</b></big>
 
<br>
 
6. In order to visualise and compare the performance among merchants on a fairer ground, we seek to group merchants into bins based on the number of transactions per merchant.
 
<br><br>
 
7. Identify any relationship between the approved/rejected transactions, and (i) the month of the year, (ii) day of the week and (iii) time of the day.  
 
  
 
==<div style="background: #DD597D; line-height: 0.3em; font-family:calibri;  border-left: #CFCFCF solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#F2F1EF"><strong>Scope of Project</strong></font></div></div>==
 
==<div style="background: #DD597D; line-height: 0.3em; font-family:calibri;  border-left: #CFCFCF solid 15px;"><div style="border-left: #FFFFFF solid 5px; padding:15px;font-size:15px;"><font color= "#F2F1EF"><strong>Scope of Project</strong></font></div></div>==
Line 95: Line 76:
 
| Project Proposal || Prepare project proposal and Wikipage
 
| Project Proposal || Prepare project proposal and Wikipage
 
|-
 
|-
| Data Exploration and Preparation || Ensuring that data is clean and can be analysed using analytical software; We also have to transform data (e.g. remove outliers and recode variables for analytical software to read); and conduct exploratory data analysis
+
| Data Exploration and Preparation || Ensuring that data is clean and can be analysed using analytical software; We have done data preparation, which include – Interactive Binning and finding the top reason code descriptions. After which, we conducted exploratory data analysis.
 
|-
 
|-
| Model Building || Through research findings and experience, we will attach and test suitable models to our data. We have used Interactive Binning and Time Series data analysis to generate insights
+
| Model Building || Through research findings and experience, we will attach suitable models to our data. We have used Interactive Binning, Line of Fit and Time Series Analysis to generate insights.
 
|-
 
|-
| Project Revision (Mid-Term) || Assisted by RDP through obtaining feedback during our sponsor meeting
+
| Project Revision (Mid-Term) || Assisted by RDP through obtaining feedback during our sponsor meeting.
 
|-
 
|-
| Mid-term Preparation || Prepare mid-term report, presentation and Wikipage
+
| Mid-term Preparation || Prepare mid-term report, presentation and Wikipage.
 
|-
 
|-
| Model Validation and Refinement || Perform Time Series Clustering; Conduct independent sample t-tests (e.g.  Ensure the results are similar when attached to different years of study) and refine analysis of data
+
| Model Validation and Refinement || Conduct independent sample t-tests (e.g.  Ensure the results are similar when attached to different years of study) and refine analysis of data.
 
|-
 
|-
| Insights and Recommendations || Create visualisation from analysis results and formulate recommendations for our client
+
| Insights and Recommendations || Create visualisation from analysis results and formulate recommendations for our sponsor.
 
|-
 
|-
| Project Revision || Assisted by RDP through obtaining feedback during our client meeting; Align our final deliverables with client requirements
+
| Project Revision || Assisted by sponsor through obtaining feedback during our sponsor meeting; Align our final deliverables with sponsor requirements.
 
|-
 
|-
| Final Preparation || Prepare abstract and full paper, final Wikipage update and final presentation with necessary deliverables
+
| Final Preparation || Prepare abstract and full paper, final Wikipage update and final presentation with necessary deliverables.
 
|}
 
|}

Latest revision as of 15:44, 15 April 2018

HOME

 

PROJECT OVERVIEW

 

PROJECT FINDINGS

 

PROJECT DOCUMENTATION

 

PROJECT MANAGEMENT

 

ANLY482 HOMEPAGE

Background Data Source Methodology

Introduction

For any entity to conduct business online, they must be able to accept payments from customers, and this involves a third-party to facilitate the online transfer of funds. Such entities are called electronic Payment Gateways , and this includes PayPal, Stripe, and our project sponsor.

When a customer makes a purchase on a merchant’s website, our sponsor helps to process the credit card payment. This is done by transferring key information between the payment portal (e.g. merchant’s website) and the merchant’s registered bank account. For each successful transaction that our sponsor processes, they apply a commission, called a Merchant Discount Rate (MDR) , from the transaction.

Business Motivations and Objectives

While our sponsor handles millions of transactions across the globe, the company has not fully been able to derive any conclusive analysis from its transaction data.

By providing our team with their data, our sponsor hopes to gather a deeper understanding of it. The project objectives include developing meaningful insights by performing exploratory data analysis. Using the results drawn from the findings, recommendations will be formulated to aid in future business decision making.

Meetings with Project Coordinator, Project Sponsor and Team

Our team aims to meet our Project Coordinator, Professor Kam Tin Seong on average of at least once every week in order to ensure that we are progressing on the right track. We also met our project sponsor once a month to update them on our progress and validate our analysis, while our internal team meetings are held at least twice a week. All minutes can be found under "Project Documentation".

Project Objectives

We aim to study the complexity of the online transactions carried out by our sponsor’s merchants. In particular, we hope to assess the performance of our sponsor’s merchants, by using number of approved transactions per merchant as a benchmark.

This project shows how we use funnel plots and line of best-fit graphs to compare the performance among entities in a group in an unbiased manner. Though both funnel plots and line of best-fit graphs are prevalent in medical research , they are under-utilized in e-commerce analytics. In our case study, the line of best-fit graphs provide a better model fit. By combining features of funnel plot with line of best-fit, we can identify star and laggard merchants.

Moving on, we want to study the characteristics of transactions carried out by key merchants. We use logistic fit to establish any correlation between independent variables and the number of approved transactions. For independent variables that exhibit correlation, we conduct further analysis by utilizing decision trees to map expected approved-to-rejected transaction ratio for key merchants.

In summary, we show how exploratory and confirmatory techniques can be used as source of business intelligence - setting performance benchmarks for each merchant and improving their approved-to-rejected transaction ratio.

Scope of Project

Task Description
Gather Requirements Confirm and gather sponsor requirements
Initial Research and Preparation Conduct preliminary data exploration and define project objectives and scope
Project Proposal Prepare project proposal and Wikipage
Data Exploration and Preparation Ensuring that data is clean and can be analysed using analytical software; We have done data preparation, which include – Interactive Binning and finding the top reason code descriptions. After which, we conducted exploratory data analysis.
Model Building Through research findings and experience, we will attach suitable models to our data. We have used Interactive Binning, Line of Fit and Time Series Analysis to generate insights.
Project Revision (Mid-Term) Assisted by RDP through obtaining feedback during our sponsor meeting.
Mid-term Preparation Prepare mid-term report, presentation and Wikipage.
Model Validation and Refinement Conduct independent sample t-tests (e.g. Ensure the results are similar when attached to different years of study) and refine analysis of data.
Insights and Recommendations Create visualisation from analysis results and formulate recommendations for our sponsor.
Project Revision Assisted by sponsor through obtaining feedback during our sponsor meeting; Align our final deliverables with sponsor requirements.
Final Preparation Prepare abstract and full paper, final Wikipage update and final presentation with necessary deliverables.