Difference between revisions of "ANLY482 AY2017-18T2 Group32 : Project Overview / Data"

From Analytics Practicum
Jump to navigation Jump to search
 
(5 intermediate revisions by the same user not shown)
Line 33: Line 33:
  
 
==<div style="background: #404040; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 0px; font-size: 16px"><font color=#ffffff >5.0 Data</font></div>==
 
==<div style="background: #404040; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 0px; font-size: 16px"><font color=#ffffff >5.0 Data</font></div>==
'''<big><font color="#fcb706">5.1 Data Sample</font></big><br>
+
'''<big><font color="#f6a228">5.1 Data Sample</font></big><br>
 
Pharma G has kindly provided us with sample data in the form of csv files. The data we have obtained consists of five datasets - HCP, Invoice, Brand Targets, Therapy Group and Customer 360 (Telesales) data that are gathered from January 2015 to February 2018.  
 
Pharma G has kindly provided us with sample data in the form of csv files. The data we have obtained consists of five datasets - HCP, Invoice, Brand Targets, Therapy Group and Customer 360 (Telesales) data that are gathered from January 2015 to February 2018.  
  
'''<big><font color="#fcb706">5.2 Metadata</font></big><br>
+
'''<big><font color="#f6a228">5.2 Metadata</font></big><br>
 
With a clear understanding of what our sponsor had in mind, our team began data munging process. This includes data cleaning, transformation, and integration to obtain an integrated data table via JMP for further analysis.  
 
With a clear understanding of what our sponsor had in mind, our team began data munging process. This includes data cleaning, transformation, and integration to obtain an integrated data table via JMP for further analysis.  
  
Line 48: Line 48:
 
|'''HCP'''
 
|'''HCP'''
 
|List of healthcare practitioners
 
|List of healthcare practitioners
|7553
+
|7405
 
|-
 
|-
 
|'''Invoice'''
 
|'''Invoice'''
Line 67: Line 67:
 
|-
 
|-
 
|}
 
|}
 
'''<big><font color="#fcb706">5.3 Initial Data Observation</font></big><br>
 
For Invoice Data, after filtering for the TCE brands, here is a distribution of all the TCE brands and its sales performance.
 
 
[[File:Data 1.png | 600px | center]]
 
[[File:Data 2.png | 600px | center]]
 
<br>
 

Latest revision as of 02:42, 17 April 2018

Pharma G Logo.png

Home

About Us

Project Overview

Project Findings

Project Management

Documentation

ANLY482 AY2017-18 Main Page

Description Data Methodology

5.0 Data

5.1 Data Sample
Pharma G has kindly provided us with sample data in the form of csv files. The data we have obtained consists of five datasets - HCP, Invoice, Brand Targets, Therapy Group and Customer 360 (Telesales) data that are gathered from January 2015 to February 2018.

5.2 Metadata
With a clear understanding of what our sponsor had in mind, our team began data munging process. This includes data cleaning, transformation, and integration to obtain an integrated data table via JMP for further analysis.

Please refer to Table 2 below for a more detailed description of the data provided.

File Name Description Count
HCP List of healthcare practitioners 7405
Invoice Transaction history of all customers across all products 337518
Brand Targets List of monthly sales target for all products 516
Therapy Group List of product categories for TCE brand products 44
Customer 360 (Telesales) How each customer was contacted by a TCE Account Representative via phone 30240