ANLY482 AY2016-17 T2 Group10 Project Overview: Data
Data Summary
With the understanding of our sponsor’s motivations, our team sets in motion the data wrangling process – one that encompasses data cleaning, transformation and integration to obtain a consolidated JMP data table used for further analysis. For this research study, we have obtained a year’s worth of data from 2016. It consists of information on invoices, call details, employees and customers – each of which described in the summary table below
File Name | Description | No. of Rows |
Call Details | Information on actual interaction between Sales Reps, Sale Targets for a Product Brand | 42915 |
Invoice Details | Transactions of product purchases by Sales Targets | 110372 |
Employee | Information on employees and their teams - “Therapy Area” | 237 |
HCP | Information on individual doctors | 5871 |
HCO | Information on clinics, organizations | 4425 |
Tools Used
SAS JMP Pro 13 is chosen as our primary tool for data preparation, exploratory and further analysis. It is an analytical software that can perform most statistical analysis on large datasets and generate results with interactive visualizations used by data scientists to manipulate data selection on the go. Furthermore, tutorials and guides are widely available online for us to learn JMP Pro 13’s different techniques and functions.
More importantly, its easy-to-use built-in tools enable us to conduct analysis of variance to determine relationship between interactions and sales revenue.
Data Dictionary
The data dictionary is available here