ANLY482 AY2017-18 T2 Group 17 Findings Finals

From Analytics Practicum
Revision as of 15:36, 14 April 2018 by Louis.tan.2014 (talk | contribs) (Created page with "__NOEDITSECTION__ __NOTOC__ <!--Header Start--> {|style="background-color:#6A8D9D; color: #F5F5F5; padding: 10 0 10 0;" width="100%" cellspacing="0" cellpadding="0" valign="to...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search


HOME

 

FINDINGS

 

PROJECT DOCUMENTATION

 

PROJECT MANAGEMENT

 

ABOUT US

 

ANLY482 HOMEPAGE

Exploratory Data Analysis Confirmatory Data Analysis

Wilcoxon test

In attempt to analyse user behaviour pattern, the time spent on each chapter is calculated using the proxy data. On the condition that ‘SessionID’ is similar for each row, the proxy for time spend on each chapter (t) is calculated using the following equation;

datetime(t-1) - datetime(t)

As time spent per chapter is a calculated field, prior information of the distribution is unknown. As such, a parametric test of means comparison between different strata will not be appropriate as certain assumption will have to be made on the distribution for instance if data follows a normal distribution. Therefore, a non-parametric test is performed on the data instead. Since the data is highly skewed towards the left-hand side, a Wilcoxon test is used to analyse if there is a significant difference in time spent between each strata of interest. In Wilcoxon test, comparison is done using the medium of each group. Using medium as a benchmark will help minimize the biasness resulting from the skewed population. In the analysis the groups of interests are as follows;
1. Analysis by distinct user utilization of books
2. Analysis by chapter view and chapter downloads
3. Analysis by different user groups


Analysis by distinct user utilization of books