Difference between revisions of "AY1516 T2 Group 18 Data"

From Analytics Practicum
Jump to navigation Jump to search
Line 33: Line 33:
 
==<div style="background:#ff4fa7; padding: 10px; font-size: 14px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #D3D3D3 solid 25px;"><font color="white">Data</font></div>==
 
==<div style="background:#ff4fa7; padding: 10px; font-size: 14px; font-weight: bold; line-height: 1em; text-indent: 15px; border-left: #D3D3D3 solid 25px;"><font color="white">Data</font></div>==
  
This study focuses on 2 main markets in the APAC region - Singapore and Malaysia. The data was collected and weighed afterwards in proportion of population representative for both countries.<br><br>
+
<b>Data Provided</b><br>
+
The Singapore and Malaysia data provided to us by TNS was collected via online panels and weighed afterwards in proportion of population representative for both countries. The data was presented to us in an SPSS format and is cleaned. This is because the online questionnaires in the online panels were programmed in a way where logic checks and routing were done during the data collection process. In addition, the data collected was also automatically coded in the background based on the specifications in the survey questionnaire, which was also given to us.<br><br>
The data was presented to us in an SPSS format and is cleaned. This is because the data was collected via online using panels, where online questionnaires are programmed in a way where logic checks and routing would have been done. Furthermore, the data collected was automatically coded in the background based on the specifications in the survey questionnaire. <br><br>
 
 
TNS has provided with a set of data, and the following variables have been identified to be useful to answer the objectives of our project:<br><br>
 
<b>Target Consumer Profile</b>
 
*Gender
 
*Age
 
*Education
 
*Employment status
 
*Household structure
 
*Household decision maker
 
*Household income
 
*Device ownership/usage
 
*Frequency of online activities
 
*Social/IM usage
 
*Favorite social/IM
 
*Digital engagement
 
*Social influence
 
*Products purchased P4W/P12M
 
*Products purchased online
 
*eCommerce barriers
 
*Shopper mission - short/long cycle
 
*Open or decided
 
*Pre-purchase action
 
<br>
 
<b>Digital Media Platforms</b>
 
*Device and media usage
 
*Typical day
 
*Time spent on device/media/activities
 
*Activity usage by daypart
 
*Engagement methods
 
*Category engagement
 
*Role of social
 
*eCommerce websites
 
<br>
 
<b>Devices</b>
 
*Device purchase intention
 
*Proportion of activities by device
 
  
 +
<b>Data Preparation</b><br>
 +
Even though the data presented to us has been preliminarily cleaned by the online panels, there is still a need to go through a second stage of cleaning to ensure that the data is relevant for the purpose of this project. There is also a need to filter out outliers and anomalies. As we are only focusing on one particular sector, we also need to filter out all the questions catered specifically for other irrelevant market sectors. This allows us to focus on the key objectives better and provide more meaningful visualization.<br><br>
 +
 +
In order to provide more in-depth analysis and insights, we will need to explore the data to understand various aspects of the data further. After which, we will also need to manipulate certain variables to create multi-dimensional views of the data. E.g. creation of a derivative variable.<br><br>
 +
 +
<i>Note:</i><br>
 +
We have identified variables from the dataset to be useful for the purpose of our analysis. However. due to NDA with the company, more information will be only be made available in the project proposal.<br>
 +
Sample dataset will be in the project proposal as well.
  
 
<div align="left"> <!-- END CHUNK-->
 
<div align="left"> <!-- END CHUNK-->

Revision as of 06:58, 12 January 2016

HOME

 

PROJECT OVERVIEW

 

DATA

 

PROJECT MANAGEMENT

 

DOCUMENTATION


Data

Data Provided
The Singapore and Malaysia data provided to us by TNS was collected via online panels and weighed afterwards in proportion of population representative for both countries. The data was presented to us in an SPSS format and is cleaned. This is because the online questionnaires in the online panels were programmed in a way where logic checks and routing were done during the data collection process. In addition, the data collected was also automatically coded in the background based on the specifications in the survey questionnaire, which was also given to us.

Data Preparation
Even though the data presented to us has been preliminarily cleaned by the online panels, there is still a need to go through a second stage of cleaning to ensure that the data is relevant for the purpose of this project. There is also a need to filter out outliers and anomalies. As we are only focusing on one particular sector, we also need to filter out all the questions catered specifically for other irrelevant market sectors. This allows us to focus on the key objectives better and provide more meaningful visualization.

In order to provide more in-depth analysis and insights, we will need to explore the data to understand various aspects of the data further. After which, we will also need to manipulate certain variables to create multi-dimensional views of the data. E.g. creation of a derivative variable.

Note:
We have identified variables from the dataset to be useful for the purpose of our analysis. However. due to NDA with the company, more information will be only be made available in the project proposal.
Sample dataset will be in the project proposal as well.