Difference between revisions of "IS428 AY2019-20T2 Assign TANNY LAI"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
Line 13: Line 13:
 
* The staff
 
* The staff
  
== Data preparation ==  
+
== Dataset Analysis & Transformation Process ==  
 +
 
 +
'''Understanding the dataset'''
 
Before diving in to do an in-depth analysis and creating charts, it is essential to understand the dataset given and identify the best method for data preparation by understanding the respective format and attributes(columns). The dataset provided in the assignment consists of 2 datasheets:
 
Before diving in to do an in-depth analysis and creating charts, it is essential to understand the dataset given and identify the best method for data preparation by understanding the respective format and attributes(columns). The dataset provided in the assignment consists of 2 datasheets:
 
   
 
   
Line 21: Line 23:
  
 
This section will elaborate on the dataset analysis and transformation process for the dataset in order to prepare the data for import and analysis on interactive visualization.
 
This section will elaborate on the dataset analysis and transformation process for the dataset in order to prepare the data for import and analysis on interactive visualization.
 +
 +
Firstly, I identified that the dataset contains three types of data - demographic, behavioral and feedback/opinions.
 +
 +
Under demographic data types, the dataset has surveyee's data on:
 +
<ol>
 +
<li> StudyArea - Major area of study, research or teaching </li>
 +
<li> Position - Position in SMU (1 column)</li>
 +
<li> ID - International (non-exchange) student (1 column)</li>
 +
</ol>
 +
 +
Under behavioral data types, the dataset has surveyee's data on:
 +
<ol>
 +
<li>Campus - Library mostly used (1 column)</li>
 +
<li>HowOften- How frequently do you visit the library/campus/access library resources (3 columns)</li>
 +
</ol>
 +
 +
Under feedback data types, the dataset has surveyee's data on:
 +
<ol>
 +
<li>NPS - How likely surveyee is to recommend the library service to other students (1 column)</li>
 +
<li>Importance- A list of importance ratings for 26 library services (26 columns)</li>
 +
<li>Performance- A list of performance ratings for 26 library services (26 columns) and surveyee's satisfaction with the library (1 column)</li>
 +
<li> Comments - Suggestions for improvement or any other comments about the Library (1 column)</li>
 +
<li> NA - Are the services applicable to the surveyee? (1 column)</li>
 +
</ol>
 +
 +
'''Transformation'''
 +
1. After understanding the dataset, it is noticeable that the data attributes HowOften, Importance, Performance, and NA needs to be pivot as they have many columns and this would make data analysis for the charts tough at the later steps.
 +
 +
Solution: In order to solve the issue of too many columns for the same data attribute I put the dataset into TableauPrep
 +
  
 
== Dashboard ==
 
== Dashboard ==

Revision as of 19:10, 14 March 2020

Problem & Motivation

Problem
Every two years, SMU Libraries conduct a comprehensive survey in which faculty, students and staff have the opportunity to rate various aspects of SMU library's services. The survey provides SMU libraries with input to help enhance existing services and to anticipate emerging needs of SMU faculty, students and staff. However, the survey results presented by InSync lack of friendly and interactive visualization to aid the library staffs to easily understand the results and derive value-adding insights.

Motivation
In class, we have learned that we should always use a divergent stack bar chart to do analysis on Likert scales and not to find the average of the scale. Upon viewing the survey report done by Insync, I feel inspired to apply the concepts, methods, and techniques I have learned in my Visual Analytics IS428 class. Furthermore, it would be far more interesting and practical to be able to resolve real-world problems using visual analytics skills that I have learned.

The interactive visualization can reveal the level of services provided by SMU libraries as perceived by:

  • The undergraduate students,
  • The postgraduate students,
  • The faculty,
  • The staff

Dataset Analysis & Transformation Process

Understanding the dataset Before diving in to do an in-depth analysis and creating charts, it is essential to understand the dataset given and identify the best method for data preparation by understanding the respective format and attributes(columns). The dataset provided in the assignment consists of 2 datasheets:

1. The data in codes and number
Excel Legends

2. The legend to describe the data in layman terms.
Dataset

This section will elaborate on the dataset analysis and transformation process for the dataset in order to prepare the data for import and analysis on interactive visualization.

Firstly, I identified that the dataset contains three types of data - demographic, behavioral and feedback/opinions.

Under demographic data types, the dataset has surveyee's data on:

  1. StudyArea - Major area of study, research or teaching
  2. Position - Position in SMU (1 column)
  3. ID - International (non-exchange) student (1 column)

Under behavioral data types, the dataset has surveyee's data on:

  1. Campus - Library mostly used (1 column)
  2. HowOften- How frequently do you visit the library/campus/access library resources (3 columns)

Under feedback data types, the dataset has surveyee's data on:

  1. NPS - How likely surveyee is to recommend the library service to other students (1 column)
  2. Importance- A list of importance ratings for 26 library services (26 columns)
  3. Performance- A list of performance ratings for 26 library services (26 columns) and surveyee's satisfaction with the library (1 column)
  4. Comments - Suggestions for improvement or any other comments about the Library (1 column)
  5. NA - Are the services applicable to the surveyee? (1 column)

Transformation 1. After understanding the dataset, it is noticeable that the data attributes HowOften, Importance, Performance, and NA needs to be pivot as they have many columns and this would make data analysis for the charts tough at the later steps.

Solution: In order to solve the issue of too many columns for the same data attribute I put the dataset into TableauPrep


Dashboard

Analysis & Insights

Undergraduate Students

Postgraduate Students

Faculty

Staff