Difference between revisions of "IS428 AY2019-20T2 Assign TANNY LAI"

Revision as of 15:36, 15 March 2020

Problem & Motivation

Problem
Every two years, SMU Libraries conduct a comprehensive survey in which faculty, students and staff have the opportunity to rate various aspects of SMU library's services. The survey provides SMU libraries with input to help enhance existing services and to anticipate emerging needs of SMU faculty, students and staff. However, the survey results presented by InSync lack of friendly and interactive visualization to aid the library staffs to easily understand the results and derive value-adding insights.

Motivation
In class, we have learned that we should always use a divergent stack bar chart to do analysis on Likert scales and not to find the average of the scale. Upon viewing the survey report done by Insync, I feel inspired to apply the concepts, methods, and techniques I have learned in my Visual Analytics IS428 class. Furthermore, it would be far more interesting and practical to be able to resolve real-world problems using visual analytics skills that I have learned.

The interactive visualization can reveal the level of services provided by SMU libraries as perceived by:

The undergraduate students,
The postgraduate students,
The faculty,
The staff

Dataset Analysis & Transformation Process

Understanding the dataset

Before diving in to do an in-depth analysis and creating charts, it is essential to understand the dataset given and identify the best method for data preparation by understanding the respective format and attributes(columns). The dataset provided in the assignment consists of 2 datasheets:

1. The data in codes and number

2. The legend to describe the data in layman terms.

This section will elaborate on the dataset analysis and transformation process for the dataset in order to prepare the data for import and analysis on interactive visualization.

Firstly, I identified that the dataset contains three types of data - demographic, behavioral and feedback/opinions.

Under demographic data types, the dataset has surveyee's data on:

StudyArea - Major area of study, research or teaching
Position - Position in SMU (1 column)
ID - International (non-exchange) student (1 column)

Under behavioral data types, the dataset has surveyee's data on:

Campus - Library mostly used (1 column)
HowOften- How frequently do you visit the library/campus/access library resources (3 columns)

Under feedback data types, the dataset has surveyee's data on:

NPS - How likely surveyee is to recommend the library service to other students (1 column)
Importance- A list of importance ratings for 26 library services (26 columns)
Performance- A list of performance ratings for 26 library services (26 columns) and surveyee's satisfaction with the library (1 column)
Comments - Suggestions for improvement or any other comments about the Library (1 column)
NA - Are the services applicable to the surveyee? (1 column)

Transformation

1. After understanding the dataset, it is noticeable that the data attributes HowOften, Importance, Performance, and NA needs to be pivot as they have many columns and this would make data analysis for the charts tough at the later steps.

Solution: In order to solve the issue of too many columns for the same data attribute I put the dataset with the codes into TableauPrep and pivot it three times. Firstly, I pivot the HowOften attribute. Continuously, I pivot Importance and Performance together and lastly, I pivot NA. Finally, I output it in a TDE file format.

2. Despite preparing the data by using pivot, the data was still not ready for analysis as they were in code format which was hard for a user who is not familiar to understand.

Solution: Since the dataset is in code format, to make it easier for the user who will be using the analytic dashboard, I had created calculated fields to help convert the data. I avoided using Alias although it gets the job done faster, the user will have to create their own alias every time they create a new Tableau file and it will be troublesome. By creating a calculated field, the code can be reused and the user will not be required to do alias multiple times. Overall, for the dataset, I had created calculated fields to rename the codes for the attributes: Study Area, Position, HowOften Responses, Yes No(NA) Question Title, Importance and Performance, Importance and Performance Question Categories, Importance and Performance Question Title, Response Label, and YN (NA) Question Categories.

Also, as we're only interested in certain positions that fall in the groups - undergraduate students, postgraduate students, faculty, and staff. I had created a calculated field just to retrieve our interested groups.

3. The survey question involves getting the net promoter score to gauge the loyalty of the library's relationships with its users but it is not possible to do the analysis without separating the results of the data attributes into the detractors, promoters, and neutrals and calculate the NPS score

Solution: In order to be able to create a visualization for the data, it is necessary to create four calculated fields to calculate the percentage of detractors, promoters, and neutrals as well as the NPS score.

4. In the dataset, the survey questions under the data attribute for Importance and Performance are on a Likert scale format. To perform an analysis of the Likert scale data type, the survey results need to be presented in the divergent stacked bar chart format. However, the current dataset is not friendly in allowing users to create a divergent stacked bar chart.

Solution: To be able to create a divergent stacked bar chart visualization for the Likert Scale data type, it is necessary to create four calculated fields. First, I created a calculated field to separate the number of positive and half of the number of neutral responses to build the right side of the chart. Second, I create another calculated field to separate the number of negative responses and half of the number of neutral responses to build the left side of the chart. Third, I created another calculated field to calculate the percentage of negative responses. Lastly, I created a calculated field to calculate the percentage of positive responses.

5. The dataset involved data on Performance(similar to satisfaction) and Importance, thus, I deemed that it is necessary to do a visualization on gap analysis for the performance compared to the Importance, however, the current dataset does not allow me to do so.

[[File:Gap3.jpg|400px|frameless]

Solution: To be able to create visualization charts for gap analysis, I had to create a parameter called sentiment to act as a filter to users to select if they would like to a gap analysis for positive/neutral/negative responses. Also, I had created a calculated field to determine when the sentiment parameter is selected what falls under the Performance and a calculated field to determine when the sentiment parameter is selected what falls under the Importance. Lastly, I created a calculated field to find the gap percentage of performance and importance.

6. After all the creation of necessary dimensions and measures, we need to ensure that the visualization charts would only show the data belonging to the 4 groups mentioned earlier on, using filters and sets for excluding certain unnecessary data.

Solution: Make use of the previously created dimensions as filters to exclude out unnecessary data and create sets to ensure that only required data categories are shown.

Interactive Visualization

The interactive visualization can be accessed here:https://public.tableau.com/profile/tanny.lai#!/vizhome/TannyIS428IndividualAssignment/Story1?publish=yes

Throughout all the dashboard pages, useful guides/tips are provided to help users navigate through the different filters and actions so that their analysis can be performed smoothly. The following interactivity elements are also used throughout all the dashboards to maintain consistency:

Interactive Technique	Purpose	Steps
Navigate across dashboard pages with buttons	Every page in the dashboard will have this navigation buttons on the header to provide users the flexibility of moving from one dashboard to the other, with an easy and simple to use interface	Example
Filter buttons to select interested group for each dashboard pages	To provide users with contextual information about the dashboard and allowing users the flexibility to filter the charts to the group that they are interested in viewing to gain insights.

Home Page

The dataset given contains a substantial amount of data attributes captured in the dataset provided. As such, it will not be possible to display all the attributes for a proper analysis in a single dashboard as users will be bombarded by the amount of information and insights. Furthermore, although many of these attributes are interrelated to each other, there were no clear guidelines and orders given to how the data attributes should be analyzed. To resolve this issue, flexibility has to be provided for users to navigate between different dashboards. To do so, a homepage is created to introduce the dataset and problem that the dashboard is trying to solve along with the groups that we are interested in analyzing to gather insights. The homepage explains that the visualization will be broken down into 4 groups to be analyzed. This will allow users to understand how the dashboard will work and that they will be able to choose the analysis that they are interested to look into to gather in-depth analysis.

The following shows the home dashboard:

Performance Vs Importance

This dashboard page has charts to analyze the questions under the performance and importance data attributes as well as being able to filter them according to the group and position of the surveyee, sentiment based on the responses of the questions and if the services are applicable to the respondents.

Box No.	Description
1.	Navigation bar for users to navigate across the dashboards.
2.	Filter icons for users to select a group they are interested in getting its insights with tooltips to prompt users to select a group. Default(Before filtering): Insights of all the 4 groups will be presented.
3.	This shows the current filters in place for the dashboard. Users may also select the filters for: 1. Applicable Services (Based on NA Questions' results) with the selections: All - Everyone who responded Yes - Those who are aware of the service library offered No - Those who consider the services as not applicable to them. 2. Sentiment (Based on Importance and Performance Questions' results) with the selections: Negative - Responses with 1-3 in scale for each Importance and performance Questions Neutral- Responses with 4 in scale for each Importance and performance Questions Positive- Responses with 5-7 in scale for each Importance and performance Questions According to the choices depending on what they are interested to know.
4.	The chart in this box allows users to view the overall satisfaction of the selected group's positions (based on box 2) in a divergent stack bar chart. Also, users can select a position that they would drill down on for in-depth analysis by clicking on the word of the position. For example, by clicking on the word in the first column, undergraduate year 1, the visualization will filter all other charts in the dashboard to display only data from the position undergraduate year 1.
5.	The chart in this box compares the performance to the importance based on the selected sentiment (Box 3's Filter 2) as well as any other filters selected beforehand. Users can select a category of their interest and drill down for in-depth analysis by clicking on the word of the category. For example, by clicking on the word in the first column, Facilities and Equipments, the visualization will filter the charts in box 6 and 7 of the dashboard to display only data from the category Facilities and Equipments.
6.	The chart in box 6 shows which services based on the survey question are performing above the respondent's expectation based on whether performance is higher than importance. If the service in the question title is performing well, the circle plot will be in green color, if the service is underperforming, the plot will be in red color if performance is the same as importance, the circle plot will be yellow in color. Users are able to click on a specific question that they are interested to filter the divergent stack bar chart in box 7 accordingly.
7.	This chart usually displays the overall responses into a divergent stacked bar of all available questions for the dashboard page. However, once a filter is selected, the chart will filter the data to display accordingly.

Overall with demographics

This dashboard page has charts to display the behavioral and demographic data of the survey's respondents as well as the NPS score. The dashboard have filters like the group, faculty, and position of the respondents, as well as filters by the library that is most used by the surveyees.

Box No.	Description
1.	Navigation bar for users to navigate across the dashboards.
2.	Filter icons for users to select a group they are interested in getting its insights with tooltips to prompt users to select a group. Default(Before filtering): Insights of all the 4 groups will be presented.
3.	This shows the current filters in place for the dashboard which includes the group, position, faculty of the surveyee as well the library most used by the surveyee. Users can filter the dashboard page according to what they are interested to learn.
4.	The chart in this box tells the users the distribution of which library is mostly used. Users are able to select the library that they are interested in doing further analysis as the remaining charts in the dashboard will be filtered to display the data of those who have voted for the selected library.
5.	This chart presents data on the NPS role in Position along with the NPS score. Users are able to select a position of the choice in this chart to filter the remaining charts to show data of their selected position.
6.	The chart in box 6 shows the respondent's faculty distribution. The user may also select a faculty of their choice by clicking on the bar for the respective faculty to filter the charts in the dashboard and derive necessary insights.
7.	This chart shows the number of respondents for each faculty based on the chart in box 6.
8.	The chart shows the frequency of each respondent visiting the campus/ library/ using the library resources. The results will be filtered based on the selected filters if any.

@@ Line 151: / Line 151: @@
 | style="text-align: center;"|5. || The chart in this box compares the performance to the importance based on the selected sentiment (Box 3's Filter 2) as well as any other filters selected beforehand. Users can '''select a category''' of their interest and drill down for in-depth analysis by clicking on the word of the category. For example, by clicking on the word in the first column, Facilities and Equipments, the visualization will filter the charts in box 6 and 7 of the dashboard to display only data from the category Facilities and Equipments.
 |-
-| style="text-align: center;"|6. || The chart in box 6 shows which services based on the survey question are performing above the respondent's expectation based on whether performance is higher than importance. If the service in the question title is performing well, the circle plot will be in green color, if the service is underperforming, the plot will be in red color. Users are able to '''click on a specific question''' that they are interested to filter the divergent stack bar chart in box 7 accordingly.
+| style="text-align: center;"|6. || The chart in box 6 shows which services based on the survey question are performing above the respondent's expectation based on whether performance is higher than importance. If the service in the question title is performing well, the circle plot will be in green color, if the service is underperforming, the plot will be in red color if performance is the same as importance, the circle plot will be yellow in color. Users are able to '''click on a specific question''' that they are interested to filter the divergent stack bar chart in box 7 accordingly.
 |-
 | style="text-align: center;"|7. || This chart usually displays the overall responses into a divergent stacked bar of all available questions for the dashboard page. However, once a filter is selected, the chart will filter the data to display accordingly.

Difference between revisions of "IS428 AY2019-20T2 Assign TANNY LAI"

Revision as of 15:36, 15 March 2020

Contents

Problem & Motivation

Dataset Analysis & Transformation Process

Understanding the dataset

Transformation

Interactive Visualization

Home Page

Performance Vs Importance

Overall with demographics

Analysis & Insights

Undergraduate Students

Postgraduate Students

Faculty

Staff

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools