IS428 AY2019-20T2 Assign SHERRY TAO SHI HUA

From Visual Analytics for Business Intelligence
Revision as of 15:47, 15 March 2020 by Sherry.tao.2017 (talk | contribs)
Jump to navigation Jump to search

Overview

Background

In light of the ever-changing needs and requirements of faculty, students and staff, an extensive and comprehensive survey is conducted every two years, in which the users of both the Li Ka Shing Library and Kwa Geok Choo Law Library have the opportunity to rate various aspects of their services. The results aim to learn about the users’ perception of both libraries, especially in terms of importance and performance, and provides SMU libraries with input to help enhance existing services to adapt and improve their services.

Objective and Tasks

A Library Survey Report is generated after the survey. However, upon closer scrutiny of the report, we can see that it comprises of many pages of tables and confusing charts, which are difficult for users of the report to comprehend and find out the areas to improve. Hence, our objective is to apply appropriate data visualisation using what we have learnt in class to transform these into interactive visualisations so that users can save effort in understanding and making out the meaning of the tables and charts and focus on the main perceptions and areas of improvement, allowing for greater gain of useful insights to SMU Libraries.


The task is split into four groups of users’ perceptions:

  • undergraduates,
  • postgraduates,
  • faculty and
  • staff.

Since these four groups are so varied, they may have different perceptions of the various importance and performance of the library services, of which we need to further explore and gleam insights from.

Data preparation

Dataset

The 2018 Library Survey dataset is used for this Assignment, since the 2020 Library survey ended in late February and the dataset is not published. There are 2639 responses in this dataset, each to a unique Response ID. Of the 88 variables left in the dataset (excluding the unique identifier Response ID), 26 of the 88 are not applicable (NA01 to NA26) and will be excluded from the visualisations.


Of the 62 variables remaining in the dataset, 9 variables contains information on demographics of the respondent, such as which library they use more frequently (Campus), what their field of study is (StudyArea), how frequently they visit the campus, the library and accessing library resources (HowOftenL, HowOftenC, HowOftenW), which year of study they are in (Position), whether they are an international student (ID), net promoter score (NPS1) and an open-ended comment section (Comment1). The open-ended comment section will be excluded from the visualisations as well.


Of the rest of the 53 variables, 26 of them (I01 to I26) measures the importance of each assessment item while another 26 (P01 to P26) measures the performance of the same assessment items. All measures are in Likert Scale, from 1 to 7, stating how important or well-performed the assessment item is in each respondent’s view. The final variable (P27) is an overall satisfaction of the library, measured under Performance.

Data Cleaning and Transformation

Screenshots Description and Steps Taken
Datacleaning1.png

Since the fields NA01 to NA26 are not applicable, they are excluded from the visualisations


Steps Taken:

  • Highlight the columns NA01 to NA26
  • Click on the triangle on the top right of column NA26
  • Click Hide
Datacleaning2.png

The data is in a row format, with every row corresponding to a ResponseID, indicating the respondent. However, Tableau cannot do calculations on data like this, and it needs to be pivoted so that we can proceed with the later steps


Steps Taken:

  • Highlight the columns I01 to I26 and NA01 to NA27
  • Click on the triangle on the top right of column NA27
  • Click Pivot
Datacleaning3.1.png
Datacleaning3.2.png

After pivoting the data, Tableau automatically names the column with the I01 to I26 and NA01 to NA27 as "Pivot Field Names" and the column with the respective scores as "Pivot Field Values". To help us recognise the fields better, "Pivot Field Names" is renamed as "Importnance/Performance" and "Pivot Field Values" is renamed as "Score".


Steps Taken:

  • Highlight the column to be renamed
  • Click on the triangle on the top right of the column
  • Rename the columns accordingly
Datacleaning4.1.png
Datacleaning4.2.png
Datacleaning4.3.png
Datacleaning4.4.png
Datacleaning4.5.png

There are several fields with numbers as the data in the Excel file, but they are actually codes that symbolises some discrete data. To have a clearer representation of the data, aliases are created in place of the numbers.


Steps Taken:

  • Highlight the column to have aliases added
  • Click on the triangle on the top right of the column
  • Click Aliases
  • Create Aliases for the respective columns with reference to the legend in the Excel sheet
Datacleaning5.png

There are several other fields that need renaming for quicker recognition, namely "HowOftenL", "HowOftenC" and "HowOftenW". These are renamed as "How frequently do you visit the library?", "How frequently do you visit the Campus?" and "How often do you access library resources (e.g. online articles, databases, ebooks)?" respectively.


Steps Taken:

  • Highlight the column to be renamed
  • Click on the triangle on the top right of the column
  • Rename the columns accordingly
Datacleaning6.1.png
Datacleaning6.2.png

There are data in the columns that can be grouped, such that similar traits are represented in the group. One column where the responses can be grouped is Net Promoter Score, responses from 0 to 5 are Detractors, responses from 6 to 7 are Passives and responses from 8 to 10 are promoters, according to the Library Survey Report. Another column is the Position, where Undergraduates from the different years are grouped together, Postgraduates Masters or Doctoral are grouped together, different positions in the Faculty are grouped together and both types of Staff are grouped together.


Steps Taken:

  • Go to a new sheet
  • Convert both of these fields into Dimension
  • Right-click on the field to be grouped
  • Hover over Create
  • Click on Group
  • Select the numbers to form a group according to the Excel
  • Click the Group button on the bottom-left to form a group
  • Repeat for all groups
Datacleaning7.png

To create a divergent stacked bar chart for the responses in the Likert Scale, several calculated fields need to be created. These are:

  1. Importance/Performance Level
  2. Negative Score
  3. Total Negative Scores
  4. Total Scores
  5. Gantt Start
  6. Percent of Total Sizing
  7. Gantt Percent

After each calculated field, drag the newly created field into the sheet to create a Crosstab to check the validity and accuracy of the calculations.


Steps Taken:

  • Click on Analysis on the top bar in the sheet
  • Click Create Calculated Field...
  • A window should appear to input the formulae for the calculated field
Datacleaning8.3.png
Datacleaning8.2.png
Datacleaning8.1.png

There are a total of 26 services in the survey that are ranked by both their importance and performance. However, we do not want to analyse the entire list of factors all at once, but focus on the top few based on how positive the services are ranked by scoring each score. To achieve this, we first have to create a Positive/Negative scoring system, then set a parameter so that the number of services that are being looked at can be varied. Then we need a calculated field to filter the list to only the number of services stipulated by the parameter.


Steps Taken:

  • Create calculated fields for Positive/Negative Scoring and Ranking
  • Create parameter called "Top N Factors"
  • Drag and drop Ranking into filters where needed and filter only values that are True

Interactive Visualization

Link

Link to Dashboard: https://public.tableau.com/profile/sherry.tao#!/vizhome/IS428_AY2019-20T2_Assign_SherryTaoShiHua/Dashboard1?publish=yes

Dashboard Overview

Dashboard1 sherry.png

This is the overview of my Dashboard. By default, the Dashboard shows all responses from Undergraduates for both the Li Ka Shing Library and the Kwa Geok Choo Library. The Importance and Performance of the services have been ranked and the top 10 are shown by default. The individual components that make up this dashboard is further explained below.

Components

Frequency of Library visits

Dashboard2.1.png

On the top left of the Dashboard is the bar graph for frequency of library visits. The columns (Daily, Weekly, Monthly, Quarterly, Never) show the frequency and the height of the bar shows the percentage of respondents by group (default being Undergraduate Students) that corresponded to each column.

Net Promoter Score

Dashboard2.2.png

On the bottom left of the Dashboard is the bar graph for Net Promoter Score. The columns (0 to 10) show the possible range of Net Promoter Scores and the height of the bar shows the percentage of respondents by group (default being Undergraduate Students) that corresponded to each column. The colour on the graph shows those who are Detractors, Passives and Promoters. Detractors are those that are unhappy about the Library, Passives are those that are satisfied but not enough to recommend the Library, and Promoters are those that would encourage others to use the Library.

Importance and Performance

Dashboard2.3.png
Dashboard2.4.png

On the top right of the Dashboard is the divergent stacked bar chart for respondents' perception on the importance of each service that the Library offers, and the bottom right of the Dashboard is the divergent stacked bar chart for respondents' perception on the performance. The responses are on a Likert Scale, hence a divergent stacked bar chart is appropriate to draw insights from the respondents' perceptions. By default, the Top N services can be seen as stipulated by the user input of N, up to a value of 26. The colour of the graphs correspond to those who believe the Importance/Performance to be low to high, with the lower values coloured red and the higher values coloured blue. Those that are on the fence are coloured grey, since they are on neither end of the spectrum.

Interactive Elements

Sidebar

There are several interactive elements present on the side bar.

Interactive1.2 sherry.png

Firstly, this is a filter for the Campus that the user would like to analyse. By changing the filter of Campus, all 4 components in the Dashboard will be filtered and showing only the responses from respondents that use Li Ka Shing Library more frequently in this case. Notice that the titles of each of the 4 components are being changed as well.

Interactive1.1 sherry.png

Next, this is a filter for the Top N services to look at by Importance/Performance. Users are able to change the values from 1 to 26, and by changing the filter, the Importance and Performance sheets will be filtered to only the Top N services.

Interactive1.3 sherry.png

Finally, this is a Pages filter for the group of respondents by position (Undergraduates, Postgraduates, Faculty and Staff). Users are able to scroll through the pages manually, or click the play button to let it loop automatically. Notice that the titles of each of the 4 components are being changed as well.

Charts

Apart from the sidebar, the graphs for frequency of library visits and Net Promoter Score are interactive as well. By clicking on individual bars on either chart, it will filter the other 3 components by that bar.

Interactive2.1.png

Here is an example of filtering by frequency of library visits. By clicking on the "Daily" bar, the other 3 visualisations have been changed to show only those responses where the respondents input "Daily" as their frequency of library visits.

Interactive2.2.png

Similarly, this filters by Net Promoter Score. By clicking on the "10" bar, the other 3 visualisations have been changed to show only those Promoters who input "10" as whether they will recommend the Library.

Sorting

Interactive3.png

By default, both the Importance and Performance are sorted by descending order, meaning that the most important and best performed services are shown based on the Top N filter. We can show the least important and worst performed services by changing the sorting order on the chart.

Analysis & Insights

Undergraduate Students

Overall

Undergrad1.1.png

This is the overview of the responses from all Undergraduate students across all years and both Libraries. From the frequency of library visits graph, most undergraduates visit the libraries either daily or weekly, with less than 15% for the other 3 options. Undergraduates as a whole have a Net Promoter Score of 63.29%, which is relatively high.


Top 5 most Important services are:

  1. I can get wireless access in the Library when I need to
  2. I can find a quiet place in the Library to study when I need to
  3. Printing, scanning and photocopying facilities in the Library meet my needs
  4. Opening hours meet my needs
  5. Laptop facilities (e.g. desks, power) in the Library meet my needs


Top 5 best Performance services are:

  1. I can get wireless access in the Library when I need to
  2. I can get help from library staff when I need it
  3. Opening hours meet my needs
  4. Online resources (e.g. online articles, databases, ebooks) are useful for my studies and help me with my learning and research needs
  5. Library staff provide accurate answers to my enquiries

Of the 5 most important services, two of them are also in the top 5 best performing ones, namely "I can get wireless access in the Library when I need to" and "Opening hours meet my needs". The Libraries have done well in these services to comply to the needs of undergraduates.


Top 5 worst Performance services are:

  1. I can find a place in the Library to work in a group when I need to
  2. A computer is available when I need one
  3. I can find a quiet place in the Library to study when I need to
  4. I find it easy to use mobile devices (e.g. tablets and phones) to access online resources
  5. Books and articles I have requested from other Libraries are delivered promptly

Of the 5 most important services, one of them appear in the top 5 worst performing ones, namely "I can find a quiet place in the Library to study when I need to". The Libraries should look into providing more study spaces by better utilising the spaces in the Libraries so that more undergraduates are able to study in the Library when they need to.

Li Ka Shing Library

Kwa Geok Choo Library

Postgraduate Students

Li Ka Shing Library

Kwa Geok Choo Library

Faculty

Li Ka Shing Library

Kwa Geok Choo Library

Staff

Li Ka Shing Library

Limitations