IS428 AY2019-20T2 Assign LIM SI LING

From Visual Analytics for Business Intelligence
Revision as of 22:34, 15 March 2020 by Siling.lim.2017 (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Overview

Every two years, SMU Libraries conduct a comprehensive survey in which faculty, students and staff can rate various aspects of SMU library's services. The survey results are an important feedback for SMU libraries to understand the quality of their services and facilities and identify key areas that needs improvement to better meet the needs of their users.

Objectives

I am tasked by SMU libraries to use visual analytics approach to reveal the level of services provided by them as perceived by:

  • the undergraduate students
  • the postgraduate students
  • the faculty
  • the staff

There was previously a report (Library Survey Report) generated. However, it consists of many separated graphs and charts which is too disjointed to be read properly. Additionally, the report incorrectly uses Mean as a measure. The survey questions are graded based on a 7 point Likert scale. The mean in a Likert scale can't be found because it is not possible to find an average of “agree”, “disagree”, and “neutral. Therefore, the goal is to produce a visualisation of the survey results by reducing the number of charts and providing dashboards to highlight performance of SMU libraries in terms of overall satisfaction, facilities, resources and services provided.

Survey Data

Description of Data

The survey data provided is a single excel workbook containing two sheets:

Sheetname Description
SMU The raw survey results containing survey respondents and each of their reply to the survey questions in the form of ratings.

The raw survey results have a total of 2639 rows or number of participants. There are a total of 89 column. However, there are missing data as well that needed to be removed. The attributes of the data can be divided into the following:

  1. Information on participants:
    • ResponseID (unique ID representing survey respondents)
    • Campus (which library do respondents use more)
    • Position (Type of user)
    • ID (international Student or not)
    • HowOftenL (frequency of visit to library)
    • HowOftenC (frequency of visit to Campus)
    • HowOftenW (frequency of accessing library resources)
  2. Survey Questions on quality of library facilities and services:
    • Divided into 3 types (Importance, Performance and N/A)
    • Respondents who picked 'N/A' for the question need not select a rating for both importance and performance.
    • There are 26 questions for importance and N/A and 27 questions for performance, represented by 78 different columns
  3. Free response questions:
    • Comment1 (suggestions for improvement)
Legend The legend that contains the exact survey questions and the question code that they represent.

Data Preparation

The data collected from the survey had many missing values apart from those who have picked N/A for the question. Therefore, data with key areas missing are to be excluded from the data visualisation since they were no longer useful in providing insights.
To remove missing data, codes were written in python to determine which rows to remove.

Screenshot Action taken and description
Import python.png
The excel workbook "Raw data 2018-03-07 SMU LCS data file - KLG.xlsx" is imported into jupyter notebook.
2.png
To make it easier to deal with the data, cells that are empty are filled with -1 as their values
6.png

7.png
Next, the criteria for which data to be removed is determined as follows:
  1. Identifier questions like Campus, Position, StudyArea, HowOftenL, HowOftenC, HowOftenW and ID must not be empty at all costs, otherwise data is to be removed
  2. If N/A is selected, then corresponding importance and performance must be 0, otherwise data is to be removed


The code will create a new column named" To_remove" indicating if the row of data is to be deleted. The column is added to the DataFrame and exported back to excel format and renamed "cleaned_survey_data.xlsx".
121 rows of data will be deleted.

8.png
Legend is copied over to the newly exported dataset.
9.png
The new excel workbook is then imported into Tableau.
5.png
However, the dataset cannot be used directly, as the question codes are not linked to the actual question itself. Pivoting of the data set is required.


The survey question columns are highlighted (from I01 to NPS1) and pivoted.
'Pivot Field Names' is renamed to 'Q_code' and 'Pivot Field Values' to 'Rating'.

10.png
Lastly, the survey questions have to be mapped to the question code.


This was done by dragging the Legend Sheet right next to SMU and join them by left outer join, keeping all the data rows in SMU. The two sheets are joint by their question code (Q_code in SMU and Code in Legend).


On top of the data preparation above, the survey questions can be split into the following categories to better identify the areas of improvement for SMU Libraries:

Category Survey Questions
Overall Satisfaction 27: Overall how satisfied are you with the Library?
Library Resources

11: The items I'm looking for on the library shelves are usually there
20: Information resources located in the Library (e.g. books, journals, DVDs) meet my learning and research needs
21: Online resources (e.g. online articles, databases, ebooks) are useful for my studies and help me with my learning and research needs
22: Course specific resources (online and in the reserve collection) meet my learning needs
25: Access to Library information resources has helped me to be successful at university

Facilities

14: I can find a quiet place in the Library to study when I need to
15: I can find a place in the Library to work in a group when I need to
16: A computer is available when I need one
17: Laptop facilities (e.g. desks, power) in the Library meet my needs
18: I can get wireless access in the Library when I need to
19: Printing, scanning and photocopying facilities in the Library meet my needs

Services Provided

01: I am informed about the Library services
02: The Library website provides useful information
03: Library signage is clear
04: Library workshops, classes and tutorials help me with my learning and research needs
05: The Library anticipates my learning and research needs
06: Opening hours meet my needs
07: Books and articles I have requested from other Libraries are delivered promptly
08: Self Service (e.g. self-check loans, requests, renewals, holds) meets my needs
09: Online enquiry services (e.g. Email, Library Chat) meet my needs
10: Face-to-face enquiry services meet my needs
12: Library staff provide accurate answers to my enquiries
13: I can get help from library staff when I need it
23: When I am away from campus I can access the Library resources and services I need
24: The Library search engine enables me find relevant library resources quickly
26: I find it easy to use mobile devices (e.g. tablets and phones) to access online resources

Interactive Visualization

Link to Story on Tableau Public

Dashboard Screenshot Description
Survey Demographics
Demographic.png
The demographic Dashboard is built to allow SMU Libraries to tell which group of users are made up the majority of participants (Undergraduates in this case)
Library Usage
Library usage.png
The Library Usage tells how often participants visited the library and used its resources.

Additionally, the Study Area vs Library Preference graph will provide insight on the which faculty preference of library.

Facilities, Library Resources & Services Provided
Facilities.png
There are 3 Dashboards showing the performance and importance rating given to SMU library facilities, resources and services provided respectively.

Each Dashboard allows users to filter and select the Position (role) of survey participants to filter results and compare differences between the groups.

The Importance and Performance graphs are put side by side highlight whether SMU libraries’ performance has met the expectation of its users (represented by importance rating)

Overall Satisfaction
Satisfaction1.png
The overall satisfaction will represent the overall rating given by survey participants.

The graphs are split by Study Areas and Position respectively to compare the different demographics rating.

Net Promoter Score
NPSlll.png
The net promoter score will provide the library insights of which group of people they should focus their marketing strategy towards as they are the ones that will promote their services to others.

Analysis and Insights

The importance represents user’s expectation of a libraries service. The higher the importance rating, the higher a user’s expectation for quality of that service. I have split the survey questions into the following categories to better compare the performance across different kinds of services provided by the library.

Demographics

The primary audience of the survey is undergraduate students, taking up 80.62% of the survey's respondents. Postgraduate students came in next at 9.04%. In proportion to the overall demographics of SMU, Business students take up 39.12% of the respondents, making up largest among the other schools. Information Systems comes in second, with about half of Business's students at 14.43% of the total students, with Accountancy Students coming in at a close third of 14.38%.

Library usage

Both the Library Visit and Resource Access Frequency have roughly the same distribution with minor differences.

The libraries campus has seen more proportion daily frequents than their online resources. Majority of the users have frequented the library campus or access library resources on a weekly basis. What was unexpected was that there was a significantly larger proportion of people that has never access the library’s resources despite having a more convenient online platform of databases to access. On the contrary, there is almost no one (0.44%) that has never been to the Library.

This maybe due to the fact that there are many more users who frequented the libraries for its facilities such as having a quiet studying area and project rooms with monitors for group discussions than to access its learning resources, whether online or physically visiting the library.

Facilities

From the overall data, it can be concluded that computer availability is the lowest rated attribute under facilities. Despite having the largest proportion of ratings that are less than 4 (19.6% for both performance and importance), there is no gap in the amongst any ratings from 1 to 7. Thus, while performance is low, expectation of having computer availability is also low. One underlying cause could be that most if not all students will have brought a laptop whenever they visit the school campus. Thus, there is little need for the library to provided computers.

Differences between importance and performance is small even among the different positions of people.

Faculty members All members

By varying the filters, it is evident that for faculty members (left chart) have a larger proportion of low ratings of importance and performance of library facilities than compared to all the students and staff (right chart). This may be due to lower needs of library’s facilities like computers, groups study rooms and printing needs. Faculty members often have offices to serve as their personal working spaces to meet students and other staff for work or projects.

Library Resources

From the overall data, it would seem that the physical copies of information resource available in the library is not fully meeting the needs of students. Performance rating has a larger proportion than importance rating. The difference is larger for students (postgraduate and undergraduate) than for staff and faculty.

Faculty members All members

Compared to the ratings given in Facilities (left chart), this time, Faculty members gave the highest ratings instead compared to all members (right chart), giving almost no negative ratings. The results for Faculty members have no ratings less than 4 other than the attribute of availability of physical resources in the library, which have a small proportion of 2.1% for performance rating.

Services Provided

Postgraduate members All members

For service provided by the library, postgraduates have the largest difference in importance and performance ratings, showing that this group have the largest portion of needs not being met. This may because postgraduate have the largest needs of library services and the highest expectation of them because of the many research projects and papers as compared to undergraduate.

Additionally, similar to the previous ratings given to resources, faculty members gave the lowest proportion of negative ratings. This may be because of a lower need of services provided by the library such as workshops, information on library services, and opening hours.

Overall Satisfaction

Comparing across the different positions, it seems that Exchange Students and Faculty were the most satisfied with library facilities and services. The higher ratings given by exchange students may be due to having lower expectations as exchange students are only studying in SMU for a few months.

Faculty members also have lesser needs in library facilities and services as compared to students as they have their own offices and less need to attend workshops offered or for information provided by the library.

On the other hand, we see a larger proportion low ratings given by students who access the library facilities more and have more need for the library’s resources.

Net Promoter Score (NPS)

Comparing across the different faculties, School of Economics have the lowest net promoter score whereas School of Social Sciences have the Largest, with law coming in a close behind. As a humanities area of studies, the students require more readings and research resources from the library.

On the other hand, across the different positions, Undergraduates have the lowest proportion of Promoters and the lowest NPS. On the other hand, Faculty members have the highest NPS score.

References

Likert Scale Tableau Tutorial