Difference between revisions of "ANLY482 AY2016-17 T2 Group09: Project Overview"

From Analytics Practicum
Line 57: Line 57:
 
<font size =3 face=Georgia >
 
<font size =3 face=Georgia >
  
LKS library is the main study area for SMU students. Having studied at SMU for four years, our group is motivated to know how our peers use the library. As SIS students, we are curious about how students from different schools use the library differently. Do IS students come to the library most often? What is the average number of days on which students from each school visit the library? How frequently do undergraduate students visit the library in each academic week?
+
LKS library is the main study area for SMU students. The management team of Li Ka Shing Library is striving for a better user experience. Unlike e-book usage or search requests, for which data can be easily collected, the team has little information about the physical usage of the library, especially by specific user groups. Our group has been studying at SMU for four years, and we are also motivated to know how our peers use the library.
 
 
We have friends who are pursuing their master's degrees and come to the library frequently, so we also want to know whether master's students come to the library more often than undergraduate students do.
 
 
 
During exam weeks, the library is usually so full that we have to find study space elsewhere, so we are curious about how many people visit the library during week 14 and what the peak hours are during exam week.
 
 
 
Moreover, one of our group members worked as a library assistant for two years and observed that students from different schools and in different years of study might use the library differently. Therefore, we are interested to see whether students in year 4 really come less frequently than students in year 1.
 
 
 
The management team of LKSLIB is striving for a better customer experience, especially in the physical study environment of LKSLIB. We believe these questions can be answered using the library entrance data provided by the LKSLIB management team.
 
  
 
</font>
 
</font>
Line 83: Line 75:
 
Compared with EDA visualisation, a statistical test such as one-way ANOVA offers greater precision and a more informative interpretation of the results. It gives us insight into the relationships between variables and lets us test our hypotheses against the data. In our case, we use a one-way ANOVA test to validate the conclusions drawn from the EDA process. Based on the test result, we either reject or fail to reject the null hypothesis, which provides statistical support for our final conclusion.
 
  
The relationship between the number of entries, the number of days visited, nationality, and Dean's List status is statistically validated using parametric ANOVA in this study. The statistical data and graphs are processed with SAS JMP. ANOVA tests whether the mean numbers of entries and days are equal across several groups of students. The one-way ANOVA was used to determine whether the number of entries differs between groups of students.
 
  
The null hypothesis states that the means are equal (H0: μ1 = μ2), and the alternative hypothesis states that the two means are not equal. The null hypothesis is either rejected or not rejected on the basis of statistical significance (significance level α = 0.05). In this case, our null hypothesis is that there is no difference between the means of the two groups.
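To make the test concrete, the following is a minimal sketch of how a one-way ANOVA F statistic is computed, written in plain Python rather than SAS JMP, which the study actually uses. The function name `one_way_anova_f` and the sample entry counts are assumptions made up for illustration; they are not taken from the library data.

```python
from statistics import mean

def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of numeric samples."""
    k = len(groups)                          # number of groups
    n_total = sum(len(g) for g in groups)    # total number of observations
    grand_mean = mean(x for g in groups for x in g)

    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of the observations around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    df_between = k - 1
    df_within = n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Made-up entry counts per student, for illustration only.
deans_list = [12, 15, 11, 14]
non_deans_list = [8, 9, 7, 10]

f_stat, df_b, df_w = one_way_anova_f([deans_list, non_deans_list])
print(f_stat, df_b, df_w)
```

The p-value is then read from the F distribution with these degrees of freedom and compared with α = 0.05; a statistics package such as SAS JMP reports it directly.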
 
  
 
</font>
 
</font>
 
</div>
 
</div>

Revision as of 00:08, 24 April 2017





Li Ka Shing Library (LKSLIB) is the first library of Singapore Management University, officially opened on 24 February 2006. The library is named after Hong Kong businessman Dr. Li Ka-shing, and the Li Ka Shing Foundation donated an endowment to the library for collections. The main objective of the library is to offer an interactive study and research space for the SMU community.

LKSLIB has four floors covering about 8,800 square meters, with 1,800 seats. Inside the library there are a variety of spaces, including open spaces for individual and collaborative use, a learning commons that is open 24/7, quiet areas where individuals can focus on their work, project rooms with LCD panels, an investment studio, and postgraduate lounges. As a modern library, it is also equipped with a high-speed wireless network, color printers, scanners, public computers with professional financial software, up-to-date newspapers and magazines, and collections of lifestyle videos and games. These facilities are what make LKSLIB so attractive to the SMU community.


Project Background

Our project focuses on analyzing library entry information from the card reader logs. The card readers are installed on the library gantries at both the main entrance of LKSLIB and the linkbridge side entrance. Students need to tap their card whenever they enter the library, which gives us entry records containing a timestamp and basic information about the student. To better understand library usage, the library management team is interested in whether we can find usage patterns for particular user groups (e.g. Dean's List students) and whether any other business insights can be drawn from the data. We will also carry out statistical analysis to confirm our insights.
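As a minimal sketch of how such logs can be summarised, the Python snippet below counts the number of entries and the number of distinct visit days per student. The record layout (a student ID plus a timestamp string), the function name `summarise_entries`, and the sample rows are all assumptions for illustration; the actual log format is not shown on this page, and the project itself does its data cleaning in R.

```python
from collections import defaultdict
from datetime import datetime

# Hypothetical log rows: (student_id, timestamp). The real column names
# and formats in the card reader logs are not given on this page.
sample_log = [
    ("S001", "2016-09-05 09:15:00"),
    ("S001", "2016-09-05 14:40:00"),
    ("S001", "2016-09-06 10:05:00"),
    ("S002", "2016-09-05 11:30:00"),
]

def summarise_entries(rows):
    """Return {student_id: (number_of_entries, number_of_distinct_days)}."""
    entries = defaultdict(int)
    days = defaultdict(set)
    for student_id, stamp in rows:
        when = datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S")
        entries[student_id] += 1            # every tap counts as one entry
        days[student_id].add(when.date())   # a set keeps each day only once
    return {sid: (entries[sid], len(days[sid])) for sid in entries}

summary = summarise_entries(sample_log)
print(summary)
```

Keeping entries and distinct days as separate measures matters because a student who taps in several times on one day would otherwise look the same as one who visits on several different days.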

We use R to build a web application that cleans the raw data, and Tableau to visualize the data and compare usage levels between Dean's List and non-Dean's List students, and between Singaporean and international students. We then run a confirmatory one-way ANOVA using SAS JMP.

Motivations

LKS library is the main study area for SMU students. The management team of Li Ka Shing Library is striving for a better user experience. Unlike e-book usage or search requests, for which data can be easily collected, the team has little information about the physical usage of the library, especially by specific user groups. Our group has been studying at SMU for four years, and we are also motivated to know how our peers use the library.

Objectives

We aim to compare library usage between the groups in each of the following two pairs:

  • Dean's Lister vs. Non-Dean's Lister
  • Singaporean Students vs. International Students

Exploratory data analysis (EDA) is an approach to analysing data sets in order to summarise their main characteristics. Bar charts are commonly used to present grouped data as rectangular bars whose lengths are proportional to the values they represent, which makes comparisons among categories easy to see. EDA is primarily used to see what the data can tell us and to explore possible hypotheses that could lead to model fitting and hypothesis testing. In our study, we use EDA to get an overview of the data and draw preliminary conclusions.
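The kind of grouped comparison a bar chart shows can be sketched in a few lines of Python. The study itself uses Tableau for this step; the function name `group_means`, the group labels, and the entry counts below are made up for illustration only.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical (group, entries per student) records, not real data.
records = [
    ("Dean's List", 12),
    ("Dean's List", 14),
    ("Non-Dean's List", 8),
    ("Non-Dean's List", 10),
]

def group_means(rows):
    """Return {group: mean value} for (group, value) pairs."""
    values = defaultdict(list)
    for group, value in rows:
        values[group].append(value)
    return {group: mean(vals) for group, vals in values.items()}

means = group_means(records)

# Text bar chart: bar length is proportional to the group mean.
for group, avg in means.items():
    print(f"{group:<16} {'#' * round(avg)} {avg}")
```

A visible gap between the bars suggests a hypothesis, but it does not show whether the gap is larger than chance variation would produce, which is why a formal test follows the EDA step.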

Compared with EDA visualisation, a statistical test such as one-way ANOVA offers greater precision and a more informative interpretation of the results. It gives us insight into the relationships between variables and lets us test our hypotheses against the data. In our case, we use a one-way ANOVA test to validate the conclusions drawn from the EDA process. Based on the test result, we either reject or fail to reject the null hypothesis, which provides statistical support for our final conclusion.