ISSS608 2016-17 T1 Assign2 Linda Teo Kwee Ang

From Visual Analytics and Applications
Revision as of 18:02, 23 September 2016 by Ka.teo.2016 (talk | contribs)


Abstract


“If you can’t beat it, edit it.” This line appeared in an article by Cornell University on how it embraced Wikipedia as a teaching resource [1]. Likewise, in this course, ISSS608 Visual Analytics and Applications, we have been using a wiki to communicate and share course material, assignments, etc. Empirical studies have shown that a large majority of university students use Wikipedia heavily and frequently to carry out different assignments and tasks (Wannemacher, K. and Schulenburg, F. 2010. Wikipedia in Academic Studies: Corrupting or Improving the Quality of Teaching and Learning?) [2].


Introduction


In this study, the dataset was taken from a research project on factors that influence the teaching use of Wikipedia in higher education. That research project was based on an online survey of faculty members from two Spanish universities on teaching uses of Wikipedia, conducted in 2015 (Source: Factors that influence the teaching use of Wikipedia in Higher Education by Antoni Meseguer-Artola, Eduard Aibar, Josep Lladós, Julià Minguillón, Maura Lerga, published in JASIST, Journal of the Association for Information Science and Technology) [3]. There, the team used the Technology Acceptance Model (TAM) to predict the “intention to use” and “acceptance of new information system”.

This study aims to examine the dataset used for the above research project and to apply different data visualisation techniques to discover patterns in it.


Understanding the data


The Wiki4HE dataset was in CSV format. It was first loaded into JMP Pro for data analysis. The columns held details of each respondent, followed by Likert scores for each survey question. To begin, each respondent was assigned a unique ID. Thereafter, the variables in the dataset were compared against the research paper for a better understanding of the data. In JMP Pro, the data was checked for completeness using Missing Data Pattern, and Distribution was used to see the types of values captured in each column. A number of “?” values were noted in some of the variables. For the purpose of this study, a “?” in the demographic details will be treated as “unknown”, while a “?” in the responses will be treated as “null”.
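
The preparation steps described above were carried out in JMP Pro. For readers who prefer a script, the same logic can be sketched in Python. This is only an illustration under stated assumptions: the sample rows are made up, and the column names, the demographic/response split and the helper names `clean_rows` and `count_missing` are hypothetical rather than taken from the actual file or from JMP.

```python
import csv
import io

# Hypothetical miniature sample; the real Wiki4HE file has many more columns.
SAMPLE_CSV = """AGE,GENDER,YEARSEXP,PU1,PU2
40,0,14,4,?
42,0,?,2,3
37,1,8,?,?
"""

# Assumed split between demographic columns and Likert response columns.
DEMOGRAPHIC_COLUMNS = {"AGE", "GENDER", "YEARSEXP"}


def clean_rows(csv_text, demographic_columns):
    """Assign a unique ID to each respondent and recode "?" placeholders."""
    cleaned = []
    reader = csv.DictReader(io.StringIO(csv_text))
    for respondent_id, row in enumerate(reader, start=1):
        record = {"ID": respondent_id}
        for column, value in row.items():
            if value.strip() == "?":
                # Demographics become "unknown"; responses become null (None).
                record[column] = "unknown" if column in demographic_columns else None
            else:
                record[column] = value.strip()
        cleaned.append(record)
    return cleaned


def count_missing(records):
    """Count recoded placeholders per column, a rough completeness check."""
    counts = {}
    for record in records:
        for column, value in record.items():
            if value is None or value == "unknown":
                counts[column] = counts.get(column, 0) + 1
    return counts


records = clean_rows(SAMPLE_CSV, DEMOGRAPHIC_COLUMNS)
missing = count_missing(records)
```

Keeping “unknown” as an explicit category lets demographic charts still show those respondents, while null responses simply drop out of any Likert score summaries.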