Difference between revisions of "ISSS608 2016-17 T1 Assign2 Aditya Hariharan"

From Visual Analytics and Applications
Jump to navigation Jump to search
Line 22: Line 22:
 
There are certain columns whose data types must be changed for logical purposes. For example 'years of experience' is changed to continuous data while question columns can be changed from categorical to continuous.
 
There are certain columns whose data types must be changed for logical purposes. For example 'years of experience' is changed to continuous data while question columns can be changed from categorical to continuous.
 
[[File:Image 2 3.jpg|720px|thumbnail|center]]
 
[[File:Image 2 3.jpg|720px|thumbnail|center]]
 +
 +
Mean values have been taken for certain question types as new columns but only for question types with questions of similar meanings for whom taking a mean value would be logical.<br>
 +
[[File:Image 2 4.jpg|720px|thumbnail|center]]
 +
 +
=Data Exploration and Analysis=
 +
 +
The first inference made is from the ternary plot below which shows the relationship between the perceived quality, perceived visibility and ease of use experienced by the academia within the university.
 +
[[File:Image 2 5.jpg|thumbnail]]

Revision as of 18:03, 26 September 2016

Overview

Data is a precious thing and will last longer than the systems themselves.
~ Tim Berners-Lee
The purpose of this assignment is to find and gain certain useful insights with respect to a survey taken among the academia at a university about the usage of wikipedia and various questions regarding its relevance, usefulness and effectiveness as an open platform for learning and discovery. Wikipedia, as we all know, has grown into a worldwide open source of knowledge and information and this analysis seeks to find out what a sample set of academics think about the platform.

Data Set

The Dataset used for the analysis was taken from an online repository https://archive.ics.uci.edu/ml/datasets/wiki4HE which contained information regarding an ongoing research on university faculty perceptions and practices of using Wikipedia as a teaching resource.
wiki4HE dataset

Theme


The purpose of this analysis is to check for various insights related to the dataset chosen and to answer certain questions related to the answers given by the different age, domain and gender demographics within the sample data

Data Preparation

The first step in the data preparation process after loading data from the given excel file is to check for any missing data patterns within the data set

Image 2 1.jpg

Next we recode various columns in the data for a better understanding according to the meta data provided.

Image 2 2.jpg

There are certain columns whose data types must be changed for logical purposes. For example 'years of experience' is changed to continuous data while question columns can be changed from categorical to continuous.

Image 2 3.jpg

Mean values have been taken for certain question types as new columns but only for question types with questions of similar meanings for whom taking a mean value would be logical.

Image 2 4.jpg

Data Exploration and Analysis

The first inference made is from the ternary plot below which shows the relationship between the perceived quality, perceived visibility and ease of use experienced by the academia within the university.

Image 2 5.jpg