ISSS608 2016-17 T1 Assign2 Abhinav Ghildiyal

From Visual Analytics and Applications
Jump to navigation Jump to search

Abstract


Wikipedia is a multilingual, web-based, free-content encyclopedia project supported by the Wikimedia Foundation and based on a model of openly editable content. Now days many people use Wikipedia because of its ease, usefulness, visibility, quality, social image, incentives and many other factors. In the similar fashion many faculty members across the world use Wikipedia as a teaching tool in recent years. In this assignment we will try to investigate what are the main factors which makes Wikipedia a likable or an unlikable tool based on the surveyed data collected from the 2 Spanish universities, by surveying the faculty members of those universities.

Theme of Interest

In the context of Wiki4HE assignment, i am undertaking an investigation on the perception of the faculty members towards the use of Wikipedia as a teaching tool and do a study on their attitude towards Wikipedia. I have specifically selected the Profile and Sharing Attitude to answer the following question -

  1. What are the perceptions of people about contributing to Wikipedia ?
  2. How many people participate in social network and what is their perception about social networking ?
  3. Perception about publishing the work in open platforms ?
  4. How the perception of people change with Age and PhD degree?
  5. Perception when it comes to gender and domain ?
  6. How perception of people change when they are registered users of Wiki across all age types ?

Tools Used

JMP
Tableau
Hhigh D

Interactive Result

https://public.tableau.com/profile/publish/Book6_814/DemograhicDashboard#!/publish-confirm

Data Preparation

The data for this assignment is taken from the UCI Machine Learning Repository. This data is about the ongoing research on university faculty perceptions and practices of using Wikipedia as a teaching resource.
The data set wiki4HE is in the csv format. The first step here is to make the data in the readable format. The wiki4HE.csv file is delimited with semicolon, using the Text to Column the csv file that is delimited with semi-colon is changed to tabular format.

1.JPG


Loading the data in JMP and preparing it for Analysis -

  1. There are few variables whose data type need to be changed such as Gender, Phd and University, which need to be changed from Continuous to nominal. At the same time Years of Experience need to be changed to Continuous.
  2. The data types of the responses are in Character - Nominal, which should be changed to Numeric - Nominal.
  3. Checking the missing values using the Missing Value Pattern of JMP.
  4. Using the Distribution, I will check the type of values captured in the columns and the statistics of that variable. While analyzing the distribution I saw that many columns has “?” as the value. Now based on the column we need to change “?” to “Other Domian” in Domains and “Others” in other columns.
  5. The data has a Age range from 23 to 69, so based on the general statistics I have recoded the age, 23 - 35 as Young, 35 - 50 as Middle and 50 - 69 as Old.
  6. Now for ease we will recode the data, based on the attribute information that we have, we will recode the data in JMP.
        a.  	 AGE: numeric 
b. GENDER: 0=Male; 1=Female
c. DOMAIN: 1=Arts & Humanities; 2=Sciences; 3=Health Sciences; 4=Engineering & Architecture; 5=Law & Politics
d. PhD: 0=No; 1=Yes
e. YEARSEXP (years of university teaching experience): numeric
f. UNIVERSITY: 1=UOC; 2=UPF
g. UOC_POSITION (academic position of UOC members): 1=Professor; 2=Associate; 3=Assistant; 4=Lecturer; 5=Instructor; 6=Adjunct
h. OTHER (main job in another university for part-time members): 1=Yes; 2=No
i. ‘OTHER_POSITION (work as part-time in another university and UPF members): 1=Professor; 2=Associate; 3=Assistant; 4=Lecturer; 5=Instructor; 6=Adjunct
j. USERWIKI (Wikipedia registered user): 0=No; 1=Yes


Initial and Exploratory Analysis -

  1. Checking the correlation between the Age and Year of Experience
3.jpg

From the bivariate analysis we can infer that the Age and Year of Experience is not much correlated. They have the RSquare value of 0.30 which shows that the collinearity is less between these two variables.

2. From the Contingency analysis of Domain and USERWIKI we can infer that professors with domain “Art and Humanities” are the maximum who are registered wiki users followed by “Engineering and Architecture”, “Law and Politics”, “Health Science” and “Sciences”.

4.jpg


3. From oneway analysis of age by domain we can infer that all domains have age ranging from 23 to 69.

5.jpg


4. Ternary plot to see the correlation between the responses faculty has given using Likert scale ranging from strongly disagree (1) to strongly agree (5). From the chart below we can see that most of the data points lies in the centre, which means that if a person has given rating 1 for PU1 question then he has given the same or with a difference of 1 rating for the PU2 and PU3 questions, the same pattern is seen in other categories as well.

6.jpg

There are 43 questions, for which we have the response, which were asked by the professors and the ratings were taken, however these 43 questions are categorized under 13 questions. So I have taken the mean of the scores of the subcategory questions and formed the new column with the main category question with that mean value. So now the 43 questions columns are reduced to 13 columns. Below is the formula that I have used to take the mean.

2.jpg

5. The mosaic between Age and Domain shows that the ratio of Middle age people is more as compared to Young and Old age and in that, out of the 5 domains 20% are in Arts and Humanities followed by Engineering and Architecture.

7.jpg


6. Now we see the ratio of university position in UOC university, we do the distribution of university and the UOC_Position, and if we select UOC , we could see that 72% are adjunct followed by 5% assistant, 7% associates, 2% lecturer, .2% instructor, .3% professor and 12% others. At the same time if we select UOC_POSITION adjunct, we could see that all the 3 age groups are in adjunct but maximum are from middle followed by young and old.

8.jpg

7. From the below graph we can infer that –

       a.	Almost all the Associates have PhD degree except 1.
b. There are 2 instructors and 1 has PhD.
c. All the professors has PhD.
d. None of the Lecturer has PhD degree.
e. 62% of Adjunct have PhD.
9.jpg

Visual Analysis

1. How perception varies according to Age and PhD

The observation that we can make here is that people when young have varied perception depending on whether they have done there PhD or not but as they grow old the perception of the people irrespective of whether they are PhD or not become quite similar when it comes to the usefulness, ease, enjoyment, visibility, quality and other features of the wiki. From the below graph we can see that in the young column the difference between the shapes is more, in the middle the difference has reduces but in the old column the shapes are overlapping stating the perception of old people irrespective of there PhD is similar.

10.jpg


2. Peoples perception upon contributing to Wikipedia

From the below analysis we can deduce that people from any position or from any domain have given the least rating in Profile survey questions, which state the contribution to blogs, active participation in social networks and publishing academic content in open platforms. However, at the same time they also confirm by giving the highest rating for the Sharing attitude question that it is important to share academic content in open platforms, it is important to publish research results in other media than academic journals or books and it is important that students become familiar with online collaborative environments. Which means that people who do not contribute to blogs, actively participate in social networks and do not publish academic content actually confer that these things should be done.

12.jpg
13.jpg


3. Contribution to Wikipedia by Age

In the above analysis we saw that irrespective of domain and position, people rated Profile the least and Sharing attitude the maximum. However in this analysis we see how perception of contributing to Wikipedia changes with age. Here we see that people who are in the old age category have rated Profile the least whereas the young and the middle age people have given the least value to the Use Behaviour. So, from the trend above we can confer that old age people are the one who least contribute to blogs, have very less participation in social networks and have very less publications in open networks followed by young. Middle age people contribute a little more to blogs and social media.

A.jpg


4. If the respondent is a registered user

People who are registered user of Wiki have established opinion and there is not much variation in there decision when it comes to scoring the wiki based on different criteria, but the unregistered respondents, whom we are assuming have less insight of wiki, have varied perception about the use, quality, visibility and other factors of Wiki. From the graph below we can analyse that irrespective of the age, respondents who are registered user have a very uniform perception, as they are the registered users of the Wikipedia and they know the they can determine the acceptance and resistance of it.

B.jpg

5. Usage of Wikipedia based on Domain and Gender

From this visualization we can perceive that males who are in “Engineering and Architecture” , “Science” and “Law and Politics” domain prefer more wiki than the females in these domain, however female in “Arts and Humanity” and “Health Science” prefer wiki more than males in these domains. But the maximum difference in the perception of males and females is in the “Engineering and Architecture” domain. The other thing that we can conclude is wiki is most used for the "Engineering and Architecture" as well as for "Arts and Humanities"

C.jpg


6. Perception of people on usage of Wiki

From this parallel plot we can confer that if we select domain as engineering and age as middle, adjunct are the ones who say that they use wiki to develop their teaching material. All other have given less rating when it comes to using wiki to develop teaching material. Everyone has given relatively less rating when it comes to USE2 i.e usage of wiki as a platform to develop educational activities with students. However, everyone has given comparatively more rating when it comes to the recommendation of Wiki to students and colleagues, and they all agree that their students use Wikipedia in their courses.

E.jpg



Comments