Difference between revisions of "ISSS608 2016-17 T1 Assign2 Agrim Gairola"
Agrimg.2016 (talk | contribs) m |
Agrimg.2016 (talk | contribs) |
||
Line 43: | Line 43: | ||
[[File:Ag7.jpg|400px|frameless|center]] | [[File:Ag7.jpg|400px|frameless|center]] | ||
Experience: 50% participants have over 4-15 Years of experience. This shows that the data set has a wide range of experience among participants<br/> | Experience: 50% participants have over 4-15 Years of experience. This shows that the data set has a wide range of experience among participants<br/> | ||
− | [[File:Ag8| | + | [[File:Ag8.jpg|400px|frameless|center]] |
− | It is interesting to note that almost 72% of the faculty is adjunct staff.</br> | + | UOC Position:It is interesting to note that almost 72% of the faculty is adjunct staff.</br> |
− | + | [[File:Ag9.jpg|400px|frameless|center]] | |
− | + | Domain: For 39.5% of the participants domain mentioned as 6 which has been assumed as “others”. A large number of participants belong to Arts and Humanities and Science. | |
− | + | [[File:Ag10.jpg|400px|frameless|center]] | |
− | + | Registered User: Another interesting thing to note is that majority of users of Wikipedia are unregistered. | |
− | + | [[File:Ag11.jpg|500px|frameless|center]] | |
=<b>2015 VS 2016 TRENDS</b>= | =<b>2015 VS 2016 TRENDS</b>= |
Revision as of 17:09, 26 September 2016
ISSS608 2016-17 T1 Assign1_Agrim Gairola
Contents
Abstract
The assignment involves study of data based on a survey conducted among the faculty of two Spanish Universities on various aspects of Wikipedia. A set of 44 questions were asked from 913 members of the University on 13 different subjects of perception. The task at hand is to identify interesting patterns revealed in the survey regarding the perception of Wikipedia
Motivation
The assignment would enable us to gather interesting insights and patterns into the perception of people on Wikipedia based on its use, image, ease and several other factors.
Tools Used
- Tableau version 10.0
- JMP Pro
- Treemaps HCI
- Microsoft Office
Data Preparation
The following steps were carried out to prepare the data for effective analysis:
Data Manipulation: A unique ID was given to each record for the ease of analysis.
Data Type Conversion: On importing the data into JMP, age and work experience was kept in continuous data type. All the remaining data was converted to nominal data type.
Missing data analysis: Missing data analysis was performed on the data in order to identify the missing data and suitably recoding them.
Assumption: There were several unambiguous values that could be noted throughout the data set. These values were recoded based on the below assumptions:
All “?” values in survey items were taken as 2.5 such that it does not hamper the analysis while comparing the mean scores.
Additional Columns for Categories: Additional columns were created for each of the categories such that it represented the survey items under it. For e.g.: A new column was created for Quality which would have the mean of values in QU1,QU2,QU3,QU4,QU5 thus representing the overall score for quality for the ease of analysis.
Demographics
In order to understand the data set accurately, let us first analyse the demographics.
Treemap: Below is a screenshot along with the link to the video of a treemap with several different hierarchies. This treemap accurately shows the demographics of the data in one look.
Distribution: On analysis of the distribution of the data, the following interesting patterns can be seen regarding the demographics of the participants:
Age: Most participants (80%) who took part in the survey were between the age 32-53
Gender: The survey comprised of 58% males and 42% females.
Experience: 50% participants have over 4-15 Years of experience. This shows that the data set has a wide range of experience among participants
UOC Position:It is interesting to note that almost 72% of the faculty is adjunct staff.
Domain: For 39.5% of the participants domain mentioned as 6 which has been assumed as “others”. A large number of participants belong to Arts and Humanities and Science.
Registered User: Another interesting thing to note is that majority of users of Wikipedia are unregistered.
2015 VS 2016 TRENDS
Approaches
- Sales trend:In order to analyse the sales trends, we plot a area chart indicating sales quarter by quarter for 2015 and first half of 2016
- Understanding the data:On analysis of the trend, we notice that the sales peak during the second quarter of 2015 as well as 2016. On a more general note, we also notice that the end of 2015 saw a fall in the sales of flats. However, 2016 saw an increase in the sales.
- Sales Trends by location: On analyzing the sales trends by locations, we notice that most of the locations follow a trend similar to the general trend.
- On deep-diving into this graph, we notice that the downtown Area does not follow the general trend.
Results
From the above graphs, the following conclusions can be made:
- We notice the highest sales of flats happens in Q2 of 2015 and the same trend is seen in 2016. Ear end bonus recieved at the financial year could be the reason behind this trend.
- The market appears to be going down at the end of the year 2015 however the overall trend shows that the market picks up in the Q2 of 2016 after seeing a drop in Q1 of 2016.
DISTRIBUTION OF PRICES IN 2015
Approaches
- Geographic distribution of prices: Below is the map of Singapore with color gradients indicating the median prices. It shows the prices based on geographical location.It can be observed that most expensive flats are located in the area around the downtown area
- Relation of Resale price with Number of Stories: As can be seen from the graphic, as the number of stories go higher, the prices appear to increase. On isolating the Downtown area, it can be noted almost all high storied expensive flats are located in Downtown Core.
- Relation of price with flat type: As can be observed form the below figure, the highest price is garnered by Multi-generation housing. This is closely followed by Executive housing and then 5 room housing.
Results
From the above graphs, the following conclusions can be made:
- The most expensive flats are located around the Central Business District area. These high prices could be the reason for the sales dropping in Q2 2016 in the downtown core area as discussed in the previous section.
- Flats situated at higher stories garner higher prices. Most of these High storied expensive flats are located in Downtown area.
- The highest prices are garnered by Multi-generation housing. This is closely followed by Executive housing and then 5 room housing.
SHARE OF PUBLIC HOUSING IN 2015
Approaches
- Share of Sale of Flat by number of Rooms: A large population of Singapore seems to believe that a 4 bedroom HDB suits their needs the best.
- Share of flat type by Location: The highest number of sales appear to be in the extreme east (Tampines) and extreme west (Jurong West) of Singapore. On cross referencing the below figure with the map in the previous section, we notice that the sale is higher in areas farther away from the downtown area.
- Share of property by number of stories: Contrary to the popular belief, flats at higher floors are not very popular. Most people prefer to buy flats in stories between 3-12. This could be because of the direct relation between the number of story and the prices.
Results
From the above graphs, the following conclusions can be made:
- 4 Room flats are the most popular types of houses among Singaporeans.
- The highest sales take place in Jurong west and Tampines. This is closely followed by Woodlands. We can conclude that the sales are higher in areas where the prices are lower(referring to the map in section 2) ie away from downtown area.
- Most Singaporeans prefer to buy flats on stories between 3-12.