Difference between revisions of "CATalytics"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
Line 3: Line 3:
  
 
{| style="background-color:#f3e180; color:#f3e180 padding: 5px 0 0 0;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0" |
 
{| style="background-color:#f3e180; color:#f3e180 padding: 5px 0 0 0;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0" |
| style="padding:0.3em; font-family:Segoe UI; font-size:120%; border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; border-left:2px solid #f3e180; background:#614333; text-align:center;" width="25%" |  
+
| style="padding:0.3em; font-family:Segoe UI; font-size:120%; border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; border-left:2px solid #f3e180; text-align:center;" width="19%" |  
 +
 
 
[[Project_Groups|<font color="#FFFFFF"><B>HOME</B></font>]]
 
[[Project_Groups|<font color="#FFFFFF"><B>HOME</B></font>]]
 
| style="border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180;" width="1%" | &nbsp;
 
| style="border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180;" width="1%" | &nbsp;
| style="padding:0.3em; font-family:Segoe UI; font-size:120%; border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180; text-align:center;" width="20%" |
+
| style="padding:0.3em; font-family:Segoe UI; font-size:120%; border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#614333; text-align:center;" width="19%" |
 +
 
 
[[CATalytics|<font color="#FFFFFF"><B>PROPOSAL</B></font>]]
 
[[CATalytics|<font color="#FFFFFF"><B>PROPOSAL</B></font>]]
 
| style="border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180;" width="1%" | &nbsp;
 
| style="border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180;" width="1%" | &nbsp;
| style="padding:0.3em; font-family:Segoe UI; font-size:120%; border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180; text-align:center;" width="20%" |   
+
| style="padding:0.3em; font-family:Segoe UI; font-size:120%; border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180; text-align:center;" width="19%" |   
 +
 
 
[[CATalytics_Poster|<font color="#FFFFFF"><B>POSTER</B></font>]]
 
[[CATalytics_Poster|<font color="#FFFFFF"><B>POSTER</B></font>]]
 
| style="border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180;" width="1%" | &nbsp;
 
| style="border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180;" width="1%" | &nbsp;
| style="padding:0.3em; font-family:Segoe UI; font-size:120%; border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180; text-align:center;" width="20%" |   
+
| style="padding:0.3em; font-family:Segoe UI; font-size:120%; border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180; text-align:center;" width="19%" |   
 +
 
 
[[CATalytics_Application|<font color="#FFFFFF"><B>APPLICATION</B></font>]]
 
[[CATalytics_Application|<font color="#FFFFFF"><B>APPLICATION</B></font>]]
 
| style="border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180;" width="1%" | &nbsp;
 
| style="border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180;" width="1%" | &nbsp;
| style="padding:0.3em; font-family:Segoe UI; font-size:120%; border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180; text-align:center;" width="20%" |
+
| style="padding:0.3em; font-family:Segoe UI; font-size:120%; border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180; text-align:center;" width="19%" |
 +
 
 
[[CATalytics_Research Paper|<font color="#FFFFFF"><B>RESEARCH PAPER</B></font>]]
 
[[CATalytics_Research Paper|<font color="#FFFFFF"><B>RESEARCH PAPER</B></font>]]
 
| style="border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180;" width="1%" | &nbsp;
 
| style="border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180;" width="1%" | &nbsp;
| style="padding:0.3em; font-family:Segoe UI; font-size:120%; border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180; text-align:center;" width="20%" |
+
| style="padding:0.3em; font-family:Segoe UI; font-size:120%; border-bottom:2px solid #f3e180; border-top:2px solid #f3e180; background:#f3e180; text-align:center;" width="25%" |
 
|}
 
|}
 
<BR>
 
<BR>

Revision as of 17:05, 19 October 2016

CATALYTICS Logo.jpg


HOME

 

PROPOSAL

 

POSTER

 

APPLICATION

 

RESEARCH PAPER

 


PROBLEM & MOTIVATION

According to a survey of more than 14,000 international students in Australia and the UK, 77% of students listed both university and subject rankings as “very important” when deciding a place to study. As such, given the importance of these rankings in decision-making, we were inclined to provide an interactive outlet to aid students in their decision-making process.

In addition, most rankings systems online are in a tabular format. This makes it difficult for anyone to truly visualize the information being displayed and how various factors interact with each other. Thus, our team aims to create a dynamic visual that not only provides interactivity between factors but also reveal hidden patterns of associations that tabular formats might miss out.

Who can use our visual?

  • Pre-university selection for students
  • University students planning to go for exchange

What can be derived from our visual?

  • Effect of factors on ranking of university
  • Location of universities
  • Trends of school make-up
  • Overall ranking of universities
  • Multi-Comparisons between universities based on factors


SELECTED DATASET

Our project will be based on the Times Higher Education World University Rankings. (Extracted from: https://www.kaggle.com/mylesoneill/world-university-rankings)

Our project will analyze the following attributes from the dataset:

Label Description
world_rank world rank for the university. Contains rank ranges and equal ranks (eg. =94 and 201-250)
university_name name of university
country country of each university
teaching university score for teaching (the learning environment)
international university score international outlook (staff, students, research)
research university score for research (volume, income and reputation
citations university score for citations (research influence)
income university score for industry income (knowledge transfer)
total_score total score for university, used to determine rank
num_students number of students at the university
student_staff_ratio number of students divided by number of staff
international_students percentage of students who are international
female_male_ratio female student to Male student ratio
year year of the ranking (2011 to 2016 included)

Performance indicators were grouped into five areas:

Performance Indicator Description
Teaching The learning environment
Research Volume, income and reputation
Citations Research influence
International outlook Staff, students and research
Industry income Knowledge transfer

Note: Universities were excluded from the rankings if (i) they do not teach undergraduates or (ii) research output amounted to fewer than 200 articles per year over a five year period (2010 to 2014).

Breakdown of Performance Indicators

1teaching.png 2research.png
3citations.png 4IntoutlookIncome.png
Source: https://www.timeshighereducation.com/news/ranking-methodology-2016

APPROACH

After referencing to multiple previous works that display university rankings, we decided to focus only on The Times Higher Education World University Rankings The rationale is that the dataset provided is robust enough to tell a good story, and that we cannot feasibly combine data from multiple sources without having missing or irrelevant values. Our research highlighted existing visualisations of university rankings as shown below. These serve as references to our project so that we are aware of what visualisations are already being used for similar datasets.

TableRank.png
1) World University Rankings 2015-2016

We started with the web app provided by the official Times Higher Education website itself to see what how they visualized their own dataset. We realised they provided good filtering systems, intuitive drill down/up processes and efficient sorting systems. However, the app was found to be too tabular which could potentially miss out underlying patterns a proper visualization could show. In addition, there were limited ways we could compare between universities of our choices. Thus, we decided that this reference could be used as a baseline to help us understand which filtering categories are important.

Parallelcoord.png
2) Caleydo LineUp: Visual Analysis of Multi-Attribute Rankings

Next, we wanted to value add to an existing visualisation for university rankings and its factors. In this reference, the user utilizes bar charts and parallel coordinates to show how each key indicators relate to one another. Though it is already informative, we wanted to find an alternative way to not only showcase different types of comparisons between universities within a year but also attempt to form a visualization that could compare universities across the years.

Correlogram.PNG

Scatterplot.PNG
3) Which universities do good science?

This work from Kaggle itself shows us the benchmark and inspiration for which variables are used to compare and gain insights from. The work features mostly scatterplots linked to a correlogram. The correlogram shows the proportion of the dataset which linearly correlates with each other based on specific 2 variables. Specific scatterplots, which are linked from this correlogram, are selected to be further analysed. Through this, we decided to create a similar multi-variate approach to visualise the data through the use of glyphs and ternary plots.

Referencing from these visualisations, we decided to employ different visualisation techniques. We have explained the rationale below.

Glyph.PNG
Ternary Plot
  • Glyphs can handle multiple variables. In our dataset, we have pin-pointed 5 main performance indicators which affect the university rankings.
  • Glyphs also allow layering of data for comparison, either across years or among different schools.
  • The larger the area plotted in a glyph, the higher the different performance indicators as well as the ranking of the selected school. Hence, at a glance, we can observe which schools are ranked higher than the other.
  • Glyphs also allow us to pinpoint which performance indicators of a specific school are more highly ranked.
TernaryPlot.PNG
Spider Chart (Glyph)
  • Ternary plots allow us to observe the correlation of 3 variables - M/F ratio, Student/Teacher ratio and Local/International student ratio. This allows us to observe the movement in trend across years of specific schools through the movement of data points within a ternary chart.
  • From this chart, we can also observe how important one ratio is over another, and how this affects the ranking of the specific school.
D3js map.PNG
World Map (with zoom/pan capabilities)
  • A zoomable map allows us to focus on certain countries or regions where schools on the university ranking are located
  • For more information on the school, users can click the pins on the map to be brought to a Google search of the school
STORYBOARD

The pictures below serve as a mock-up to the proposed application.

Screen Description
Storyboard - Page 1.jpg
Screen 1 - Analysing a single university/multiple universities according to performance indicator
The first button "Factors" will bring users to this page.
  • Users will first select whether they want to look into the performance indicators of a single university or multiple universities by clicking on the respective radio button
  • Users can choose the year(s) to display as well as the university/universities to be displayed
  • If a single university is selected, the map will zoom into the country of the university. A tooltip pop-up will appear showing the rank, country and number of students and international students of the university. The female-male ratio and the staff-student ratio will be shown as bar graphs.
  • The star glyph will show the respective positions of the performance indicators, with overlapping graphs showing either the performance indicators of a single university across years, or the performance indicators across universities in one year.
Storyboard - Page 2.jpg
Screen 2 - Analysing the trend of university make-up through the years

The second button "Trend" will bring users to this page.

  • Users select the rank type of the universities. Proposed types include:
    • Selecting specific countries
    • Top 10 universities
    • Top 100 universities etc
  • Users can drag the arrow to view the resulting ternary plot across the years
  • The ternary plot will animate to show the respective data points in the respective years
  • The map will also show which country/countries the university/universities is/are from
Storyboard - Page 3.jpg
Screen 3 - Viewing the ranking of universities

The third button "Ranking Table" will bring users to this page.

  • Users will be able to see the ranking of the universities
  • Users can filter by the respective elements
TIMELINE


CATALYTICS Timeline.jpg
REFERENCES

Visuals html code examples

Problem and motivation survey statistics

Dataset


OUR TEAM

GROUP 12
1. Albert BINGEI
2. Cornelia Tisandinia LARASATI
3. Timothy TAN Swee Guang

COMMENTS AND/OR FEEDBACK?