Difference between revisions of "Group03 proposal"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
Line 111: Line 111:
 
!| Learning Points
 
!| Learning Points
 
|-
 
|-
| [[File:RelatedWork1.png]]
+
| [[File:RelatedWork1.png]] https://www.daxx.com/blog/development-trends/number-software-developers-world
 
|  
 
|  
 
This Choropleth map shows the distribution of the number of professional software developers in Europe by country  
 
This Choropleth map shows the distribution of the number of professional software developers in Europe by country  
Line 120: Line 120:
 
** There is a lack of labeling on the map. For countries that have similar shading, it can become difficult for readers to differentiate the rankings for those countries.
 
** There is a lack of labeling on the map. For countries that have similar shading, it can become difficult for readers to differentiate the rankings for those countries.
 
|-
 
|-
| [[File:RelatedWork2.png]]
+
| [[File:RelatedWork2.png]] https://www.wearedevelopers.com/business/developer-survey/#summary-download
 
|  
 
|  
This Choropleth map shows the distribution of the number of professional software developers in Europe by country
+
This dashboard shows the demand for the top programming languages, frameworks and level experience.
 
* Pros:  
 
* Pros:  
** It is very effective in showing the distribution of professional software developers in a glance. Darker shades represent a higher concentration of developers and vice versa.
+
** This dashboard is pretty comprehensive. Labels are also clear and concise. It is easy for readers to get the overview
* Cons:
+
 
** There is a lack of labeling on the map. For countries that have similar shading, it can become difficult for readers to differentiate the rankings for those countries.
+
* Cons:  
 +
** The bar colors used for the top 10 programming languages are not consistent. For the bottom 5  languages, it used a gradient of grey, while the top 5 used different colors. Gradient of the same color should be used for conveying quantity/intensity and not for differentiating between different categories.
 
|-
 
|-
| [[File:RelatedWork3.png]]
+
| [[File:RelatedWork3.png]] https://insights.stackoverflow.com/survey/2019
|  
+
|
This Choropleth map shows the distribution of the number of professional software developers in Europe by country
+
This visualization shows the relation between Men/Women developers to Developer Role. The X-axis shows the ratio of men’s to women’s developers.
 
* Pros:  
 
* Pros:  
** It is very effective in showing the distribution of professional software developers in a glance. Darker shades represent a higher concentration of developers and vice versa.
+
** The visualization shows a good correlation between Gender against Developer Role
 
 
 
* Cons:
 
* Cons:
** There is a lack of labeling on the map. For countries that have similar shading, it can become difficult for readers to differentiate the rankings for those countries.
+
** The X-axis can be confusing for the readers on what the 10x represents, clearer labels or axis title could have been used.  
 
|-
 
|-
 
| [[File:RelatedWork4.png]]
 
| [[File:RelatedWork4.png]]
 
|  
 
|  
This Choropleth map shows the distribution of the number of professional software developers in Europe by country
+
This Proportional Symbol Map shows the hottest programing languages across different countries
 
* Pros:  
 
* Pros:  
** It is very effective in showing the distribution of professional software developers in a glance. Darker shades represent a higher concentration of developers and vice versa.
+
** This is a good visualization for coders to view which languages are more relevant in the different countries.
  
 
* Cons:
 
* Cons:
** There is a lack of labeling on the map. For countries that have similar shading, it can become difficult for readers to differentiate the rankings for those countries.
+
** For readers with less geographical knowledge, it can be difficult to identify the exact countries  
 
|}
 
|}
 
<br/>
 
<br/>
 +
 
==<div style="background:#f67c6c; padding: 15px; font-weight: bold; line-height: 0.3em; letter-spacing:0.5em;font-size:20px"><font color=#fff face="Century Gothic"><center>BRAINSTORMING SESSION</center></font></div>==
 
==<div style="background:#f67c6c; padding: 15px; font-weight: bold; line-height: 0.3em; letter-spacing:0.5em;font-size:20px"><font color=#fff face="Century Gothic"><center>BRAINSTORMING SESSION</center></font></div>==
 
<br/>
 
<br/>

Revision as of 16:56, 24 February 2020


Insert Logo


Team

 

Proposal

 

Poster

 

Application

 

Research Paper


<--- Go Back to Project Groups

PROBLEM & MOTIVATION


In recent years, there is massive growth in the software industry. In order to help prospective coders who wish to pursue a career in this industry, it is crucial for them to understand what is in demand and what to expect from the industry. This will help to guide young coders, such as ourselves, to better manage our expectations and make more informed decisions while we are preparing to join this booming industry.

OBJECTIVE


For our project, we will be focusing on 3 main objectives. It is as follows:

  • Gain overall insights on developers demographics (the Stack Overflow community)
  • Gain insights on job prospects for developers and their work culture. This would provide a glimpse into the software industry and allow users to understand the relations among salary, work hours and challenges faced by developers
  • Understand the most popular/relevant programming languages, databases, frameworks and platforms. This will allow prospective coders to know which skills and knowledge that give them the best advantage in the industry.


SELECTED DATASET

We chose the StackOverflow Developer Survey 2019 dataset (at https://www.kaggle.com/mchirico/stack-overflow-developer-survey-results-2019), as StackOverflow is currently the largest online developer community. The dataset provided is freely accessible, and analysis of this dataset would provide a glimpse about the overall developer community.

The dataset contains 88,883 survey responses, with each row corresponding to one respondent, and each of the 85 different columns corresponding to the survey questions. Below is a quick summary about the data provided and their attributes, categorized by each of our 3 main objectives as mentioned above.

Data Attributes Data Provided
Background Likert
  • Extent of considering oneself as a stack overflow member
Numerical, Discrete
  • Age
Categorical
  • Gender
  • Ethnicity
  • Profession
  • Education
  • Frequency and Purpose of using StackOverflow
Binary
  • Coding for hobby
  • Have dependents
Job prospects Likert
  • Job satisfaction
  • Job competence
Numerical, Continuous
  • Salary
Numerical, Discrete
  • Hours worked a week
  • Hours spent on code review
Categorical
  • Work structure, work challenges, working remotely
  • Code review
Skills Categorical
  • Programming languages, databases, platforms, and web frameworks
  • Developer tools used
  • Operating system used

BACKGROUND SURVEY


Reference of Other Visualization Learning Points
RelatedWork1.png https://www.daxx.com/blog/development-trends/number-software-developers-world

This Choropleth map shows the distribution of the number of professional software developers in Europe by country

  • Pros:
    • It is very effective in showing the distribution of professional software developers in a glance. Darker shades represent a higher concentration of developers and vice versa.
  • Cons:
    • There is a lack of labeling on the map. For countries that have similar shading, it can become difficult for readers to differentiate the rankings for those countries.
RelatedWork2.png https://www.wearedevelopers.com/business/developer-survey/#summary-download

This dashboard shows the demand for the top programming languages, frameworks and level experience.

  • Pros:
    • This dashboard is pretty comprehensive. Labels are also clear and concise. It is easy for readers to get the overview
  • Cons:
    • The bar colors used for the top 10 programming languages are not consistent. For the bottom 5 languages, it used a gradient of grey, while the top 5 used different colors. Gradient of the same color should be used for conveying quantity/intensity and not for differentiating between different categories.
RelatedWork3.png https://insights.stackoverflow.com/survey/2019

This visualization shows the relation between Men/Women developers to Developer Role. The X-axis shows the ratio of men’s to women’s developers.

  • Pros:
    • The visualization shows a good correlation between Gender against Developer Role
  • Cons:
    • The X-axis can be confusing for the readers on what the 10x represents, clearer labels or axis title could have been used.
RelatedWork4.png

This Proportional Symbol Map shows the hottest programing languages across different countries

  • Pros:
    • This is a good visualization for coders to view which languages are more relevant in the different countries.
  • Cons:
    • For readers with less geographical knowledge, it can be difficult to identify the exact countries


BRAINSTORMING SESSION



PROPOSED STORYBOARD



COMMENTS


Feel free to leave us some comments on where we can improve!

No. Name Date Comments
1. Insert your name here Insert date here Insert comment here
2. Insert your name here Insert date here Insert comment here
3. Insert your name here Insert date here Insert comment here