Difference between revisions of "Group03 proposal Version 2"
Line 185: | Line 185: | ||
** Clicking on a dot or country on the map applies a filter that will update the other 2 charts based on the selected country | ** Clicking on a dot or country on the map applies a filter that will update the other 2 charts based on the selected country | ||
** Hovering over a country on the map shows a tooltip that describes the number of developers. Same thing for the bar charts. | ** Hovering over a country on the map shows a tooltip that describes the number of developers. Same thing for the bar charts. | ||
− | Tools/R Libraries used: ggplot2 | + | *Tools/R Libraries used: ggplot2 |
− | Data fields: Country, Gender, | + | *Data fields: Country, Gender, |
|- | |- | ||
| <center> | | <center> | ||
Line 194: | Line 194: | ||
|| | || | ||
*This relation network chart allows user to see which are the technologies that developers use or know. | *This relation network chart allows user to see which are the technologies that developers use or know. | ||
− | Tools/R Libraries used: ggplot2 | + | *Tools/R Libraries used: ggplot2 |
− | Data fields: LanguagesWorkedWith | + | *Data fields: LanguagesWorkedWith |
|- | |- | ||
| <center> | | <center> | ||
Line 206: | Line 206: | ||
* The charts are sorted in descending order of count, so that users are able to see the most popular ones first | * The charts are sorted in descending order of count, so that users are able to see the most popular ones first | ||
** The sort order can be changed from top to bottom, and the number of box plots to show can be set by the user | ** The sort order can be changed from top to bottom, and the number of box plots to show can be set by the user | ||
− | Tools/R libraries used: Plotly | + | *Tools/R libraries used: Plotly |
− | Data fields: LanguagesWorkedWith, LanguagedDesiredNextYear | + | *Data fields: LanguagesWorkedWith, LanguagedDesiredNextYear |
|- | |- | ||
| <center> | | <center> | ||
Line 219: | Line 219: | ||
** Users can select the sort order and the category from the menu on the left | ** Users can select the sort order and the category from the menu on the left | ||
** Users can also set the reference line for the divergent stacked bar chart to control how the chart is visualized. | ** Users can also set the reference line for the divergent stacked bar chart to control how the chart is visualized. | ||
− | Tools/R Libraries used: ggplot2 | + | *Tools/R Libraries used: ggplot2 |
− | Data fields: LanguagesWorkedWith, LanguagedDesiredNextYear | + | *Data fields: LanguagesWorkedWith, LanguagedDesiredNextYear |
|- | |- | ||
| <center> | | <center> | ||
Line 232: | Line 232: | ||
** Users can select the sort order and the category from the menu on the left | ** Users can select the sort order and the category from the menu on the left | ||
** Users can also set the reference line for the divergent stacked bar chart to control how the chart is visualized. | ** Users can also set the reference line for the divergent stacked bar chart to control how the chart is visualized. | ||
− | Tools/R Libraries used: ggplot2 | + | *Tools/R Libraries used: ggplot2 |
− | Data fields: JobFactors | + | *Data fields: JobFactors |
|- | |- | ||
| <center> | | <center> | ||
Line 245: | Line 245: | ||
** Users can select the sort order and the category from the menu on the left | ** Users can select the sort order and the category from the menu on the left | ||
** Users can also set the reference line for the divergent stacked bar chart to control how the chart is visualized. | ** Users can also set the reference line for the divergent stacked bar chart to control how the chart is visualized. | ||
− | Tools/R Libraries used: plotly | + | *Tools/R Libraries used: plotly |
− | Data fields: | + | *Data fields: |
|- | |- | ||
| <center> | | <center> | ||
Line 258: | Line 258: | ||
** Users can select the sort order and the category from the menu on the left | ** Users can select the sort order and the category from the menu on the left | ||
** Users can also set the reference line for the divergent stacked bar chart to control how the chart is visualized. | ** Users can also set the reference line for the divergent stacked bar chart to control how the chart is visualized. | ||
− | Tools/R Libraries used: ggplot2 | + | *Tools/R Libraries used: ggplot2 |
− | Data fields: JobSat, Gender, Country | + | *Data fields: JobSat, Gender, Country |
|- | |- | ||
| <center> | | <center> | ||
Line 271: | Line 271: | ||
** Users can select the sort order and the category from the menu on the left | ** Users can select the sort order and the category from the menu on the left | ||
** Users can also set the reference line for the divergent stacked bar chart to control how the chart is visualized. | ** Users can also set the reference line for the divergent stacked bar chart to control how the chart is visualized. | ||
− | Tools/R Libraries used: Plotly | + | *Tools/R Libraries used: Plotly |
− | Data fields: WorkWeekHrs, Gender, Country | + | *Data fields: WorkWeekHrs, Gender, Country |
|} | |} | ||
Revision as of 21:40, 5 April 2020
Version 2
|
<--- Go Back to Project Groups
Contents
PROBLEM & MOTIVATION
Stack Overflow is arguably the biggest online community for developers all around the world. Each year, Stack Overflow field a survey with questions ranging from developers’ favorite technologies to their job preferences. This is done to allow Stack Overflow to better understand its active users. For 2019, nearly 90,000 developers participated in this 20-minute survey. Although the official Stack Overflow website provides a series of visualizations for every survey question, we found that there is a lack of comprehensive and interactive visualizations. After much deliberation, our team has narrowed down to three main aspects that we think are the most interesting and useful to the general users: Technology, Salary, and Job.
Our team aims to build user-friendly dashboards that highlight the most interesting aspects of the active developers in Stack Overflow. This will allow any interested users to get a quick overview of the essential results of this annual survey without the need to tediously scroll through a long list of static visualizations. Additionally, through various useful interactive filters, users can customize their explorations and gain more meaningful insights that suit their needs.
In order to help us expedite the building of these interactive visualizations, our team has decided to use R which gives us access to a wide variety of libraries and tools for data preprocessing and building of user-friendly dashboards.
OBJECTIVE
For our project, we will be focusing on 3 main objectives. It is as follows:
Gain overall insights on developers demographics (the Stack Overflow community)
Gain insights on job prospects for Stack OverFlow developers and their work culture.
Understand the most popular/relevant programming languages, databases, frameworks, and platforms that are used by the Stack OverFlow developers.
SELECTED DATASET
We chose the StackOverflow Developer Survey 2019 dataset (at https://www.kaggle.com/mchirico/stack-overflow-developer-survey-results-2019), as StackOverflow is currently the largest online developer community. The dataset provided is freely accessible, and analysis of this dataset would provide a glimpse about the overall developer community.
The dataset contains 88,883 survey responses, with each row corresponding to one respondent, and each of the 85 different columns corresponding to the survey questions. Below is a quick summary about the data provided and their attributes, categorized by each of our 3 main objectives as mentioned above.
Data Attributes | Data Provided | |
---|---|---|
Background | Likert |
|
Numerical, Discrete |
| |
Categorical |
| |
Binary |
| |
Job prospects | Likert |
|
Numerical, Continuous |
| |
Numerical, Discrete |
| |
Categorical |
| |
Skills | Categorical |
|
BACKGROUND SURVEY
Reference of Other Visualization | Learning Points |
---|---|
![]() |
This Choropleth map shows the distribution of the number of professional software developers in Europe by country
|
![]() |
This dashboard shows the demand for the top programming languages, frameworks and level experience.
|
![]() |
This visualization shows the relation between Men/Women developers to Developer Role. The X-axis shows the ratio of men’s to women’s developers.
|
![]() |
This Proportional Symbol Map shows the hottest programing languages across different countries
|
PROPOSED STORYBOARD
Storyboard | Insights / Comments |
---|---|
Title: OVERVIEW OF DEVELOPER DEMOGRAPHICS |
|
Title: ANALYSIS OF DEVELOPER'S TECHNOLOGY |
|
Title: ANALYSIS OF DEVELOPER'S LOVED/NOT LOVED TECH |
|
Title: ANALYSIS OF DEVELOPER'S DESIRED TECH |
|
Title: ANALYSIS OF DEVELOPER'S DESIRED TECH |
|
Title: ANALYSIS OF DEVELOPER'S JOB FACTORS |
|
Title: ANALYSIS OF DEVELOPER'S JOB SATISFACTION |
|
Title: ANALYSIS OF DEVELOPER'S WORK HOURS |
|
TECHNICAL CHALLENGES
Challenges | Mitigation Plan |
---|---|
|
|
|
|
PROJECT TIMELINE
COMMENTS
Feel free to leave us some comments on where we can improve!
No. | Name | Date | Comments |
---|---|---|---|
1. | Insert your name here | Insert date here | Insert comment here |
2. | Insert your name here | Insert date here | Insert comment here |
3. | Insert your name here | Insert date here | Insert comment here |