Difference between revisions of "Group03 proposal"
(30 intermediate revisions by 3 users not shown) | |||
Line 1: | Line 1: | ||
<br/> | <br/> | ||
− | + | [[File:Devbuzz.jpg|300px|frameless|center]] | |
<br/> | <br/> | ||
− | <!--Header--> | + | <!-- Header Start --> |
<p></p><br/> | <p></p><br/> | ||
<div style="width:100%; text-align:center;"> | <div style="width:100%; text-align:center;"> | ||
− | {|style="background-color:# | + | {|style="background-color:#fbd242; color:#fff; padding: 10 0 10 0;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0" | |
− | |||
− | |||
− | |||
| style="background:none;" width="1%" | | | style="background:none;" width="1%" | | ||
− | | style="padding:0.2em; font-size:100%; background-color:# | + | | style="padding:0.2em; font-size:100%; background-color:#fff069; border-bottom:0px solid #3D9DD7; text-align:center; color:#100c08" width="10%" | |
− | [[ | + | [[Group03_proposal |<font color="#100c08" size=3 face="Helvetica">Proposal</font>]] |
| style="background:none;" width="1%" | | | style="background:none;" width="1%" | | ||
− | | style="padding:0.2em; font-size:100%; background-color:# | + | | style="padding:0.2em; font-size:100%; background-color:#fbd242; border-bottom:0px solid #3D9DD7; text-align:center; color:#fff" width="10%" | |
− | [[ | + | [[Group 3_Poster|<font color="#100c08" size=3 face="Helvetica">Poster</font>]] |
| style="background:none;" width="1%" | | | style="background:none;" width="1%" | | ||
− | | style="padding:0.2em; font-size:100%; background-color:# | + | | style="padding:0.2em; font-size:100%; background-color:#fbd242; border-bottom:0px solid #3D9DD7; text-align:center; color:#fff" width="10%" | |
− | [[ | + | [[Group 3_Application|<font color="#100c08" size=3 face="Helvetica">Application</font>]] |
| style="background:none;" width="1%" | | | style="background:none;" width="1%" | | ||
− | | style="padding:0.2em; font-size:100%; background-color:# | + | | style="padding:0.2em; font-size:100%; background-color:#fbd242; border-bottom:0px solid #3D9DD7; text-align:center; color:#fff" width="10%" | |
− | [[ | + | [[Group 3_Research Paper|<font color="#100c08" size=3 face="Helvetica">Research Paper</font>]] |
|} | |} | ||
</div> | </div> | ||
+ | <!-- Header End --> | ||
+ | |||
+ | <!--Sub Header Start--> | ||
+ | {| style="background-color:white; text-transform: uppercase; font-size:14px; font-weight: bold; text-align:center; padding: 0px 0 0 0;" width="100%" height=50px cellspacing="0" cellpadding="0" valign="top" border="0" | | ||
+ | |||
+ | <!-- Current Page --> | ||
+ | | style="vertical-align:top;width:33%;" | <div style="padding: 3px; line-height: wrap_content; border-bottom:5px solid #000000;"><font color =#272d2d><b>Version 1</b></font> | ||
+ | |||
+ | | style="vertical-align:top;width:33%;" | <div style="padding: 3px; line-height: wrap_content; border-bottom:1px solid #999999;"> [[Group03_proposal Version 2 | <font color =#272d2d><b>Version 2</b>]] | ||
+ | |} | ||
− | |||
<br/> | <br/> | ||
<big> [[Project Groups|<--- Go Back to Project Groups]] </big> | <big> [[Project Groups|<--- Go Back to Project Groups]] </big> | ||
<br/><br/> | <br/><br/> | ||
− | ==<div style="background:# | + | ==<div style="background:#fbd242; padding: 15px; font-weight: bold; line-height: 0.3em; letter-spacing:0.5em;font-size:20px"><font face="Century Gothic"><center>PROBLEM & MOTIVATION</center></font></div>== |
<br/> | <br/> | ||
− | In recent years, there is massive growth in the software industry. In order to help prospective coders | + | In recent years, there is massive growth in the software industry. In order to help prospective coders, such as ourselves, to better manage their expectations and make more informed decisions, it is crucial for them to understand what is in demand and also what to expect from this booming industry. |
<br/> | <br/> | ||
<br/> | <br/> | ||
− | ==<div style="background:# | + | ==<div style="background:#fbd242; padding: 15px; font-weight: bold; line-height: 0.3em; letter-spacing:0.5em;font-size:20px"><font face="Century Gothic"><center>OBJECTIVE</center></font></div>== |
<br/> | <br/> | ||
For our project, we will be focusing on '''3 main objectives'''. It is as follows: | For our project, we will be focusing on '''3 main objectives'''. It is as follows: | ||
Line 44: | Line 50: | ||
<br/> | <br/> | ||
− | ==<div style="background:# | + | ==<div style="background:#fbd242; padding: 15px; font-weight: bold; line-height: 0.3em; letter-spacing:0.5em;font-size:20px"><font face="Century Gothic"><center>SELECTED DATASET</center></font></div>== |
We chose the StackOverflow Developer Survey 2019 dataset (at https://www.kaggle.com/mchirico/stack-overflow-developer-survey-results-2019), as StackOverflow is currently the largest online developer community. The dataset provided is freely accessible, and analysis of this dataset would provide a glimpse about the overall developer community. | We chose the StackOverflow Developer Survey 2019 dataset (at https://www.kaggle.com/mchirico/stack-overflow-developer-survey-results-2019), as StackOverflow is currently the largest online developer community. The dataset provided is freely accessible, and analysis of this dataset would provide a glimpse about the overall developer community. | ||
Line 105: | Line 111: | ||
|} | |} | ||
− | ==<div style="background:# | + | ==<div style="background:#fbd242; padding: 15px; font-weight: bold; line-height: 0.3em; letter-spacing:0.5em;font-size:20px"><font face="Century Gothic"><center>BACKGROUND SURVEY</center></font></div>== |
<br/> | <br/> | ||
{| class="wikitable" | {| class="wikitable" | ||
Line 137: | Line 143: | ||
** The X-axis can be confusing for the readers on what the 10x represents, clearer labels or axis title could have been used. | ** The X-axis can be confusing for the readers on what the 10x represents, clearer labels or axis title could have been used. | ||
|- | |- | ||
− | | [[File:RelatedWork4.png]] | + | | [[File:RelatedWork4.png]] https://hired.com/blog/candidates/data-reveals-hottest-coding-languages/ |
| | | | ||
This Proportional Symbol Map shows the hottest programing languages across different countries | This Proportional Symbol Map shows the hottest programing languages across different countries | ||
Line 148: | Line 154: | ||
<br/> | <br/> | ||
− | + | ||
<br/> | <br/> | ||
+ | ==<div style="background:#fbd242; padding: 15px; font-weight: bold; line-height: 0.3em; letter-spacing:0.5em;font-size:20px"><font face="Century Gothic"><center>PROPOSED STORYBOARD</center></font></div>== | ||
<br/> | <br/> | ||
− | == | + | |
+ | {| class="wikitable" style="background-color:#FFFFFF;" width="90%" | ||
+ | |- | ||
+ | ! style="font-weight: bold;background: #141414;color:#fbfcfd;width: 45%;" | Storyboard | ||
+ | ! style="font-weight: bold;background: #141414;color:#fbfcfd;width: 55%" | Insights / Comments | ||
+ | |- | ||
+ | | <center> | ||
+ | '''Title''': STORY 1 - OVERVIEW OF DEVELOPER DEMOGRAPHICS | ||
+ | [[File:Storyboard1.jpg|600px|frameless|center]] | ||
+ | </center> | ||
+ | || | ||
+ | * The first story aims to provide viewers with the overall demographic information about developers | ||
+ | * Top left is an age pyramid chart of the selected group of developers | ||
+ | * Top right is a bar chart showing what are the different profiles of developers (students, professionals, etc.), and whether they code as a hobby or not | ||
+ | * Bottom shows a proportional symbol map with dot sizes representing the number of developers in that country/region | ||
+ | ** Clicking on a dot or country on the map applies a filter that will update the other 2 charts based on the selected country | ||
+ | ** Hovering over a dot or country on the map shows a tooltip that describes the number of developers and the median salary of the selected country | ||
+ | |||
+ | |- | ||
+ | | <center> | ||
+ | '''Title''': STORY 2 - ANALYSIS OF DEVELOPER'S SALARIES | ||
+ | [[File:Storyboard2.jpg|600px|frameless|center]] | ||
+ | </center> | ||
+ | || | ||
+ | * The second story aims to tell viewers which types of developers have the highest median salaries. All charts shown are box plots that are sorted in descending order of salary | ||
+ | * Top left is a box plot comparing median salaries between different types of developers (full stack, database administrators, data scientist, etc.) | ||
+ | * Top right is a box plot comparing median salaries between developers of different educational background | ||
+ | * Bottom is a box plot comparing median salaries between developers who worked with which programming languages | ||
+ | ** Clicking on a box plot bar applies a filter that will update the other 2 charts based on the selection. It is possible to apply multiple filters | ||
+ | ** The sort order can be changed from top to bottom, and the number of box plots to show can be set by the user | ||
+ | |||
+ | |- | ||
+ | | <center> | ||
+ | '''Title''': STORY 3 - ANALYSIS OF DEVELOPER'S SKILLS | ||
+ | [[File:Storyboard3.jpg|600px|frameless|center]] | ||
+ | </center> | ||
+ | || | ||
+ | * The third story aims to give aspiring developers more insights based on the selected programming language, which can help them to decide what to use and learn. | ||
+ | * By selecting a particular language as a filter, they are able to see how many developers are using which database, platform, developer tools, and so on, each represented by a bar chart. | ||
+ | * The charts are sorted in descending order of count, so that users are able to see the most popular ones first | ||
+ | ** The sort order can be changed from top to bottom, and the number of box plots to show can be set by the user | ||
+ | |||
+ | |- | ||
+ | | <center> | ||
+ | '''Title''': STORY 4 - ANALYSIS OF DEVELOPER'S JOB SATISFACTION | ||
+ | [[File:Storyboard4.jpg|600px|frameless|center]] | ||
+ | </center> | ||
+ | || | ||
+ | * The last story aims to show viewers which developers have higher job satisfaction based on the selected category | ||
+ | * This is done using a divergent stacked bar chart, which is good for comparing between different categories for Likert data. The count of records in each category is displayed on the right of the chart | ||
+ | * There are many options for the categories, such as those mentioned in previous stories, as well as interesting ones such as whether the developer can work from home | ||
+ | ** Users can select the sort order and the category from the menu on the left | ||
+ | ** Users can also set the reference line for the divergent stacked bar chart to control how the chart is visualized. | ||
+ | |} | ||
+ | |||
<br/> | <br/> | ||
+ | ==<div style="background:#fbd242; padding: 15px; font-weight: bold; line-height: 0.3em; letter-spacing:0.5em;font-size:20px"><font face="Century Gothic"><center>TECHNICAL CHALLENGES</center></font></div>== | ||
<br/> | <br/> | ||
− | ==<div style="background:# | + | <center> |
+ | {| class="wikitable" style="background-color:#FFFFFF;" width="90%" | ||
+ | |- | ||
+ | ! style="font-weight: bold;background: #141414;color:#fbfcfd;width: 45%;" | Challenges | ||
+ | ! style="font-weight: bold;background: #141414;color:#fbfcfd;width: 55%" | Mitigation Plan | ||
+ | |- | ||
+ | | | ||
+ | * Unfamiliarity with R, R Shiny and Tableau | ||
+ | || | ||
+ | * Ask any seniors or friends who have taken any R-related courses to share their slides with us for references | ||
+ | * Watch video tutorials from YouTube | ||
+ | * Peer Learning | ||
+ | |- | ||
+ | | | ||
+ | * Unfamiliarity of data cleaning and transformation using R | ||
+ | || | ||
+ | * Read online articles and forums for guidance | ||
+ | * Watch video tutorials on how to fully utilise packages such as tidyr and dplyr | ||
+ | * Trial and error | ||
+ | |} | ||
+ | |||
+ | ==<div style="background:#fbd242; padding: 15px; font-weight: bold; line-height: 0.3em; letter-spacing:0.5em;font-size:20px"><font face="Century Gothic"><center>PROJECT TIMELINE</center></font></div>== | ||
+ | [[File:Photo 2020-03-01 18-02-12.jpg|800px|frameless|center]] | ||
+ | |||
+ | ==<div style="background:#fbd242; padding: 15px; font-weight: bold; line-height: 0.3em; letter-spacing:0.5em;font-size:20px"><font face="Century Gothic"><center>COMMENTS</center></font></div>== | ||
<br/> | <br/> | ||
Feel free to leave us some comments on where we can improve! | Feel free to leave us some comments on where we can improve! | ||
Line 162: | Line 248: | ||
{| class="wikitable" width="100%" | {| class="wikitable" width="100%" | ||
|- | |- | ||
− | ! style="font-weight: bold;background: # | + | ! style="font-weight: bold;background: #fff069;width: 5%;" |No. |
− | ! style="font-weight: bold;background: # | + | ! style="font-weight: bold;background: #fff069;width: 25%;" |Name |
− | ! style="font-weight: bold;background: # | + | ! style="font-weight: bold;background: #fff069;width: 20%;" |Date |
− | ! style="font-weight: bold;background: # | + | ! style="font-weight: bold;background: #fff069;width: 50%;" |Comments |
|- | |- | ||
| 1. | | 1. |
Latest revision as of 21:42, 12 April 2020
Version 1
PROBLEM & MOTIVATION
In recent years, the software industry has grown massively. To help prospective coders, such as ourselves, manage their expectations and make more informed decisions, it is crucial for them to understand what is in demand and what to expect from this booming industry.
OBJECTIVE
For our project, we will focus on 3 main objectives, as follows:
- Gain overall insights into developer demographics (the Stack Overflow community)
- Gain insights into job prospects for developers and their work culture. This would provide a glimpse into the software industry and allow users to understand the relationships among salary, work hours and the challenges faced by developers
- Understand the most popular/relevant programming languages, databases, frameworks and platforms. This will allow prospective coders to know which skills and knowledge give them the best advantage in the industry.
SELECTED DATASET
We chose the Stack Overflow Developer Survey 2019 dataset (at https://www.kaggle.com/mchirico/stack-overflow-developer-survey-results-2019), as Stack Overflow is currently the largest online developer community. The dataset is freely accessible, and analysing it would provide a glimpse of the overall developer community.
The dataset contains 88,883 survey responses, with each row corresponding to one respondent and each of the 85 columns corresponding to a survey question. Below is a quick summary of the data provided and their attributes, categorized by the 3 main objectives mentioned above.
Objective | Data Attributes | Data Provided |
---|---|---|
Background | Likert | |
Background | Numerical, Discrete | |
Background | Categorical | |
Background | Binary | |
Job prospects | Likert | |
Job prospects | Numerical, Continuous | |
Job prospects | Numerical, Discrete | |
Job prospects | Categorical | |
Skills | Categorical | |
BACKGROUND SURVEY
Reference of Other Visualization | Learning Points |
---|---|
(image) | This choropleth map shows the distribution of the number of professional software developers in Europe by country. |
(image) | This dashboard shows the demand for the top programming languages, frameworks and experience levels. |
(image) | This visualization shows the relationship between developer gender and developer role. The X-axis shows the ratio of men to women developers. |
(image: RelatedWork4.png) https://hired.com/blog/candidates/data-reveals-hottest-coding-languages/ | This proportional symbol map shows the hottest programming languages across different countries. |
PROPOSED STORYBOARD
Storyboard | Insights / Comments |
---|---|
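Title: STORY 1 - OVERVIEW OF DEVELOPER DEMOGRAPHICS | The first story gives viewers overall demographic information about developers. Top left is an age pyramid chart of the selected group of developers. Top right is a bar chart showing the different profiles of developers (students, professionals, etc.) and whether they code as a hobby. Bottom is a proportional symbol map with dot sizes representing the number of developers in each country/region. Clicking on a dot or country on the map applies a filter that updates the other 2 charts for the selected country. Hovering over a dot or country shows a tooltip with the number of developers and the median salary of that country. |
Title: STORY 2 - ANALYSIS OF DEVELOPER'S SALARIES | The second story tells viewers which types of developers have the highest median salaries. All charts are box plots sorted in descending order of salary. Top left compares salaries between different types of developers (full stack, database administrators, data scientists, etc.). Top right compares salaries between developers of different educational backgrounds. Bottom compares salaries between developers by the programming languages they have worked with. Clicking on a box plot applies a filter that updates the other 2 charts based on the selection, and multiple filters can be applied. The sort order can be changed, and the user can set the number of box plots to show. |
Title: STORY 3 - ANALYSIS OF DEVELOPER'S SKILLS | The third story gives aspiring developers more insights based on a selected programming language, which can help them decide what to use and learn. By selecting a particular language as a filter, they can see how many developers use each database, platform, developer tool, and so on, each represented by a bar chart. The charts are sorted in descending order of count, so that users see the most popular options first. The sort order can be changed, and the user can set the number of bars to show. |
Title: STORY 4 - ANALYSIS OF DEVELOPER'S JOB SATISFACTION | The last story shows viewers which developers have higher job satisfaction based on the selected category. This is done using a divergent stacked bar chart, which is well suited to comparing Likert data across categories. The count of records in each category is displayed on the right of the chart. There are many options for the categories, such as those mentioned in previous stories, as well as others such as whether the developer can work from home. Users can select the sort order and the category from the menu on the left, and can set the reference line of the chart to control how it is visualized. |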
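Story 4's divergent stacked bar chart lays out each Likert level as a segment around a zero line, and the proposal lets users move that reference line. The following is a minimal Python sketch of the layout computation, for illustration only; the project itself plans to use R, R Shiny and Tableau. The level names, the function `diverging_segments` and its `reference` parameter are assumptions made for this example and are not taken from the team's implementation.

```python
from collections import Counter

# Assumed five-point satisfaction scale, ordered from most negative to most positive.
LEVELS = [
    "Very dissatisfied",
    "Slightly dissatisfied",
    "Neither satisfied nor dissatisfied",
    "Slightly satisfied",
    "Very satisfied",
]


def diverging_segments(responses, levels=LEVELS, reference=None):
    """Return (level, start, end) percentage segments for one bar.

    `reference` is the position of the zero line measured in levels from
    the left; 2.5 on a five-point scale splits the neutral level in half.
    Responses that are not in `levels` are ignored.
    """
    if reference is None:
        reference = len(levels) / 2
    counts = Counter(responses)
    total = sum(counts[level] for level in levels)
    if total == 0:
        return []
    shares = [100.0 * counts[level] / total for level in levels]

    # Width drawn left of zero: all levels fully below the reference,
    # plus the fractional part of the level the reference falls inside.
    whole = int(reference)
    left = sum(shares[:whole])
    if whole < len(levels):
        left += (reference - whole) * shares[whole]

    segments = []
    start = -left
    for level, share in zip(levels, shares):
        segments.append((level, start, start + share))
        start += share
    return segments


if __name__ == "__main__":
    # Hypothetical responses for one category, e.g. one developer type.
    sample = (
        ["Very dissatisfied"] * 1
        + ["Slightly dissatisfied"] * 1
        + ["Neither satisfied nor dissatisfied"] * 2
        + ["Slightly satisfied"] * 4
        + ["Very satisfied"] * 2
    )
    for level, start, end in diverging_segments(sample):
        print(f"{level:38s} {start:7.1f} {end:7.1f}")
```

Each category's bar would be drawn from these start and end values, so changing `reference` shifts all bars consistently without altering the segment widths.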
TECHNICAL CHALLENGES
Challenges | Mitigation Plan |
---|---|
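Unfamiliarity with R, R Shiny and Tableau | Ask seniors or friends who have taken R-related courses to share their slides with us for reference. Watch video tutorials on YouTube. Peer learning. |
Unfamiliarity with data cleaning and transformation using R | Read online articles and forums for guidance. Watch video tutorials on how to fully utilise packages such as tidyr and dplyr. Trial and error. |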
PROJECT TIMELINE
COMMENTS
Feel free to leave us some comments on where we can improve!
No. | Name | Date | Comments |
---|---|---|---|
1. | Insert your name here | Insert date here | Insert comment here |
2. | Insert your name here | Insert date here | Insert comment here |
3. | Insert your name here | Insert date here | Insert comment here |