Difference between revisions of "Group03 proposal"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
Line 43: Line 43:
 
==<div style="background:#f67c6c; padding: 15px; font-weight: bold; line-height: 0.3em; letter-spacing:0.5em;font-size:20px"><font color=#fff face="Century Gothic"><center>SELECTED DATASET</center></font></div>==
 
==<div style="background:#f67c6c; padding: 15px; font-weight: bold; line-height: 0.3em; letter-spacing:0.5em;font-size:20px"><font color=#fff face="Century Gothic"><center>SELECTED DATASET</center></font></div>==
  
The dataset is taken from the Stack Overflow Developer Survey in 2019 (at https://www.kaggle.com/mchirico/stack-overflow-developer-survey-results-2019), with 88,883 respondents in total. Each row represents one respondent, containing 85 different columns representing the survey responses. Below is the quick summary about the data provided and their attributes, categorized by each of our 3 main objectives as mentioned above.
+
We chose the StackOverflow Developer Survey 2019 dataset (at https://www.kaggle.com/mchirico/stack-overflow-developer-survey-results-2019), as StackOverflow is currently the largest online developer community. The dataset provided is freely accessible, and analysis of this dataset would provide a glimpse about the overall developer community.
 +
 
 +
The dataset contains 88,883 survey responses, with each row corresponding to one respondent, and each of the 85 different columns corresponding to the survey questions. Below is a quick summary about the data provided and their attributes, categorized by each of our 3 main objectives as mentioned above.
  
 
{| class="wikitable"
 
{| class="wikitable"

Revision as of 15:55, 24 February 2020


Insert Logo


Team

 

Proposal

 

Poster

 

Application

 

Research Paper


<--- Go Back to Project Groups

PROBLEM & MOTIVATION




OBJECTIVE




SELECTED DATASET

We chose the StackOverflow Developer Survey 2019 dataset (at https://www.kaggle.com/mchirico/stack-overflow-developer-survey-results-2019), as StackOverflow is currently the largest online developer community. The dataset provided is freely accessible, and analysis of this dataset would provide a glimpse about the overall developer community.

The dataset contains 88,883 survey responses, with each row corresponding to one respondent, and each of the 85 different columns corresponding to the survey questions. Below is a quick summary about the data provided and their attributes, categorized by each of our 3 main objectives as mentioned above.

Data Attributes Data Provided
Background Likert Extent of considering oneself as a stack overflow member
Numerical, Discrete Age
Categorical Gender, Ethnicity, Profession, Education, Frequency and Purpose of using StackOverflow
Binary Coding for hobby, Have dependents
Job prospects Likert Job satisfaction, Job competence
Numerical, Continuous Salary
Numerical, Discrete Hours worked a week, hours spent on code review
Categorical Work structure, work challenges, working remotely, code review
Skills Categorical Language, database, platform and web framework worked with, Developer tools used, Operating system

BACKGROUND SURVEY



BRAINSTORMING SESSION



PROPOSED STORYBOARD



COMMENTS


Feel free to leave us some comments on where we can improve!

No. Name Date Comments
1. Insert your name here Insert date here Insert comment here
2. Insert your name here Insert date here Insert comment here
3. Insert your name here Insert date here Insert comment here