Difference between revisions of "1718t1is428T1"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
Line 2: Line 2:
 
{|style="background-color:#f48024;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0"  |
 
{|style="background-color:#f48024;" width="100%" cellspacing="0" cellpadding="0" valign="top" border="0"  |
 
| style="font-family:Century Gothic; font-size:85%; solid #f48024 ; border-bottom:0px solid #f48024 ; background:#f48024 ; text-align:center;" width="20%" |  
 
| style="font-family:Century Gothic; font-size:85%; solid #f48024 ; border-bottom:0px solid #f48024 ; background:#f48024 ; text-align:center;" width="20%" |  
[[IS480_Team_wiki: 2016T1 Airvolution | <font color="#000">Home</font>]]
+
[[IS480_Team_wiki: 2016T1 Airvolution | <font color="#fff">HOME</font>]]
 
| &nbsp;
 
| &nbsp;
  
Line 19: Line 19:
 
[[File:Stackoverflow.png|200px]]
 
[[File:Stackoverflow.png|200px]]
 
<div style="margin-top:-50px">
 
<div style="margin-top:-50px">
<p>
+
<div style="background: #f48024 ; margin-top: 40px; font-weight: bold; line-height: 0.3em;letter-spacing:-0.08em;font-size:20px"><font color=#f48024 face="Century Gothic">INTRODUCTION</font></div>
StackOverflow is the largest, most trusted online community for developers to learn, share ​their programming ​knowledge, and build their careers. This is conducted through questions and answers posted on a wide range of topics related to computer programming. Such a platform heavily depends on high-quality questions and answers as well as an engaged community. Thus, some measures to screen for quality includes using ‘badges’ for users, votes on post (upvote or downvote) and closing questions that have been answered before or are irrelevant. To sustain its popularity and success, the site needs to continually draw new knowledge from its users in the fast-paced software field and these said users must constantly be engaged to contribute. As such, StackOverflow can understand better the relationships of its users via network graphs in relation to particular topics tagged to each question. By understanding social interactions better, it can guide StackOverflow to make necessary changes to keep its user base continually engaged. For instance, if StackOverflow notices a significant group of users with high upvotes for posts but do not post frequently, it can think of how can they encourage such users to use the platform regularly.</p>
+
StackOverflow is the largest, most trusted online community for developers to learn, share​ ​their programming ​knowledge, and build their careers. This is conducted through questions and answers posted on a wide range of topics related to computer programming. Such a platform heavily depends on high quality questions and answers as well as an engaged community. Thus, some measures to screen for quality includes using ‘badges’ for users, votes on post (upvote or downvote) and closing questions that have been answered before or are irrelevant.  
 
</div>
 
</div>
<div style="background: #f48024 ; margin-top: 40px; font-weight: bold; line-height: 0.3em;letter-spacing:-0.08em;font-size:20px"><font color=#f48024 face="Century Gothic">PROBLEM & MOTIVATION</font></div>
+
<div style="background: #f48024 ; margin-top: 40px; font-weight: bold; line-height: 0.3em;letter-spacing:-0.08em;font-size:20px"><font color=#f48024 face="Century Gothic">PROBLEM</font></div>
<br/>
+
To sustain its popularity and success, the site needs to continually draw new knowledge from its users in the fast-paced software field. In addition, these said users must constantly be engaged to contribute to the site. As such, StackOverflow can better understand the relationships of its users via network graphs in relation to particular topics tagged to each question. By understanding social interactions better, it can guide StackOverflow to make necessary changes to keep its user base continually engaged. For instance, if StackOverflow notices a significant group of users with high upvotes for posts but do not post frequently, it can think of how can they encourage such users to use the platform regularly.
 
 
StackOverflow serves as an important information hub for software developers of diverse background to share questions and answers with one another. But for beginners, without knowing where to start or how to participate in this social network, would either find themselves overwhelmed or may find that their questions go unanswered. As such, we would like to create an application that serves to educate anyone on how to best gain or contribute back from and to StackOverflow or merely learn about the community through interactive visualizations.
 
 
<br />
 
<br />
 +
<div style="background: #f48024 ; margin-top: 40px; font-weight: bold; line-height: 0.3em;letter-spacing:-0.08em;font-size:20px"><font color=#f48024 face="Century Gothic">MOTIVATIONAL</font></div>
 +
Some of the general problems (questions) that we wish to address are namely:
 +
To sustain its popularity and success, the site needs to continually draw new knowledge from its users in the fast-paced software field. In addition, these said users must constantly be engaged to contribute to the site. As such, StackOverflow can better understand the relationships of its users via network graphs in relation to particular topics tagged to each question. By understanding social interactions better, it can guide StackOverflow to make necessary changes to keep its user base continually engaged. For instance, if StackOverflow notices a significant group of users with high upvotes for posts but do not post frequently, it can think of how can they encourage such users to use the platform regularly.
  
 +
<div style="background: #f48024 ; margin-top: 40px; font-weight: bold; line-height: 0.3em;letter-spacing:-0.08em;font-size:20px"><font color=#f48024 face="Century Gothic">OBJECTIVES</font></div>
 +
Objectives
 
Some of the general problems (questions) that we wish to address are namely:
 
Some of the general problems (questions) that we wish to address are namely:
# What questions are more likely to be answered and answered quickly.
+
# What questions are more likely to be answered and which questions are answered promptly?
# What are the popular subcommunities and who are the biggest contributors.
+
# What are the popular subcommunities and who are the biggest contributors?
# What is the distribution of experience levels within each community.
+
## What is the distribution of experience levels within each community
 
## Measured by badges
 
## Measured by badges
## Measured by osts
+
## Measured by posts
 
## Measured by upvoted posts (or comments)
 
## Measured by upvoted posts (or comments)
# Who are the influential members within each StackOverflow subcommunity.
+
## Measured by years of experience on StackOverflow
<br />
+
# Who are the influential members within each StackOverflow subcommunity?
  
 
And because we are aspiring data analytics practitioners, here are some specific problems (questions) that we wish to address.
 
And because we are aspiring data analytics practitioners, here are some specific problems (questions) that we wish to address.
<br />
+
What are the little known technologies that are used for data analytics?
# What are the little known technologies that are used for data analytics
+
What do influential members in Stackoverflow know? (What stuff do they answer on, perhaps we can build a knowledge graph for them)
# What do influential members in stackoverflow know? (What stuff do they answer on, perhaps we can build a knowledge graph for them)
+
 
 +
<div style="background: #f48024 ; margin-top: 40px; font-weight: bold; line-height: 0.3em;letter-spacing:-0.08em;font-size:20px"><font color=#f48024 face="Century Gothic">KEY TECHNICAL CHALLENGES</font></div>
 +
* Storing a extremely large network of StackOverflow posts, comments and users in a graph database and creating indexes for fast retrieval.
 +
* Using StackOverflow API to retrieve information on-demand and transform the data into insight on the visualization.
 +
* Learning how to use the relevant tools for graph analysis such as neo4j, vis.js, sigma.js and D3.js
  
 
<div style="background: #f48024 ; margin-top: 40px; font-weight: bold; line-height: 0.3em;letter-spacing:-0.08em;font-size:20px"><font color=#f48024 face="Century Gothic">PROJECT TIMELINE</font></div>
 
<div style="background: #f48024 ; margin-top: 40px; font-weight: bold; line-height: 0.3em;letter-spacing:-0.08em;font-size:20px"><font color=#f48024 face="Century Gothic">PROJECT TIMELINE</font></div>
 
[[File:Timeline.jpg|center|1200px]]
 
[[File:Timeline.jpg|center|1200px]]
 +
 +
<div style="background: #f48024 ; margin-top: 40px; font-weight: bold; line-height: 0.3em;letter-spacing:-0.08em;font-size:20px"><font color=#f48024 face="Century Gothic">REFERENCE</font></div>
 +
http://sotagtrends.com
 +
This web application plots the number of questions asked over time by tags on Stackoverflow. Examples of tags include “node.js”, “d3.js” and “python”. Multiple tags can be added by the user and the user can also toggle a relative comparison across the tags. Other interesting features this visualization has includes a moving reference line that slides along the X-axis, showing the exact number of questions asked for each tag.
 +
 +
https://insights.stackoverflow.com/survey/2017
 +
This report created by SO describe insights about SO users using data collected through their annual developer survey. Visualizations used here are simple but effective charts – bar charts, scatter plots and line charts – that describe intriguing phenomena such as preferred programming languages among ethnicities,  representation of women in technology roles and correlated technologies used by developers.

Revision as of 17:22, 12 October 2017

HOME

 

PROPOSAL

 

POSTER

 

APPLICATION

 

Stackoverflow.png

INTRODUCTION

StackOverflow is the largest, most trusted online community for developers to learn, share​ ​their programming ​knowledge, and build their careers. This is conducted through questions and answers posted on a wide range of topics related to computer programming. Such a platform heavily depends on high quality questions and answers as well as an engaged community. Thus, some measures to screen for quality includes using ‘badges’ for users, votes on post (upvote or downvote) and closing questions that have been answered before or are irrelevant.

PROBLEM

To sustain its popularity and success, the site needs to continually draw new knowledge from its users in the fast-paced software field. In addition, these said users must constantly be engaged to contribute to the site. As such, StackOverflow can better understand the relationships of its users via network graphs in relation to particular topics tagged to each question. By understanding social interactions better, it can guide StackOverflow to make necessary changes to keep its user base continually engaged. For instance, if StackOverflow notices a significant group of users with high upvotes for posts but do not post frequently, it can think of how can they encourage such users to use the platform regularly.

MOTIVATIONAL

Some of the general problems (questions) that we wish to address are namely: To sustain its popularity and success, the site needs to continually draw new knowledge from its users in the fast-paced software field. In addition, these said users must constantly be engaged to contribute to the site. As such, StackOverflow can better understand the relationships of its users via network graphs in relation to particular topics tagged to each question. By understanding social interactions better, it can guide StackOverflow to make necessary changes to keep its user base continually engaged. For instance, if StackOverflow notices a significant group of users with high upvotes for posts but do not post frequently, it can think of how can they encourage such users to use the platform regularly.

OBJECTIVES

Objectives Some of the general problems (questions) that we wish to address are namely:

  1. What questions are more likely to be answered and which questions are answered promptly?
  2. What are the popular subcommunities and who are the biggest contributors?
    1. What is the distribution of experience levels within each community
    2. Measured by badges
    3. Measured by posts
    4. Measured by upvoted posts (or comments)
    5. Measured by years of experience on StackOverflow
  3. Who are the influential members within each StackOverflow subcommunity?

And because we are aspiring data analytics practitioners, here are some specific problems (questions) that we wish to address. What are the little known technologies that are used for data analytics? What do influential members in Stackoverflow know? (What stuff do they answer on, perhaps we can build a knowledge graph for them)

KEY TECHNICAL CHALLENGES
  • Storing a extremely large network of StackOverflow posts, comments and users in a graph database and creating indexes for fast retrieval.
  • Using StackOverflow API to retrieve information on-demand and transform the data into insight on the visualization.
  • Learning how to use the relevant tools for graph analysis such as neo4j, vis.js, sigma.js and D3.js
PROJECT TIMELINE
Timeline.jpg
REFERENCE

http://sotagtrends.com This web application plots the number of questions asked over time by tags on Stackoverflow. Examples of tags include “node.js”, “d3.js” and “python”. Multiple tags can be added by the user and the user can also toggle a relative comparison across the tags. Other interesting features this visualization has includes a moving reference line that slides along the X-axis, showing the exact number of questions asked for each tag.

https://insights.stackoverflow.com/survey/2017 This report created by SO describe insights about SO users using data collected through their annual developer survey. Visualizations used here are simple but effective charts – bar charts, scatter plots and line charts – that describe intriguing phenomena such as preferred programming languages among ethnicities, representation of women in technology roles and correlated technologies used by developers.